; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004607 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004607
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationChr08:18810499..18811740
RNA-Seq ExpressionHG10004607
SyntenyHG10004607
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-20388.16Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        VIEDGNEDGGAVFPTSTQGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGLGSV
        ENN EARVQ+P+GKQVLQ  +QVEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESLRRRMEQIAAVQGLGSV
Subjt:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGLGSV

Query:  SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDW
        SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLGEDW
Subjt:  SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDW

Query:  PLLLEKICMRAFEE
        PLLLEKI MRAF E
Subjt:  PLLLEKICMRAFEE

XP_004138880.1 uncharacterized protein LOC101213741 [Cucumis sativus]6.3e-20086.4Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAG PKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        ++EDGNEDGGAVFPTSTQ IP WSNG    SPRKCR+GIRDRKLKDRPS+LGPNGKVEC SH SA          +DNGDAT CDYKRPVQ+LQG+AE P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENN E RV QPSGKQ LQNKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSAS GGARKTRPVD GGDFS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEK L+HKQQ QGKVINGMLPNNQLHGRHSNG+ EV HEHRLQCS+SLLDFKVAMELNPTQ
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKICMR F E
Subjt:  LGEDWPLLLEKICMRAFEE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]2.7e-20387.83Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAG PKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        ++EDGNEDGGAVFPTSTQ IP WSNG    SPRKCR+GIRDRKLKDRPSVLGPNGKVEC SH SA          +DNGDAT CDYKRPVQHLQGVAE P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENN E RV QPSGKQVL NKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSASTGG  KTRPVD GGDFSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEK LAHKQQ+QGKVINGMLPNNQLHGRHSNGN EV HEHRLQCS+SLLDFKVAMELNPTQ
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKICMRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]5.5e-20488.41Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        VIEDGNEDGGAVFPTSTQGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGLGSV
        ENN EARVQ+PSGKQVLQ  +QVEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESLRRRMEQIAAVQGLGSV
Subjt:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGLGSV

Query:  SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDW
        SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLGEDW
Subjt:  SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDW

Query:  PLLLEKICMRAFEE
        PLLLEKI MRAF E
Subjt:  PLLLEKICMRAFEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]1.6e-20387.53Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        VIEDGNEDGGAVFPTSTQGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGL
        ENN EARVQ+PSGKQVLQ +++   VEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNTEARVQQPSGKQVLQNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLG
        GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMRAFEE
        EDWPLLLEKI MRAF E
Subjt:  EDWPLLLEKICMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein3.1e-20086.4Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAG PKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        ++EDGNEDGGAVFPTSTQ IP WSNG    SPRKCR+GIRDRKLKDRPS+LGPNGKVEC SH SA          +DNGDAT CDYKRPVQ+LQG+AE P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENN E RV QPSGKQ LQNKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSAS GGARKTRPVD GGDFS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEK L+HKQQ QGKVINGMLPNNQLHGRHSNG+ EV HEHRLQCS+SLLDFKVAMELNPTQ
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKICMR F E
Subjt:  LGEDWPLLLEKICMRAFEE

A0A1S3BCQ5 uncharacterized protein LOC1034882311.3e-20387.83Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAG PKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        ++EDGNEDGGAVFPTSTQ IP WSNG    SPRKCR+GIRDRKLKDRPSVLGPNGKVEC SH SA          +DNGDAT CDYKRPVQHLQGVAE P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENN E RV QPSGKQVL NKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSASTGG  KTRPVD GGDFSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEK LAHKQQ+QGKVINGMLPNNQLHGRHSNGN EV HEHRLQCS+SLLDFKVAMELNPTQ
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKICMRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

A0A5A7VF96 SAGA-Tad1 domain-containing protein1.3e-20387.83Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAG PKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        ++EDGNEDGGAVFPTSTQ IP WSNG    SPRKCR+GIRDRKLKDRPSVLGPNGKVEC SH SA          +DNGDAT CDYKRPVQHLQGVAE P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENN E RV QPSGKQVL NKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSASTGG  KTRPVD GGDFSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNTEARVQQPSGKQVLQNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEK LAHKQQ+QGKVINGMLPNNQLHGRHSNGN EV HEHRLQCS+SLLDFKVAMELNPTQ
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKICMRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

A0A6J1HF85 uncharacterized protein LOC1114630002.7e-20488.41Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        VIEDGNEDGGAVFPTSTQGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGLGSV
        ENN EARVQ+PSGKQVLQ  +QVEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESLRRRMEQIAAVQGLGSV
Subjt:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGLGSV

Query:  SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDW
        SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLGEDW
Subjt:  SADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDW

Query:  PLLLEKICMRAFEE
        PLLLEKI MRAF E
Subjt:  PLLLEKICMRAFEE

A0A6J1K7Q1 uncharacterized protein LOC1114924146.8e-20086.33Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
        VIEDGNEDGGAVF TSTQGIP WSN GF  SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E P
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGL
        ENN EARVQ+PSGKQVLQ +++   VEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNTEARVQQPSGKQVLQNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLG
        GSVSADCANILNKVLDVYLKQLIRSCVDLVG WP FEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGN EV HE RL CS+SLLDFKVAMELNP QLG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMRAFEE
        EDWPLLLEKI MRAF E
Subjt:  EDWPLLLEKICMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein4.3e-3732.36Show/hide
Query:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L++SILKNA  AK+ PP      PK S       PV      
Subjt:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNE

Query:  DGGAVFPTSTQGIPWSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARV
                           FP SPRKCR+    RK +DRPS LGP GK                                           P++ T    
Subjt:  DGGAVFPTSTQGIPWSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARV

Query:  QQPSGKQVL-QNKIQVEDREEAGQSNRLSVL--RSGLLAPLGIPFCSASTGGARKTRPVDFGG--DFSFSDIGHLLDTESLRRRMEQIAAVQGLGSVSAD
        +  S  Q L    + VED EE  Q      +  RS L APLG+ F   S     K R   + G    +    G L D  +LR R+E+   ++G+  +S D
Subjt:  QQPSGKQVL-QNKIQVEDREEAGQSNRLSVL--RSGLLAPLGIPFCSASTGGARKTRPVDFGG--DFSFSDIGHLLDTESLRRRMEQIAAVQGLGSVSAD

Query:  CANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLL
         AN+LN+ L+ Y+++LI  C+             SLA +Q+                              R   +VS+LDF  AME+NP  LGE+WP+ 
Subjt:  CANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLL

Query:  LEKICMRAFEE
        LEKIC RA EE
Subjt:  LEKICMRAFEE

AT2G24530.1 unknown protein1.4e-9348.22Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG     +  A    
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
           DG E  G + P  +Q  P WSNG  P SPRK R+G+++RK +DRPS LG NGKVE   HQ   +ED    + ++NG     DY+R  ++   VA++ 
Subjt:  VIEDGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNK-----IQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGD-FSFSDIGHLLDTESLRRRMEQIAAV
        +      V++P     + NK     + + D +   +  R+++  S L+APLGIPFCSAS GG+ +T PV    +  S  D G L D E LR+RME IA  
Subjt:  ENNTEARVQQPSGKQVLQNK-----IQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGD-FSFSDIGHLLDTESLRRRMEQIAAV

Query:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNP
        QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L  +  NG+ ++  +H    SVS+LDF+ AMELNP
Subjt:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNP

Query:  TQLGEDWPLLLEKICMRAFEE
         QLGEDWP L E+I +R+FEE
Subjt:  TQLGEDWPLLLEKICMRAFEE

AT4G31440.1 unknown protein2.7e-7645.24Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISP

Query:  VIEDGNEDGGAVFPTSTQG-IPWSNGGFPASPRKCRTG-IRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQ
          EDG E+  ++ P   +  +  SNG       K R G   DR ++D+P  LG NGKV                         P  Y RP ++       
Subjt:  VIEDGNEDGGAVFPTSTQG-IPWSNGGFPASPRKCRTG-IRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQ

Query:  PENNTEARVQQPSGKQVLQNKIQVE---DREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVD-FGGDFSFSDIGHLLDTESLRRRMEQIAAVQ
        P+    A +  P+ ++ +  K QV     R++  Q   LS     ++APLGIPFCSAS GG R+T PV       S  D G L DTE LR+RME IA  Q
Subjt:  PENNTEARVQQPSGKQVLQNKIQVE---DREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVD-FGGDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPT
        GLG VSA+C+ +LN +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN  H + SN   ++  E   Q SVSLLDF+VAMELNP 
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPT

Query:  QLGEDWPLLLEKICMRAFEE
        QLGEDWPLL E+I +  FEE
Subjt:  QLGEDWPLLLEKICMRAFEE

AT4G33890.1 unknown protein9.5e-3731.03Show/hide
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIE

Query:  DGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
         GN D       ++Q  P   +  F  S RKCR+    RKL+DRPS LGP GK   +   + +S  K   +  ++            RP   +  V E  
Subjt:  DGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLRRRMEQIAAVQ
        E      V+Q +G                G  +  S  R  L APLG+   S   G  RK+         SF+     + G L DT +LR R+E+   ++
Subjt:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ
        GL  ++ D  ++LN  LDV++++LI  C+ L       +  + +                           N +   + R    VS+ DF+  MELN   
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWP+ +EKIC RA ++
Subjt:  LGEDWPLLLEKICMRAFEE

AT4G33890.2 unknown protein9.5e-3731.03Show/hide
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIE

Query:  DGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP
         GN D       ++Q  P   +  F  S RKCR+    RKL+DRPS LGP GK   +   + +S  K   +  ++            RP   +  V E  
Subjt:  DGNEDGGAVFPTSTQGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQP

Query:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLRRRMEQIAAVQ
        E      V+Q +G                G  +  S  R  L APLG+   S   G  RK+         SF+     + G L DT +LR R+E+   ++
Subjt:  ENNTEARVQQPSGKQVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ
        GL  ++ D  ++LN  LDV++++LI  C+ L       +  + +                           N +   + R    VS+ DF+  MELN   
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWP+ +EKIC RA ++
Subjt:  LGEDWPLLLEKICMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGAGAAAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTAAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGCTGTCCGAAAACTTCGACGCAATCTGCAAAAATTTCCCCTGTCATAGAAGATGGGAATGAGGATGGTGGA
GCTGTTTTTCCTACTTCCACTCAAGGTATTCCTTGGTCCAATGGAGGTTTTCCGGCGTCCCCGAGAAAGTGCAGGACTGGGATACGCGACCGCAAACTCAAAGACAGGCC
TAGTGTACTAGGACCAAATGGGAAGGTTGAATGTAACTCACATCAATCAGCAGGCAAGGAAGACGGCAGCTGTAGAATCATCGTGGATAATGGTGATGCAACACCGTGTG
ACTATAAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACAACCTGAAAACAATACTGAGGCTAGAGTTCAGCAACCATCAGGAAAGCAAGTCTTACAAAATAAGATC
CAGGTTGAAGACAGGGAAGAAGCGGGACAGTCAAATCGCTTGAGTGTACTTCGGAGTGGATTACTTGCACCTCTCGGGATTCCTTTTTGCTCAGCCAGTACAGGCGGGGC
CCGCAAAACGAGGCCGGTGGATTTTGGGGGAGATTTTAGCTTCAGTGATATTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCAGTAC
AGGGCCTGGGCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGCTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCT
GCATTTGAGCCTGAGAAATCCCTTGCCCATAAGCAGCAGGTTCAGGGGAAGGTTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGACATAGCAACGGAAATGG
AGAAGTTGAACACGAGCACAGATTACAGTGCTCGGTATCATTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACAACTTGGGGAAGACTGGCCTTTGCTACTGG
AGAAAATTTGTATGCGAGCATTCGAGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGAGAAAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTAAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGCTGTCCGAAAACTTCGACGCAATCTGCAAAAATTTCCCCTGTCATAGAAGATGGGAATGAGGATGGTGGA
GCTGTTTTTCCTACTTCCACTCAAGGTATTCCTTGGTCCAATGGAGGTTTTCCGGCGTCCCCGAGAAAGTGCAGGACTGGGATACGCGACCGCAAACTCAAAGACAGGCC
TAGTGTACTAGGACCAAATGGGAAGGTTGAATGTAACTCACATCAATCAGCAGGCAAGGAAGACGGCAGCTGTAGAATCATCGTGGATAATGGTGATGCAACACCGTGTG
ACTATAAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACAACCTGAAAACAATACTGAGGCTAGAGTTCAGCAACCATCAGGAAAGCAAGTCTTACAAAATAAGATC
CAGGTTGAAGACAGGGAAGAAGCGGGACAGTCAAATCGCTTGAGTGTACTTCGGAGTGGATTACTTGCACCTCTCGGGATTCCTTTTTGCTCAGCCAGTACAGGCGGGGC
CCGCAAAACGAGGCCGGTGGATTTTGGGGGAGATTTTAGCTTCAGTGATATTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCAGTAC
AGGGCCTGGGCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGCTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCT
GCATTTGAGCCTGAGAAATCCCTTGCCCATAAGCAGCAGGTTCAGGGGAAGGTTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGACATAGCAACGGAAATGG
AGAAGTTGAACACGAGCACAGATTACAGTGCTCGGTATCATTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACAACTTGGGGAAGACTGGCCTTTGCTACTGG
AGAAAATTTGTATGCGAGCATTCGAGGAATGA
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGG
AVFPTSTQGIPWSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVLQNKI
QVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWP
AFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE