; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006883 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006883
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein indeterminate-domain 5, chloroplastic
Genome locationChr07:22937323..22939421
RNA-Seq ExpressionHG10006883
SyntenyHG10006883
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR022755 - Zinc finger, double-stranded RNA binding
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140400.1 protein indeterminate-domain 5, chloroplastic isoform X1 [Cucumis sativus]1.1e-24981.31Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG NSNVGLTLSQVPQ+SSLQDH+NI QSPHDVL
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF
        RLGGGRTGQFTHLLPPSI SSFRPPPQQAMPSSNA F  LSDQTNQNSFHEDHHQSQSQQGLFGNK FHGLMQFPSDIQTHA   NNNNNSASNLFNL F
Subjt:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF

Query:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA
        ISNPT                                                               AAVPSLYS  APGGCSSGTS GG IPHMSATA
Subjt:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA

Query:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK
        LLQKAAQLGSTTSSSNTTATLLRTFGSSS+S GKASDRTLFPPSYGGVVFGENESNLQDLMNSFA  SSGSGMFG    SFG         +ESLEDPTK
Subjt:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK

Query:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ
        LQQNLSTVSMG GTDRLTRDFLGVGQIVRSMS  GGGGGGGY+QREHKQ  QGIV+EGNESNTAPSS AFGGGNGNYQ
Subjt:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ

XP_008460216.1 PREDICTED: protein indeterminate-domain 5, chloroplastic [Cucumis melo]7.2e-24981.14Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG NSNVGLTLSQVPQ+SSLQDH+NI QSPHDVL
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF
        RLGGGRTGQFTHLLPPSI SSFRPPPQQAMPSSNA F  LSDQTNQNSFHEDHHQSQSQQGLFGNK FHGLMQFPSDIQTHA   NNN+NSASNLFNL F
Subjt:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF

Query:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA
        ISNPT                                                               AAVPSLYS  APGGCSSGTS GG IPHMSATA
Subjt:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA

Query:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK
        LLQKAAQLGSTTSSSNTTATLLRTFGSSS+S GKASDRTLFPPSYGGVVF ENESNLQDLMNSFA  SSGSGMFG    SFG         +ESLEDPTK
Subjt:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK

Query:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ
        LQQNLSTVSMG GTDRLTRDFLGVGQIVRSMS  GGGGGGGYSQREHKQ  QGIV+EGNESNTAPSS AFGGGNGNYQ
Subjt:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ

XP_031741807.1 protein indeterminate-domain 5, chloroplastic isoform X2 [Cucumis sativus]5.0e-24281.03Show/hide
Query:  MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH
        MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH
Subjt:  MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH

Query:  SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVLRLGGGRTGQFTHLL
        SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG NSNVGLTLSQVPQ+SSLQDH+NI QSPHDVLRLGGGRTGQFTHLL
Subjt:  SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVLRLGGGRTGQFTHLL

Query:  PPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGFISNPT---------
        PPSI SSFRPPPQQAMPSSNA F  LSDQTNQNSFHEDHHQSQSQQGLFGNK FHGLMQFPSDIQTHA   NNNNNSASNLFNL FISNPT         
Subjt:  PPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGFISNPT---------

Query:  ------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATALLQKAAQLGSTTSS
                                                              AAVPSLYS  APGGCSSGTS GG IPHMSATALLQKAAQLGSTTSS
Subjt:  ------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATALLQKAAQLGSTTSS

Query:  SNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGT
        SNTTATLLRTFGSSS+S GKASDRTLFPPSYGGVVFGENESNLQDLMNSFA  SSGSGMFG    SFG         +ESLEDPTKLQQNLSTVSMG GT
Subjt:  SNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGT

Query:  DRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ
        DRLTRDFLGVGQIVRSMS  GGGGGGGY+QREHKQ  QGIV+EGNESNTAPSS AFGGGNGNYQ
Subjt:  DRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ

XP_038877219.1 protein indeterminate-domain 5, chloroplastic isoform X1 [Benincasa hispida]2.8e-25381.93Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG NSNVGLTL++VPQISSLQDH+NI QSPHDVL
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF
        RLGGGRTGQFTHLLPPSI SSFRPPPQQAMPSSNA F  LSD TNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHA  NNNNNN ASNLFNLGF
Subjt:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF

Query:  ISNP-----------------------------------------------------------------TAAVPSLYSTAAPGGCSSGTS--GGPIPHMS
        ISNP                                                                 +AAVPSLYSTAAPGGCSSGTS  GG IPHMS
Subjt:  ISNP-----------------------------------------------------------------TAAVPSLYSTAAPGGCSSGTS--GGPIPHMS

Query:  ATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLED
        ATALLQKAAQLGSTTSSSNTTATLLRTFGSSS+S GKASDRTLFPPSYGGVVFGENESNLQDLMNSF +GS GSGMFGSGMSSFG         +ESL+D
Subjt:  ATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLED

Query:  PTKLQQNLSTVSM-GAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNY
        PTKLQQNLSTVSM G GTDRLTRDFLGVGQIVRSMS  G GGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNY
Subjt:  PTKLQQNLSTVSM-GAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNY

XP_038877220.1 protein indeterminate-domain 5, chloroplastic isoform X2 [Benincasa hispida]1.3e-24581.66Show/hide
Query:  MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH
        MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH
Subjt:  MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH

Query:  SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVLRLGGGRTGQFTHLL
        SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG NSNVGLTL++VPQISSLQDH+NI QSPHDVLRLGGGRTGQFTHLL
Subjt:  SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVLRLGGGRTGQFTHLL

Query:  PPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGFISNP----------
        PPSI SSFRPPPQQAMPSSNA F  LSD TNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHA  NNNNNN ASNLFNLGFISNP          
Subjt:  PPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGFISNP----------

Query:  -------------------------------------------------------TAAVPSLYSTAAPGGCSSGTS--GGPIPHMSATALLQKAAQLGST
                                                               +AAVPSLYSTAAPGGCSSGTS  GG IPHMSATALLQKAAQLGST
Subjt:  -------------------------------------------------------TAAVPSLYSTAAPGGCSSGTS--GGPIPHMSATALLQKAAQLGST

Query:  TSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSM-
        TSSSNTTATLLRTFGSSS+S GKASDRTLFPPSYGGVVFGENESNLQDLMNSF +GS GSGMFGSGMSSFG         +ESL+DPTKLQQNLSTVSM 
Subjt:  TSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSM-

Query:  GAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNY
        G GTDRLTRDFLGVGQIVRSMS  G GGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNY
Subjt:  GAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNY

TrEMBL top hitse value%identityAlignment
A0A0A0KMB9 C2H2-type domain-containing protein5.3e-25081.31Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG NSNVGLTLSQVPQ+SSLQDH+NI QSPHDVL
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF
        RLGGGRTGQFTHLLPPSI SSFRPPPQQAMPSSNA F  LSDQTNQNSFHEDHHQSQSQQGLFGNK FHGLMQFPSDIQTHA   NNNNNSASNLFNL F
Subjt:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF

Query:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA
        ISNPT                                                               AAVPSLYS  APGGCSSGTS GG IPHMSATA
Subjt:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA

Query:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK
        LLQKAAQLGSTTSSSNTTATLLRTFGSSS+S GKASDRTLFPPSYGGVVFGENESNLQDLMNSFA  SSGSGMFG    SFG         +ESLEDPTK
Subjt:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK

Query:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ
        LQQNLSTVSMG GTDRLTRDFLGVGQIVRSMS  GGGGGGGY+QREHKQ  QGIV+EGNESNTAPSS AFGGGNGNYQ
Subjt:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ

A0A1S3CCG7 protein indeterminate-domain 5, chloroplastic3.5e-24981.14Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG NSNVGLTLSQVPQ+SSLQDH+NI QSPHDVL
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF
        RLGGGRTGQFTHLLPPSI SSFRPPPQQAMPSSNA F  LSDQTNQNSFHEDHHQSQSQQGLFGNK FHGLMQFPSDIQTHA   NNN+NSASNLFNL F
Subjt:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFF--LSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGF

Query:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA
        ISNPT                                                               AAVPSLYS  APGGCSSGTS GG IPHMSATA
Subjt:  ISNPT---------------------------------------------------------------AAVPSLYSTAAPGGCSSGTS-GGPIPHMSATA

Query:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK
        LLQKAAQLGSTTSSSNTTATLLRTFGSSS+S GKASDRTLFPPSYGGVVF ENESNLQDLMNSFA  SSGSGMFG    SFG         +ESLEDPTK
Subjt:  LLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTK

Query:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ
        LQQNLSTVSMG GTDRLTRDFLGVGQIVRSMS  GGGGGGGYSQREHKQ  QGIV+EGNESNTAPSS AFGGGNGNYQ
Subjt:  LQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ

A0A6J1E1A5 protein indeterminate-domain 5, chloroplastic-like2.0e-22577.02Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEP+CVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGAN-SNVGLTLSQVPQISSLQDHTNIAQSPHDV
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLG AIGSHLYG N +NVGLTLSQVPQ+SSLQDH NI+QS HDV
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGAN-SNVGLTLSQVPQISSLQDHTNIAQSPHDV

Query:  LRLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSN---AFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNL
        LRLGG R GQF+HLLPPSI SSFR PPQQAMPSS+   AFFL+DQTNQNSFHED HQSQSQQGLFGNK FHGLMQFPSDIQ+H SSNNN   +A+NLFNL
Subjt:  LRLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSN---AFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNL

Query:  GFISNPT-----------------------------------------------------AAVPSLYSTAAPGGCS-SGTSGGPIPHMSATALLQKAAQL
        GFISNPT                                                      AVPSLYS A       +G SGG +PHMSATALLQKAAQL
Subjt:  GFISNPT-----------------------------------------------------AAVPSLYSTAAPGGCS-SGTSGGPIPHMSATALLQKAAQL

Query:  GSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMF--GSGMSSFGGFE--GSNRSNMESLEDPTKLQQN
        GSTTSSSNTTATLLRTFGSSSSSGGK SDR LFPPSYGG VFGENE+NLQDLMNSFA+G S SG+F  G+GM+SFGGF+  G+  +NME+LEDP KLQQN
Subjt:  GSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMF--GSGMSSFGGFE--GSNRSNMESLEDPTKLQQN

Query:  LSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGN
        L+ VSMG GTDRLTRDFLGVGQIVRSMS GGGGGGGG     HKQ  QGIVL+ +ESN+APSSQAFGGGN
Subjt:  LSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGN

A0A6J1G1A8 protein indeterminate-domain 5, chloroplastic-like5.0e-22476.4Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKS KEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHP NLG A+G HLYG NSNVGLTLSQVPQ+SSLQD  NI QS  DVL
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RL-GGGRTGQFTHLLPPSIASSFRPPPQQAMPSS---NAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNL
        RL GGGRTGQF HLLPPSI SSFRPPPQQAMPSS    AFFL+DQT+QNSFHED H SQSQQGLFGNKAFHGLMQF SD+Q+H S   N+NN +SNLFNL
Subjt:  RL-GGGRTGQFTHLLPPSIASSFRPPPQQAMPSS---NAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNL

Query:  GFISNPT-----------------------------------------------------------------AAVPSLYS-TAAPGGCSSGTSGGPIPHM
        GFISNPT                                                                  AVPSLYS T   GG  SGTSG  I HM
Subjt:  GFISNPT-----------------------------------------------------------------AAVPSLYS-TAAPGGCSSGTSGGPIPHM

Query:  SATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNR--SNMES
        SATALLQKAAQLGSTTSSSNTTATLLR+FGSSS+SGGKASDRTLFPPSYGG VFGENESNLQDLMNSF TGSS  G+FG GMSSFG F+G N   +NME+
Subjt:  SATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNR--SNMES

Query:  LEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMS----GGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAF-GGGNGNYQ
        LEDP KLQQNLS+VSMG GTDRLTRDFLGVGQIVRSMS    GGGG GGGGYSQREHKQ  QGIV+EGNESN+A SSQAF GGGNGNYQ
Subjt:  LEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMS----GGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAF-GGGNGNYQ

A0A6J1HPQ0 protein indeterminate-domain 5, chloroplastic-like1.3e-22476.4Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKS KEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHP NLG A+G HLYG NSNVGLTLSQVPQISSLQD  NI QS  DVL
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RL-GGGRTGQFTHLLPPSIASSFRPPPQQAMPSS---NAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNL
        RL GGGRTGQF HLLPPSI SSFRPPPQQAMPSS    AFFL+DQT+QNSFHED H SQSQQGLFGNKAFHGLMQF SD+Q+H S   N+NN +SNLFNL
Subjt:  RL-GGGRTGQFTHLLPPSIASSFRPPPQQAMPSS---NAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNL

Query:  GFISNPT-------------------------------------------------------------------AAVPSLYS-TAAPGGCSSGTSGGPIP
        GFISNPT                                                                    AVPSLYS T   GG  SGTSG  IP
Subjt:  GFISNPT-------------------------------------------------------------------AAVPSLYS-TAAPGGCSSGTSGGPIP

Query:  HMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNR--SNM
        HMSATALLQKAAQLGSTTSSSNTTATLLR+FGSSS+SGGKASDRTLFPPSYGG VFGENESNLQDLMNSF TGSS  G+FG GMSSFG F+G N   +NM
Subjt:  HMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNR--SNM

Query:  ESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMS--GGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAF-GGGNGNYQ
        E+LEDP KLQQNLS+VSMG GTDRLTRDFLGVGQIVRSMS  GGGGG GGGYSQREHKQ  QGIV++GNESN+A +SQAF GGGNGNYQ
Subjt:  ESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMS--GGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAF-GGGNGNYQ

SwissProt top hitse value%identityAlignment
Q8GYC1 Protein indeterminate-domain 4, chloroplastic1.3e-9948.85Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEV+ALSPKTLMATNRFIC+VCNKGFQREQNLQLHRRGHNLPWKLKQKSTKE KRKVYLCPEPTCVHHDPSRALGDLTGIKKHY RKHGEKKWKC+K
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVG-------LTLSQVPQISSLQDHTNIA
        CSKRYAVQSDWKAHSKTCGT+EYRCDCGT+FSRRDS+ITHRAFCDAL QE+AR+P        + +  A+S VG       L          L DH N  
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVG-------LTLSQVPQISSLQDHTNIA

Query:  QSPHDVLRLGGGRTGQFTHLLPPSIASSFRPP---PQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSA
         +P     L              +IASS       PQ + P+      S Q   N+   +++QS   Q        HGL+QF      +  S+  NN   
Subjt:  QSPHDVLRLGGGRTGQFTHLLPPSIASSFRPP---PQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSA

Query:  SNLFNLGFI----SNPTAAVPSLYST-AAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGG--VV
         + FNLGF      N   ++PSLYST           + G   ++SATALLQKA Q+GS T  SN  + L R   SSS+S    ++       +GG  ++
Subjt:  SNLFNLGFI----SNPTAAVPSLYST-AAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGG--VV

Query:  FGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQ
          +N  NLQ LMNS A  + G    GSG S F    G N  NM                   +G+D+LT DFLGVG +VR+++ GGGGGG G        
Subjt:  FGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQ

Query:  TAQGIVLEGNESNTAPSSQAFGGG
        +A+G V    E+     +  FG G
Subjt:  TAQGIVLEGNESNTAPSSQAFGGG

Q8RWX7 Protein indeterminate-domain 6, chloroplastic1.6e-8645.76Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKT+MATNRF+CEVCNKGFQREQNLQLHRRGHNLPWKLKQKS KE +RKVYLCPEP+CVHHDP+RALGDLTGIKKHY RKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGT+EYRCDCGT+FSRRDS+ITHRAFCDAL QESAR+     P +      A    G         SS   H +   +P+   
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFR---PPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLG
          G        + L  S +  F    P      P    F +    NQ         +Q+ Q L  +   HGL+            NNNNN+   N FNL 
Subjt:  RLGGGRTGQFTHLLPPSIASSFR---PPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLG

Query:  FI----SNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGE-NESNLQ
        +     ++    VPSL++  A                                  +N  + LLR   SSSSS    +D            FG+ +  NLQ
Subjt:  FI----SNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGE-NESNLQ

Query:  DLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGV-GQIVRSMSGGGGGGGG
         LMNS A  +   G                     SL D        + +SMG G+DRLT DFLGV G IV +++G GG  GG
Subjt:  DLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGV-GQIVRSMSGGGGGGGG

Q944L3 Zinc finger protein BALDIBIS6.8e-8551.58Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEP-KRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCD
        DPDAEVIALSP +LM TNRFICEVCNKGF+R+QNLQLHRRGHNLPWKLKQ++ KE  K+KVY+CPE TCVHHDP+RALGDLTGIKKH+SRKHGEKKWKCD
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEP-KRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCD

Query:  KCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARH------PPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIA
        KCSK+YAV SDWKAHSK CGT+EYRCDCGTLFSR+DSFITHRAFCDALA+ESAR       P  L  A+   +   N N      Q+   SS  D     
Subjt:  KCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARH------PPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIA

Query:  QSPHDVLRLGGGRTGQFTHLLPPSI-ASSFRPPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASN
         + +++  LG          LP ++ ASS  P P+ A  S           QN +H                     +Q  S  Q   + NNNNNN   N
Subjt:  QSPHDVLRLGGGRTGQFTHLLPPSI-ASSFRPPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASN

Query:  LFNLGFISNP-------TAAVPSLYSTAAPGGCSS-GTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSS
        +   G   N          +  SL+S+ A    ++   +GG I  MSATALLQKAAQ+GS  SSS+++ +  +TFG  +S
Subjt:  LFNLGFISNP-------TAAVPSLYSTAAPGGCSS-GTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSS

Q9ZUL3 Protein indeterminate-domain 5, chloroplastic8.3e-11548.65Show/hide
Query:  DAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCS
        DAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKE KRKVYLCPEP+CVHHDPSRALGDLTGIKKHY RKHGEKKWKCDKCS
Subjt:  DAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCS

Query:  KRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHL-YGANSN--VGLTLSQVPQISSLQDHTNIAQSPHDV
        KRYAVQSDWKAHSKTCGT+EYRCDCGTLFSRRDSFITHRAFCDALAQESARHP +L      H  YG N+N       S +  +S +    N+   P DV
Subjt:  KRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHL-YGANSN--VGLTLSQVPQISSLQDHTNIAQSPHDV

Query:  LRLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHAS--SNNNNNNSASNLFNLG
        LRLG G  G           ++ R        +++ +F+ +Q       +DHH    Q  L GN   + + Q P   Q +    S++N+N++ SN+FNL 
Subjt:  LRLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHAS--SNNNNNNSASNLFNLG

Query:  FI----------SNPTAAVPSLYSTA------------APGGCSSGTSG-----------------------------GPIPHMSATALLQKAAQLGSTT
        F+          SNP AA  +  S+             A GG   G++G                                PHMSATALLQKAAQ+GST+
Subjt:  FI----------SNPTAAVPSLYSTA------------APGGCSSGTSG-----------------------------GPIPHMSATALLQKAAQLGSTT

Query:  SSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGA
        S++N         GS++++   AS       S+G  ++GENESNLQDLMNSF+   +   + G   S FG + G N+                    + A
Subjt:  SSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGA

Query:  GTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSS
            +TRDFLGVGQIV+SMSG GG       Q++ +Q  Q     GN      SS
Subjt:  GTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSS

Q9ZWA6 Zinc finger protein MAGPIE2.0e-8444.28Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        DP+AEVIALSPKTLMATNRF+CE+C KGFQR+QNLQLHRRGHNLPWKLKQ+++KE +++VY+CPE +CVHH P+RALGDLTGIKKH+ RKHGEKKWKC+K
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        C+KRYAVQSDWKAHSKTCGTREYRCDCGT+FSRRDSFITHRAFCDALA+E+AR    L  A  SHL    +  G  L+    + +L    ++ Q P    
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFFLSDQTNQNSFH----------EDHHQSQSQQGLFGNKAFHGLMQFPSD-IQTHASSNN-----
          G  +     H   P   ++F    Q  M  ++   L    N N             + H   +    +FGN   HG +   SD + TH ++ N     
Subjt:  RLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFFLSDQTNQNSFH----------EDHHQSQSQQGLFGNKAFHGLMQFPSD-IQTHASSNN-----

Query:  NNNNSASNLFNLGFISNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTT------ATLLRTFGSSSSSGGKASDRTLFPPS
         N N A++L           +VPSL+S+       +  +   + +MSATALLQKAAQ+G+T+S+S TT      +  L++F S S+   +      F  S
Subjt:  NNNNSASNLFNLGFISNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTT------ATLLRTFGSSSSSGGKASDRTLFPPS

Query:  YGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVG
        +G        SN  +LM++   G    G   +G++   G        M  L++    ++ +   + G G    TRDFLGVG
Subjt:  YGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVG

Arabidopsis top hitse value%identityAlignment
AT1G14580.1 C2H2-like zinc finger protein1.2e-8745.76Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKT+MATNRF+CEVCNKGFQREQNLQLHRRGHNLPWKLKQKS KE +RKVYLCPEP+CVHHDP+RALGDLTGIKKHY RKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGT+EYRCDCGT+FSRRDS+ITHRAFCDAL QESAR+     P +      A    G         SS   H +   +P+   
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFR---PPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLG
          G        + L  S +  F    P      P    F +    NQ         +Q+ Q L  +   HGL+            NNNNN+   N FNL 
Subjt:  RLGGGRTGQFTHLLPPSIASSFR---PPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLG

Query:  FI----SNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGE-NESNLQ
        +     ++    VPSL++  A                                  +N  + LLR   SSSSS    +D            FG+ +  NLQ
Subjt:  FI----SNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGE-NESNLQ

Query:  DLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGV-GQIVRSMSGGGGGGGG
         LMNS A  +   G                     SL D        + +SMG G+DRLT DFLGV G IV +++G GG  GG
Subjt:  DLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGV-GQIVRSMSGGGGGGGG

AT1G14580.2 C2H2-like zinc finger protein1.2e-8745.76Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEVIALSPKT+MATNRF+CEVCNKGFQREQNLQLHRRGHNLPWKLKQKS KE +RKVYLCPEP+CVHHDP+RALGDLTGIKKHY RKHGEKKWKCDK
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL
        CSKRYAVQSDWKAHSKTCGT+EYRCDCGT+FSRRDS+ITHRAFCDAL QESAR+     P +      A    G         SS   H +   +P+   
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVL

Query:  RLGGGRTGQFTHLLPPSIASSFR---PPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLG
          G        + L  S +  F    P      P    F +    NQ         +Q+ Q L  +   HGL+            NNNNN+   N FNL 
Subjt:  RLGGGRTGQFTHLLPPSIASSFR---PPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLG

Query:  FI----SNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGE-NESNLQ
        +     ++    VPSL++  A                                  +N  + LLR   SSSSS    +D            FG+ +  NLQ
Subjt:  FI----SNPTAAVPSLYSTAAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGE-NESNLQ

Query:  DLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGV-GQIVRSMSGGGGGGGG
         LMNS A  +   G                     SL D        + +SMG G+DRLT DFLGV G IV +++G GG  GG
Subjt:  DLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGV-GQIVRSMSGGGGGGGG

AT2G02070.1 indeterminate(ID)-domain 55.9e-11648.65Show/hide
Query:  DAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCS
        DAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKE KRKVYLCPEP+CVHHDPSRALGDLTGIKKHY RKHGEKKWKCDKCS
Subjt:  DAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCS

Query:  KRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHL-YGANSN--VGLTLSQVPQISSLQDHTNIAQSPHDV
        KRYAVQSDWKAHSKTCGT+EYRCDCGTLFSRRDSFITHRAFCDALAQESARHP +L      H  YG N+N       S +  +S +    N+   P DV
Subjt:  KRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHL-YGANSN--VGLTLSQVPQISSLQDHTNIAQSPHDV

Query:  LRLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHAS--SNNNNNNSASNLFNLG
        LRLG G  G           ++ R        +++ +F+ +Q       +DHH    Q  L GN   + + Q P   Q +    S++N+N++ SN+FNL 
Subjt:  LRLGGGRTGQFTHLLPPSIASSFRPPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHAS--SNNNNNNSASNLFNLG

Query:  FI----------SNPTAAVPSLYSTA------------APGGCSSGTSG-----------------------------GPIPHMSATALLQKAAQLGSTT
        F+          SNP AA  +  S+             A GG   G++G                                PHMSATALLQKAAQ+GST+
Subjt:  FI----------SNPTAAVPSLYSTA------------APGGCSSGTSG-----------------------------GPIPHMSATALLQKAAQLGSTT

Query:  SSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGA
        S++N         GS++++   AS       S+G  ++GENESNLQDLMNSF+   +   + G   S FG + G N+                    + A
Subjt:  SSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGA

Query:  GTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSS
            +TRDFLGVGQIV+SMSG GG       Q++ +Q  Q     GN      SS
Subjt:  GTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSS

AT2G02080.1 indeterminate(ID)-domain 49.1e-10148.85Show/hide
Query:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK
        +PDAEV+ALSPKTLMATNRFIC+VCNKGFQREQNLQLHRRGHNLPWKLKQKSTKE KRKVYLCPEPTCVHHDPSRALGDLTGIKKHY RKHGEKKWKC+K
Subjt:  DPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDK

Query:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVG-------LTLSQVPQISSLQDHTNIA
        CSKRYAVQSDWKAHSKTCGT+EYRCDCGT+FSRRDS+ITHRAFCDAL QE+AR+P        + +  A+S VG       L          L DH N  
Subjt:  CSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVG-------LTLSQVPQISSLQDHTNIA

Query:  QSPHDVLRLGGGRTGQFTHLLPPSIASSFRPP---PQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSA
         +P     L              +IASS       PQ + P+      S Q   N+   +++QS   Q        HGL+QF      +  S+  NN   
Subjt:  QSPHDVLRLGGGRTGQFTHLLPPSIASSFRPP---PQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSA

Query:  SNLFNLGFI----SNPTAAVPSLYST-AAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGG--VV
         + FNLGF      N   ++PSLYST           + G   ++SATALLQKA Q+GS T  SN  + L R   SSS+S    ++       +GG  ++
Subjt:  SNLFNLGFI----SNPTAAVPSLYST-AAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGG--VV

Query:  FGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQ
          +N  NLQ LMNS A  + G    GSG S F    G N  NM                   +G+D+LT DFLGVG +VR+++ GGGGGG G        
Subjt:  FGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQ

Query:  TAQGIVLEGNESNTAPSSQAFGGG
        +A+G V    E+     +  FG G
Subjt:  TAQGIVLEGNESNTAPSSQAFGGG

AT2G02080.2 indeterminate(ID)-domain 43.1e-9347.84Show/hide
Query:  MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH
        MATNRFIC+VCNKGFQREQNLQLHRRGHNLPWKLKQKSTKE KRKVYLCPEPTCVHHDPSRALGDLTGIKKHY RKHGEKKWKC+KCSKRYAVQSDWKAH
Subjt:  MATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKWKCDKCSKRYAVQSDWKAH

Query:  SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVG-------LTLSQVPQISSLQDHTNIAQSPHDVLRLGGGRT
        SKTCGT+EYRCDCGT+FSRRDS+ITHRAFCDAL QE+AR+P        + +  A+S VG       L          L DH N   +P     L     
Subjt:  SKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVG-------LTLSQVPQISSLQDHTNIAQSPHDVLRLGGGRT

Query:  GQFTHLLPPSIASSFRPP---PQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGFI----S
                 +IASS       PQ + P+      S Q   N+   +++QS   Q        HGL+QF      +  S+  NN    + FNLGF      
Subjt:  GQFTHLLPPSIASSFRPP---PQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGFI----S

Query:  NPTAAVPSLYST-AAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGG--VVFGENESNLQDLMNS
        N   ++PSLYST           + G   ++SATALLQKA Q+GS T  SN  + L R   SSS+S    ++       +GG  ++  +N  NLQ LMNS
Subjt:  NPTAAVPSLYST-AAPGGCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGG--VVFGENESNLQDLMNS

Query:  FATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNT
         A  + G    GSG S F    G N  NM                   +G+D+LT DFLGVG +VR+++ GGGGGG G        +A+G V    E+  
Subjt:  FATGSSGSGMFGSGMSSFGGFEGSNRSNMESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNT

Query:  APSSQAFGGG
           +  FG G
Subjt:  APSSQAFGGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGGGGAAATTGTTGTGTTGTGTTGTGTGTGTTTTCAGATCCAGATGCGGAAGTGATAGCGTTGAGTCCAAAGACATTAATGGCAACAAACAGATTCATATGTGA
AGTATGTAACAAAGGATTTCAAAGAGAGCAAAATCTACAACTACACAGAAGAGGACATAATTTGCCATGGAAATTAAAGCAAAAAAGCACAAAAGAGCCAAAAAGAAAAG
TGTATTTATGCCCAGAACCCACATGCGTACACCATGACCCTTCAAGAGCACTTGGAGATTTAACTGGCATCAAAAAGCACTACTCCCGAAAGCACGGCGAGAAGAAGTGG
AAGTGTGATAAATGCTCTAAACGCTACGCCGTTCAATCTGACTGGAAAGCCCATTCCAAAACCTGTGGTACCAGAGAATACCGTTGCGATTGTGGCACTCTCTTCTCCAG
ACGGGACAGTTTCATTACTCATAGAGCCTTTTGTGATGCATTGGCTCAAGAAAGTGCAAGACACCCACCAAATTTAGGGCCAGCCATTGGAAGCCATTTATATGGAGCTA
ATAGCAATGTGGGTTTGACATTATCACAAGTCCCTCAAATCTCTTCACTTCAAGACCACACCAATATTGCTCAATCACCCCACGACGTCCTCCGTCTCGGTGGCGGTCGA
ACCGGCCAATTCACTCATCTCCTCCCTCCTTCTATTGCCTCTTCCTTCCGACCCCCGCCACAACAAGCAATGCCGTCCTCCAATGCCTTCTTCCTTTCGGATCAAACTAA
CCAAAATAGCTTCCATGAAGATCATCATCAAAGCCAATCCCAACAAGGGTTGTTTGGAAATAAAGCCTTTCATGGCTTAATGCAATTCCCTTCTGATATCCAAACCCATG
CAAGTAGTAACAACAACAACAACAATTCTGCCTCAAATCTCTTCAATTTGGGCTTCATTTCAAATCCAACGGCTGCAGTCCCATCTCTCTACAGCACTGCCGCCCCGGGA
GGATGTAGTAGCGGTACAAGCGGAGGACCGATCCCACATATGTCCGCTACGGCACTTCTCCAAAAGGCAGCACAATTAGGCTCAACAACGTCGAGTAGCAACACTACAGC
AACATTGCTAAGAACGTTCGGAAGCTCCTCGAGCTCAGGTGGTAAGGCGTCTGATAGAACGCTGTTCCCGCCGAGCTACGGCGGAGTAGTGTTTGGCGAAAATGAGAGCA
ATCTCCAGGATTTGATGAACTCGTTCGCAACTGGGAGCTCGGGAAGTGGGATGTTCGGGAGCGGGATGAGCTCATTCGGGGGATTTGAAGGGAGTAATCGAAGCAATATG
GAAAGTTTGGAGGATCCAACGAAGTTACAACAGAATCTAAGCACAGTGAGTATGGGAGCTGGGACAGATAGGTTAACAAGAGACTTCTTAGGGGTTGGACAGATTGTAAG
AAGTATGAGCGGCGGCGGCGGTGGTGGTGGCGGCGGTTATTCACAGAGAGAACATAAACAAACGGCGCAAGGGATAGTTTTGGAGGGTAATGAGAGTAATACAGCGCCGT
CAAGCCAAGCATTTGGTGGTGGAAATGGAAACTACCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGGGGAAATTGTTGTGTTGTGTTGTGTGTGTTTTCAGATCCAGATGCGGAAGTGATAGCGTTGAGTCCAAAGACATTAATGGCAACAAACAGATTCATATGTGA
AGTATGTAACAAAGGATTTCAAAGAGAGCAAAATCTACAACTACACAGAAGAGGACATAATTTGCCATGGAAATTAAAGCAAAAAAGCACAAAAGAGCCAAAAAGAAAAG
TGTATTTATGCCCAGAACCCACATGCGTACACCATGACCCTTCAAGAGCACTTGGAGATTTAACTGGCATCAAAAAGCACTACTCCCGAAAGCACGGCGAGAAGAAGTGG
AAGTGTGATAAATGCTCTAAACGCTACGCCGTTCAATCTGACTGGAAAGCCCATTCCAAAACCTGTGGTACCAGAGAATACCGTTGCGATTGTGGCACTCTCTTCTCCAG
ACGGGACAGTTTCATTACTCATAGAGCCTTTTGTGATGCATTGGCTCAAGAAAGTGCAAGACACCCACCAAATTTAGGGCCAGCCATTGGAAGCCATTTATATGGAGCTA
ATAGCAATGTGGGTTTGACATTATCACAAGTCCCTCAAATCTCTTCACTTCAAGACCACACCAATATTGCTCAATCACCCCACGACGTCCTCCGTCTCGGTGGCGGTCGA
ACCGGCCAATTCACTCATCTCCTCCCTCCTTCTATTGCCTCTTCCTTCCGACCCCCGCCACAACAAGCAATGCCGTCCTCCAATGCCTTCTTCCTTTCGGATCAAACTAA
CCAAAATAGCTTCCATGAAGATCATCATCAAAGCCAATCCCAACAAGGGTTGTTTGGAAATAAAGCCTTTCATGGCTTAATGCAATTCCCTTCTGATATCCAAACCCATG
CAAGTAGTAACAACAACAACAACAATTCTGCCTCAAATCTCTTCAATTTGGGCTTCATTTCAAATCCAACGGCTGCAGTCCCATCTCTCTACAGCACTGCCGCCCCGGGA
GGATGTAGTAGCGGTACAAGCGGAGGACCGATCCCACATATGTCCGCTACGGCACTTCTCCAAAAGGCAGCACAATTAGGCTCAACAACGTCGAGTAGCAACACTACAGC
AACATTGCTAAGAACGTTCGGAAGCTCCTCGAGCTCAGGTGGTAAGGCGTCTGATAGAACGCTGTTCCCGCCGAGCTACGGCGGAGTAGTGTTTGGCGAAAATGAGAGCA
ATCTCCAGGATTTGATGAACTCGTTCGCAACTGGGAGCTCGGGAAGTGGGATGTTCGGGAGCGGGATGAGCTCATTCGGGGGATTTGAAGGGAGTAATCGAAGCAATATG
GAAAGTTTGGAGGATCCAACGAAGTTACAACAGAATCTAAGCACAGTGAGTATGGGAGCTGGGACAGATAGGTTAACAAGAGACTTCTTAGGGGTTGGACAGATTGTAAG
AAGTATGAGCGGCGGCGGCGGTGGTGGTGGCGGCGGTTATTCACAGAGAGAACATAAACAAACGGCGCAAGGGATAGTTTTGGAGGGTAATGAGAGTAATACAGCGCCGT
CAAGCCAAGCATTTGGTGGTGGAAATGGAAACTACCAGTGA
Protein sequenceShow/hide protein sequence
MNRGNCCVVLCVFSDPDAEVIALSPKTLMATNRFICEVCNKGFQREQNLQLHRRGHNLPWKLKQKSTKEPKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYSRKHGEKKW
KCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAQESARHPPNLGPAIGSHLYGANSNVGLTLSQVPQISSLQDHTNIAQSPHDVLRLGGGR
TGQFTHLLPPSIASSFRPPPQQAMPSSNAFFLSDQTNQNSFHEDHHQSQSQQGLFGNKAFHGLMQFPSDIQTHASSNNNNNNSASNLFNLGFISNPTAAVPSLYSTAAPG
GCSSGTSGGPIPHMSATALLQKAAQLGSTTSSSNTTATLLRTFGSSSSSGGKASDRTLFPPSYGGVVFGENESNLQDLMNSFATGSSGSGMFGSGMSSFGGFEGSNRSNM
ESLEDPTKLQQNLSTVSMGAGTDRLTRDFLGVGQIVRSMSGGGGGGGGGYSQREHKQTAQGIVLEGNESNTAPSSQAFGGGNGNYQ