; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021859 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021859
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptiontetrapyrrole-binding protein, chloroplastic-like
Genome locationtig00153840:888622..893264
RNA-Seq ExpressionSgr021859
SyntenySgr021859
Gene Ontology termsGO:0010019 - chloroplast-nucleus signaling pathway (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0046906 - tetrapyrrole binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR008629 - GUN4-like
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily
IPR037215 - GUN4-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8696313.1 Tetrapyrrole-binding protein [Hibiscus syriacus]7.0e-10150.7Show/hide
Query:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE
        MGSA  +++AAF+EKV+RTV++DNLSP VTE VLRTALD                     QCALVEM+  K+++ V+S +++ PFMMSGMPRPVRAR + 
Subjt:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE

Query:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQPEIS
         EMF DRP KPGR +   WL+  DP+FE                      Q+ +EEKLAKQQ E LK NYKKYE+++++M DGT R         QP   
Subjt:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQPEIS

Query:  PTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISA
                                ST AAA    S + +  SST    +T SF+ LQ HLS  NFRQADEETRRLLI LA E  +KRGYV FSEVQFI  
Subjt:  PTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISA

Query:  DDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAM
         DL+ ID  W++HS+ KFGYSVQK I++KV++DFTKLF+K+GWMKKLDTE+EQYNYR+FP EF+WEL ++TPEGHLPLTNALRGT+L + IL HPAF+  
Subjt:  DDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAM

Query:  GEEEADEENDNGGLKKGLKSMSERIFKRDY
         E+E     +NGG++ G K+  ++ FK +Y
Subjt:  GEEEADEENDNGGLKKGLKSMSERIFKRDY

KAF9829801.1 hypothetical protein H0E87_023226 [Populus deltoides]4.4e-13559.12Show/hide
Query:  SATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAEAE
        S  +  +A F EKVKRTVY+DNLSPQVTE V+RTAL QFGTV ++QFIPNY  P N  +CALVEM+  ++A++V+S I QFPFMMSGMPRP RA  AE E
Subjt:  SATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAEAE

Query:  MFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQ------
        MFDDRPIKPGR+I   W+++ DPDFEVA K+KCL RKHAAE  FLL+QQ+ +EEKLAKQQ+E LK NYKK+ I+++V + G   S    + +H+      
Subjt:  MFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQ------

Query:  ----------PEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQK
                  P+  P++L SS  L P +P      SLS +     T++S+S  T+S+TP TS T S D LQ HLS +NFR+ADEETRRLLI LAGEAAQ 
Subjt:  ----------PEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQK

Query:  RGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFE-KVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT
        RGYV+FSEVQFIS +DL+ ID+LW+ HS+ KFGYSVQKRI++ K N+DFTK F+K+GWMKKLDTE+EQYNYR+FP EF+W+L + TPEGHLPLTNALRGT
Subjt:  RGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFE-KVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT

Query:  QLMSNILSHPAFDA---MGEEEADEENDNGGLKKGLKS-----MSERIFKRDYSF
        QL+ NIL+HPAF+     GE +  E N+NGGL KGL+      +S+R+ K DYSF
Subjt:  QLMSNILSHPAFDA---MGEEEADEENDNGGLKKGLKS-----MSERIFKRDYSF

KAG6741356.1 hypothetical protein POTOM_054590 [Populus tomentosa]5.7e-11952.74Show/hide
Query:  FAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAEAEMFDDRP
        +AAF EKVKRTVYVDNLSPQVTE V+RTAL QFGTV ++QFIPNY  P N S CALVEM+  ++A++V+S I QFPFMMSGMPRP RA  AE EMFDDRP
Subjt:  FAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAEAEMFDDRP

Query:  IKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTA-----------------------
        IKPGR+I   W+++ DPDFEVA K+K L RKHAAE  FLL+QQ+ +EEKLAKQQ+E LK NYKK+ I+++V+ DGTA                       
Subjt:  IKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTA-----------------------

Query:  -------------------------RSHHSLLHRHQPEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPD------------TS
                                     SL H +  +I        +F      I+   S+  S  A  +   +    T     D            T 
Subjt:  -------------------------RSHHSLLHRHQPEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPD------------TS

Query:  STVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFE-KVNRDFTKLFMKIGWMKKL
          V     Q HLS +NFR+ADEETRRLLI LAGEAAQ RGYV+FSEVQFIS +DL+ ID+LW+ HS+ KFGYSVQKRI++ K N+DFTK F+K+GWMKKL
Subjt:  STVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFE-KVNRDFTKLFMKIGWMKKL

Query:  DTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDA---MGEEEADEENDNGGLKKGLKS-----MSERIFKRDYSF
        DTE+EQYNYR+FP EF+W+L + TPEGHLPLTNALRGTQL+ NIL+HPAF+     GE +  E N+NGGL KGL+      +S+R+ K DYSF
Subjt:  DTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDA---MGEEEADEENDNGGLKKGLKS-----MSERIFKRDYSF

XP_022953248.1 tetrapyrrole-binding protein, chloroplastic-like [Cucurbita moschata]3.6e-8972.51Show/hide
Query:  HSLLHRHQPEISPTALSSS--LFLKPATPIATVA-SSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQ
        H+L   H+P +SP   SSS  L    +TP AT A ++ SS +AAAA+SFSLS+       DTSS VSFDEL+ HL+ +NFRQADEETRRLLIALAG+AAQ
Subjt:  HSLLHRHQPEISPTALSSS--LFLKPATPIATVA-SSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQ

Query:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT
        KRGYVYFSEVQF+S+D+L+AIDDLWQ++SDGKFGYSVQKR+FEKVNRDFTKLFMK+GWMKKLDTE+EQYNYR+FPTEF+WELTEETPEGHLPLTNALRGT
Subjt:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT

Query:  QLMSNILSHPAFDAMGEEEADEEN----DNGGLKKGLKSMSERIFKRDYSF
        QLM+NILSHPAF+    E   +EN    +NGG KKGLK+MSERIFKRDYSF
Subjt:  QLMSNILSHPAFDAMGEEEADEEN----DNGGLKKGLKSMSERIFKRDYSF

XP_038884158.1 tetrapyrrole-binding protein, chloroplastic [Benincasa hispida]8.0e-8973.62Show/hide
Query:  HSLLHRHQPEISPTALSSS--LFLKP---ATPIAT-VASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGE
        HSL   H P +SPT+ SSS  LFLKP   A P  T  A++ SS +A+AA SFSLSA       DTSS +SFDEL   L+ ++FRQADEETRRLLIALAGE
Subjt:  HSLLHRHQPEISPTALSSS--LFLKP---ATPIAT-VASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGE

Query:  AAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNAL
         AQKRGYVYFSEVQFI+ADDL+AID+LWQ++SDGKFGYSVQKRIFEKVN+DFTKLFMK+GWMKKLDTEMEQYNYR+FPTEF+WELTEETPEGHLPLTNAL
Subjt:  AAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNAL

Query:  RGTQLMSNILSHPAFDAMGEEEADEE----NDNGGLKKGLKSMSERIFKRDYSF
        RGTQLM NILSHPAF+    EE  EE    ++NGGLKKGLKS++ERIFKRDYSF
Subjt:  RGTQLMSNILSHPAFDAMGEEEADEE----NDNGGLKKGLKSMSERIFKRDYSF

TrEMBL top hitse value%identityAlignment
A0A1S3BE43 tetrapyrrole-binding protein, chloroplastic1.1e-8872.22Show/hide
Query:  SLLHRHQPEISPTALSSS--LFLKPATPIA---TVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAA
        SL H H+P +SPT+ SSS  LFLKPA+  A   T  ++ SS +AAAA SFSLS+       DTSS +SFDEL   L+ ++FR+ADEETRRLLIALAG+ A
Subjt:  SLLHRHQPEISPTALSSS--LFLKPATPIA---TVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAA

Query:  QKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRG
         KRGYVYFSEVQFI+A+DL+AIDDLWQ+HSDGKFGYSVQKRIFEKVN+DFTKLFMK+GWMKKLDTE+EQYNYR+FPTEF+WELTEETPEGHLPLTNALRG
Subjt:  QKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRG

Query:  TQLMSNILSHPAFDAMGEEEADEE----NDNGGLKKGLKSMSERIFKRDYSF
        TQLMSNIL+HPAF+    EE  EE    ++NGGLKKGLKS++ER+FKRDYSF
Subjt:  TQLMSNILSHPAFDAMGEEEADEE----NDNGGLKKGLKSMSERIFKRDYSF

A0A2N9ILK2 RRM domain-containing protein9.8e-12557.73Show/hide
Query:  SATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAEAE
        SA    +AAFEEKVKRTVY+DNLSPQVTEPV+RTAL+QFG V S+QFIPNY+EP N  QC LVEM+NS++A +V+S I+Q+PFMMSGMPRPVRARPAE  
Subjt:  SATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAEAE

Query:  MFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQPEISPT
        MF DRPI PGR+I   WL++ DPDFE A+K+K LTRKHAAEAAFLLK  +L                                                 
Subjt:  MFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQPEISPT

Query:  ALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADD
         LSSS    P  P  T   SLS T + + T+FS    T+S+TP TS  +SFD LQ HLS +NF+QADEETRRLLI LAGE AQKRGYV+FSEVQFI   D
Subjt:  ALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADD

Query:  LRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAMGE
        L+ ID+LW++HS+ KFGYSVQK+I++K +RDF+K F K+ W+KKLDTE+EQYNYR+FPTEF+WEL ++TPEGHLPLTNALRGTQL+++IL+HPAF  M E
Subjt:  LRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAMGE

Query:  EEADEE-------NDNGGLK---KGLKSMSERIFKRDYSF
        EE  E+       ++NGGLK    G K +S R+F  DYSF
Subjt:  EEADEE-------NDNGGLK---KGLKSMSERIFKRDYSF

A0A5A7SPC7 Tetrapyrrole-binding protein1.1e-8872.22Show/hide
Query:  SLLHRHQPEISPTALSSS--LFLKPATPIA---TVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAA
        SL H H+P +SPT+ SSS  LFLKPA+  A   T  ++ SS +AAAA SFSLS+       DTSS +SFDEL   L+ ++FR+ADEETRRLLIALAG+ A
Subjt:  SLLHRHQPEISPTALSSS--LFLKPATPIA---TVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAA

Query:  QKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRG
         KRGYVYFSEVQFI+A+DL+AIDDLWQ+HSDGKFGYSVQKRIFEKVN+DFTKLFMK+GWMKKLDTE+EQYNYR+FPTEF+WELTEETPEGHLPLTNALRG
Subjt:  QKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRG

Query:  TQLMSNILSHPAFDAMGEEEADEE----NDNGGLKKGLKSMSERIFKRDYSF
        TQLMSNIL+HPAF+    EE  EE    ++NGGLKKGLKS++ER+FKRDYSF
Subjt:  TQLMSNILSHPAFDAMGEEEADEE----NDNGGLKKGLKSMSERIFKRDYSF

A0A6A2ZX88 Tetrapyrrole-binding protein3.4e-10150.7Show/hide
Query:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE
        MGSA  +++AAF+EKV+RTV++DNLSP VTE VLRTALD                     QCALVEM+  K+++ V+S +++ PFMMSGMPRPVRAR + 
Subjt:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE

Query:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQPEIS
         EMF DRP KPGR +   WL+  DP+FE                      Q+ +EEKLAKQQ E LK NYKKYE+++++M DGT R         QP   
Subjt:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQPEIS

Query:  PTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISA
                                ST AAA    S + +  SST    +T SF+ LQ HLS  NFRQADEETRRLLI LA E  +KRGYV FSEVQFI  
Subjt:  PTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISA

Query:  DDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAM
         DL+ ID  W++HS+ KFGYSVQK I++KV++DFTKLF+K+GWMKKLDTE+EQYNYR+FP EF+WEL ++TPEGHLPLTNALRGT+L + IL HPAF+  
Subjt:  DDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAM

Query:  GEEEADEENDNGGLKKGLKSMSERIFKRDY
         E+E     +NGG++ G K+  ++ FK +Y
Subjt:  GEEEADEENDNGGLKKGLKSMSERIFKRDY

A0A6J1GMW2 tetrapyrrole-binding protein, chloroplastic-like1.7e-8972.51Show/hide
Query:  HSLLHRHQPEISPTALSSS--LFLKPATPIATVA-SSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQ
        H+L   H+P +SP   SSS  L    +TP AT A ++ SS +AAAA+SFSLS+       DTSS VSFDEL+ HL+ +NFRQADEETRRLLIALAG+AAQ
Subjt:  HSLLHRHQPEISPTALSSS--LFLKPATPIATVA-SSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQ

Query:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT
        KRGYVYFSEVQF+S+D+L+AIDDLWQ++SDGKFGYSVQKR+FEKVNRDFTKLFMK+GWMKKLDTE+EQYNYR+FPTEF+WELTEETPEGHLPLTNALRGT
Subjt:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT

Query:  QLMSNILSHPAFDAMGEEEADEEN----DNGGLKKGLKSMSERIFKRDYSF
        QLM+NILSHPAF+    E   +EN    +NGG KKGLK+MSERIFKRDYSF
Subjt:  QLMSNILSHPAFDAMGEEEADEEN----DNGGLKKGLKSMSERIFKRDYSF

SwissProt top hitse value%identityAlignment
O19887 Uncharacterized protein ycf533.3e-2135.21Show/hide
Query:  FDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEME
        ++ L+  LS  NF +A++ T+++L+ LAGE +++R ++YF++V  I+   +  ++ LW+ +S G FG+S+QKRI+  VN+++ K + KIGW+        
Subjt:  FDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEME

Query:  QYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSH
           +  +P EF+W +    P GHLPL N LRG  ++  I  +
Subjt:  QYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSH

P51202 Uncharacterized protein ycf531.0e-2238.16Show/hide
Query:  TSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKK
        ++  + + +LQ+ L   N   AD+ T++ LI LAG   + R ++YF++++ I  +DL+ ID LW  HS GKFG+ VQ++I+  + +++ + + KIGW   
Subjt:  TSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKK

Query:  LDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFD
           E+ +   R +P EF W  T   P+GHLPL N LRG Q++S + SH A+D
Subjt:  LDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFD

P72583 Ycf53-like protein9.1e-1933.85Show/hide
Query:  KPATPIATVASSLSS-TAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDL
        KP   +  V  +L +       T    +  T      ++  + +  LQ  L +++F  ADE TR  L  LAG  A +R ++YF+EV+   A DL  I+ L
Subjt:  KPATPIATVASSLSS-TAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDL

Query:  WQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAMG
        W  HS+G FG+SVQ+R++    ++FTKL+ KIGW            +  +P  F W+L+   P+GHLPL N LRG ++  ++  HP +   G
Subjt:  WQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAMG

Q1XDT5 Uncharacterized protein ycf531.9e-2440.13Show/hide
Query:  TSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKK
        ++  +++ +LQ+ L+ R+  +AD+ T++ LI LAG  AQ R ++YF++++ I A DL+ ID LW  HS GKFG  VQ++I+  V +D+ K + KIGW   
Subjt:  TSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKK

Query:  LDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFD
           E+++   R +P EF W      P GHLPL N LRG Q++S + +H A++
Subjt:  LDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFD

Q9LX31 Tetrapyrrole-binding protein, chloroplastic1.3e-5752.17Show/hide
Query:  SHHSLLHRHQPEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVS-FDELQLHLSTRNFRQADEETRRLLIALAGEAAQ
        +HH      Q    PT+LS    LK  T  AT +   S+++ +++T+   + +T++++  T+ T + FD L+ HL  +NFRQADEETRRLLI ++GEAA 
Subjt:  SHHSLLHRHQPEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVS-FDELQLHLSTRNFRQADEETRRLLIALAGEAAQ

Query:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT
        KRGYV+FSEV+ IS +DL+AID+LW +HSDG+FGYSVQ++I+ KV +DFT+ F+K+ WMK LDTE+ QYNYR+FP EF WEL +ETP GHLPLTNALRGT
Subjt:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT

Query:  QLMSNILSHPAF----DAMGEEEADEENDNGGLKKGLKSM--SERIFKRDYSF
        QL+  +LSHPAF    D  GE E DE N    + K    +   +R+FK +YSF
Subjt:  QLMSNILSHPAF----DAMGEEEADEENDNGGLKKGLKSM--SERIFKRDYSF

Arabidopsis top hitse value%identityAlignment
AT1G05970.1 RNA-binding (RRM/RBD/RNP motifs) family protein3.8e-4447.31Show/hide
Query:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE
        M +  +  +  F E+V+RTVYVD L+P  T PV+ +A +QFGTV  + FIPNY+ P       LVEM+N +  ++VIS ++Q PFM++GMPRPVRA  AE
Subjt:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE

Query:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTAR
          MF D+P KPGR + F W++ +DPDF+ A+++K L RKH+AE +F+LK +  E EKL+KQQ E    ++KK+E+++ ++ DG A+
Subjt:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTAR

AT1G05970.2 RNA-binding (RRM/RBD/RNP motifs) family protein4.0e-4647.85Show/hide
Query:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE
        M +  +  +  F E+V+RTVYVD L+P  T PV+ +A +QFGTV  + FIPNY+ P       LVEM+N +  ++VIS ++Q PFM++GMPRPVRA  AE
Subjt:  MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAE

Query:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTAR
          MF D+P KPGR + F W++ +DPDF+ A+++K L RKH+AE +F+LK+Q+ E EKL+KQQ E    ++KK+E+++ ++ DG A+
Subjt:  AEMFDDRPIKPGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTAR

AT3G59400.1 enzyme binding;tetrapyrrole binding9.2e-5952.17Show/hide
Query:  SHHSLLHRHQPEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVS-FDELQLHLSTRNFRQADEETRRLLIALAGEAAQ
        +HH      Q    PT+LS    LK  T  AT +   S+++ +++T+   + +T++++  T+ T + FD L+ HL  +NFRQADEETRRLLI ++GEAA 
Subjt:  SHHSLLHRHQPEISPTALSSSLFLKPATPIATVASSLSSTAAAAATSFSLSAATSSSTPDTSSTVS-FDELQLHLSTRNFRQADEETRRLLIALAGEAAQ

Query:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT
        KRGYV+FSEV+ IS +DL+AID+LW +HSDG+FGYSVQ++I+ KV +DFT+ F+K+ WMK LDTE+ QYNYR+FP EF WEL +ETP GHLPLTNALRGT
Subjt:  KRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKVNRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGT

Query:  QLMSNILSHPAF----DAMGEEEADEENDNGGLKKGLKSM--SERIFKRDYSF
        QL+  +LSHPAF    D  GE E DE N    + K    +   +R+FK +YSF
Subjt:  QLMSNILSHPAF----DAMGEEEADEENDNGGLKKGLKSM--SERIFKRDYSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTCGGCTACGGATGAACAATTTGCCGCATTTGAGGAGAAGGTAAAACGGACTGTCTATGTTGATAACCTCTCTCCCCAAGTGACTGAGCCTGTGTTGAGAACTGC
TTTAGATCAGTTTGGAACTGTTGTCAGCATTCAATTTATTCCAAACTACATGGAGCCATTGAATAGCTCTCAATGTGCTCTAGTGGAGATGAAGAACTCAAAAGAGGCCA
AATCTGTCATCTCGGTGATAGCTCAGTTCCCTTTCATGATGTCTGGAATGCCAAGACCGGTGAGGGCACGCCCCGCAGAAGCAGAAATGTTTGATGATCGCCCAATAAAG
CCTGGTAGGAAGATAAGTTTTGTCTGGTTGGAACAGGATGATCCTGATTTTGAAGTTGCAAAGAAAATGAAGTGTCTTACCCGTAAACACGCCGCTGAAGCTGCATTCTT
GCTCAAGCAACAAATGTTGGAAGAGGAGAAGCTTGCAAAGCAACAGCAAGAAGCCCTCAAAGGAAACTACAAGAAGTATGAGATTGTAGAAAGTGTGATGATTGATGGAA
CTGCACGCAGCCACCACTCTCTCCTCCACCGCCACCAACCTGAAATCTCTCCCACCGCCCTCTCCTCCTCTCTCTTCCTCAAACCAGCCACCCCCATAGCCACCGTCGCT
TCCTCCCTCTCCTCCACCGCAGCCGCCGCCGCCACTTCCTTCTCTCTCTCCGCCGCGACCTCCTCCTCCACCCCCGACACATCCTCCACCGTCTCCTTCGACGAACTCCA
GCTCCACCTCTCGACCAGAAACTTCCGGCAAGCCGACGAGGAGACCCGCCGGCTTCTGATCGCCCTCGCCGGCGAGGCAGCGCAGAAGCGAGGCTACGTCTACTTCTCCG
AGGTCCAGTTCATCTCCGCGGACGATCTCAGAGCCATCGACGACCTCTGGCAAGAACACAGCGACGGCAAATTCGGGTACAGCGTCCAGAAACGGATTTTCGAGAAAGTG
AACAGAGATTTCACAAAACTGTTCATGAAAATCGGGTGGATGAAGAAGCTGGACACAGAGATGGAGCAGTACAATTACAGGTCGTTTCCGACGGAGTTCTTGTGGGAGCT
GACGGAGGAGACGCCGGAGGGGCATCTGCCATTGACGAACGCTCTGAGAGGGACGCAGCTGATGAGCAACATTCTGAGCCATCCGGCGTTCGACGCCATGGGAGAGGAGG
AAGCTGATGAGGAGAACGACAATGGAGGATTGAAGAAGGGGTTGAAATCGATGAGTGAGAGAATATTCAAAAGGGATTACAGCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTCGGCTACGGATGAACAATTTGCCGCATTTGAGGAGAAGGTAAAACGGACTGTCTATGTTGATAACCTCTCTCCCCAAGTGACTGAGCCTGTGTTGAGAACTGC
TTTAGATCAGTTTGGAACTGTTGTCAGCATTCAATTTATTCCAAACTACATGGAGCCATTGAATAGCTCTCAATGTGCTCTAGTGGAGATGAAGAACTCAAAAGAGGCCA
AATCTGTCATCTCGGTGATAGCTCAGTTCCCTTTCATGATGTCTGGAATGCCAAGACCGGTGAGGGCACGCCCCGCAGAAGCAGAAATGTTTGATGATCGCCCAATAAAG
CCTGGTAGGAAGATAAGTTTTGTCTGGTTGGAACAGGATGATCCTGATTTTGAAGTTGCAAAGAAAATGAAGTGTCTTACCCGTAAACACGCCGCTGAAGCTGCATTCTT
GCTCAAGCAACAAATGTTGGAAGAGGAGAAGCTTGCAAAGCAACAGCAAGAAGCCCTCAAAGGAAACTACAAGAAGTATGAGATTGTAGAAAGTGTGATGATTGATGGAA
CTGCACGCAGCCACCACTCTCTCCTCCACCGCCACCAACCTGAAATCTCTCCCACCGCCCTCTCCTCCTCTCTCTTCCTCAAACCAGCCACCCCCATAGCCACCGTCGCT
TCCTCCCTCTCCTCCACCGCAGCCGCCGCCGCCACTTCCTTCTCTCTCTCCGCCGCGACCTCCTCCTCCACCCCCGACACATCCTCCACCGTCTCCTTCGACGAACTCCA
GCTCCACCTCTCGACCAGAAACTTCCGGCAAGCCGACGAGGAGACCCGCCGGCTTCTGATCGCCCTCGCCGGCGAGGCAGCGCAGAAGCGAGGCTACGTCTACTTCTCCG
AGGTCCAGTTCATCTCCGCGGACGATCTCAGAGCCATCGACGACCTCTGGCAAGAACACAGCGACGGCAAATTCGGGTACAGCGTCCAGAAACGGATTTTCGAGAAAGTG
AACAGAGATTTCACAAAACTGTTCATGAAAATCGGGTGGATGAAGAAGCTGGACACAGAGATGGAGCAGTACAATTACAGGTCGTTTCCGACGGAGTTCTTGTGGGAGCT
GACGGAGGAGACGCCGGAGGGGCATCTGCCATTGACGAACGCTCTGAGAGGGACGCAGCTGATGAGCAACATTCTGAGCCATCCGGCGTTCGACGCCATGGGAGAGGAGG
AAGCTGATGAGGAGAACGACAATGGAGGATTGAAGAAGGGGTTGAAATCGATGAGTGAGAGAATATTCAAAAGGGATTACAGCTTTTGA
Protein sequenceShow/hide protein sequence
MGSATDEQFAAFEEKVKRTVYVDNLSPQVTEPVLRTALDQFGTVVSIQFIPNYMEPLNSSQCALVEMKNSKEAKSVISVIAQFPFMMSGMPRPVRARPAEAEMFDDRPIK
PGRKISFVWLEQDDPDFEVAKKMKCLTRKHAAEAAFLLKQQMLEEEKLAKQQQEALKGNYKKYEIVESVMIDGTARSHHSLLHRHQPEISPTALSSSLFLKPATPIATVA
SSLSSTAAAAATSFSLSAATSSSTPDTSSTVSFDELQLHLSTRNFRQADEETRRLLIALAGEAAQKRGYVYFSEVQFISADDLRAIDDLWQEHSDGKFGYSVQKRIFEKV
NRDFTKLFMKIGWMKKLDTEMEQYNYRSFPTEFLWELTEETPEGHLPLTNALRGTQLMSNILSHPAFDAMGEEEADEENDNGGLKKGLKSMSERIFKRDYSF