CuGenDBv2

HG10008790 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10008790
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: exosome complex component RRP4 homolog
Genome location: Chr10:26083443..26092052
RNA-Seq Expression: HG10008790
Synteny: HG10008790
Gene Ontology terms: GO:0000178 - exosome (RNase complex) (cellular component)
    GO:0003723 - RNA binding (molecular function)
InterPro domains: IPR004088 - K Homology domain, type 1
    IPR026699 - Exosome complex RNA-binding protein 1/RRP40/RRP4
    IPR036612 - K Homology domain, type 1 superfamily
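
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the string can be split into its parts and a span length computed. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, and the length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'Chr:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'Chr10:26083443..26092052'."""
    match = re.fullmatch(r"\s*(\S+?):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of input order.
        start, end = end, start
    return GenomeLocation(chrom, start, end)


loc = parse_location("Chr10:26083443..26092052")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 8610 bp.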


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057746.1 ABC transporter F family member 4-like [Cucumis melo var. makuwa] | e value: 5.6e-110 | % identity: 77.32
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS
        MGCGNSKLNPEGEL+PPRIRPLL+R+KF+ELRKRKNGT+LRDG LSKKVLLK+GES EEN+M VDNR     + C + +  T      +EH+SAS IP S
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS

Query:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV
        N   NATK+GEQSN +LEE P  ++QLKT HPD+TP LELKQDKTM++HKCIQ+GD+ NKKEGE+GR DNEENRG  ICPGSPSFR YFVEETQDDKE V
Subjt:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV

Query:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
        EMKDA GM DVSHKKSPSHDSVES+TSAKS EGQENKV+KKGKKGTTFNRVISK+RPV VGVKNLLNVKSCYHLSCSGNDRA+LLARKAEA
Subjt:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA

KAE8652373.1 hypothetical protein Csa_013810 [Cucumis sativus] | e value: 9.1e-76 | % identity: 61.72
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGSTN-------EHDSASAIPL
        MGCGNSKLNP GEL+PPRIRPL +R+K +ELRKRKNGT+LRDG LSKKVLLKDGES EEN+M VDNR     + C + +  TN       EH+SAS IP 
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGSTN-------EHDSASAIPL

Query:  SNN---ATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDDNKKEGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVE
        SNN   ATK GEQSNH+LEE P  ++QLK  HPD+TP LELKQDKTM++HK                                                E
Subjt:  SNN---ATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDDNKKEGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVE

Query:  MKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
        MKDA GM DVSHKKSPS DSVES+TSAK  EGQENK +KKGKK TTFNRV+SKKRPV VGVKNLLNVKSCYHLSCSGNDRA+LLARKAEA
Subjt:  MKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA

XP_008464387.1 PREDICTED: uncharacterized protein LOC103502290 [Cucumis melo] | e value: 5.6e-110 | % identity: 77.32
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS
        MGCGNSKLNPEGEL+PPRIRPLL+R+KF+ELRKRKNGT+LRDG LSKKVLLK+GES EEN+M VDNR     + C + +  T      +EH+SAS IP S
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS

Query:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV
        N   NATK+GEQSN +LEE P  ++QLKT HPD+TP LELKQDKTM++HKCIQ+GD+ NKKEGE+GR DNEENRG  ICPGSPSFR YFVEETQDDKE V
Subjt:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV

Query:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
        EMKDA GM DVSHKKSPSHDSVES+TSAKS EGQENKV+KKGKKGTTFNRVISK+RPV VGVKNLLNVKSCYHLSCSGNDRA+LLARKAEA
Subjt:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA

XP_011649820.1 probable DNA-directed RNA polymerase I subunit RPA43 isoform X1 [Cucumis sativus] | e value: 1.3e-106 | % identity: 75.68
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGSTN-------EHDSASAIPL
        MGCGNSKLNP GEL+PPRIRPL +R+K +ELRKRKNGT+LRDG LSKKVLLKDGES EEN+M VDNR     + C + +  TN       EH+SAS IP 
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGSTN-------EHDSASAIPL

Query:  SNN---ATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKEN
        SNN   ATK GEQSNH+LEE P  ++QLK  HPD+TP LELKQDKTM++HKCIQ+GD+ NKKEGE+GR DNEENRG FICPGSPSFR+YFVEETQDDKE 
Subjt:  SNN---ATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKEN

Query:  VEMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
        VEMKDA GM DVSHKKSPS DSVES+TSAK  EGQENK +KKGKK TTFNRV+SKKRPV VGVKNLLNVKSCYHLSCSGNDRA+LLARKAEA
Subjt:  VEMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA

XP_022921428.1 uncharacterized protein LOC111429712 [Cucurbita moschata] | e value: 3.0e-71 | % identity: 59.66
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGE-SEENSML-VDNRKSMCCSHEGST-----------NEHDSASAIP
        MGCGNSKL PEGE I P IRPLL R+KF E RKRKNGT+LR+  LSKKVLLK+GE  EENS+L V NR S+  SH+G T           +EHDS +A  
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGE-SEENSML-VDNRKSMCCSHEGST-----------NEHDSASAIP

Query:  LSNNATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHK-CIQQGDD-NKKEGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVE
                    NHLLE+             D    +ELK++KTM++ +  +++GDD NK+EGEEGR DNE+NR  ICPGSPSFRVYFVE+T +DK+NVE
Subjt:  LSNNATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHK-CIQQGDD-NKKEGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVE

Query:  MKDAG-MEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CYHLSCSGNDRASLLARKAE
        M D G MED S KKSPS DSVESS+S KS EGQENK +KKGKKGTT NR  S++RPVGVG+K+LLNV + CYHLSC+GNDR + LARKAE
Subjt:  MKDAG-MEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CYHLSCSGNDRASLLARKAE
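
In the alignment blocks above, the unlabeled middle row follows the usual pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, these columns can be tallied for one query/midline pair. The function name `midline_stats` is hypothetical, and the figures it yields describe only the excerpt passed in, not the full-alignment % identity listed for each hit.

```python
def midline_stats(query_row: str, midline_row: str) -> tuple:
    """Count identities and positives in one alignment row pair.

    Both arguments are the sequence portions only (without the
    'Query:' label). Returns (identities, positives, columns).
    """
    # Pad the midline in case trailing spaces were stripped.
    midline = midline_row.ljust(len(query_row))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query_row)


# First 28 columns of the first GenBank hit shown above.
query = "MGCGNSKLNPEGELIPPRIRPLLLRSKF"
midline = "MGCGNSKLNPEGEL+PPRIRPLL+R+KF"
identities, positives, columns = midline_stats(query, midline)
print(identities, positives, columns)
print(f"{100 * identities / columns:.2f}% identity over this excerpt")
```

Summing the identity and column counts over every row of a hit, then dividing, would give an estimate comparable to the % identity column, assuming the page computes identity over the full alignment length.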

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LNT2 Uncharacterized protein | e value: 6.3e-107 | % identity: 75.68
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGSTN-------EHDSASAIPL
        MGCGNSKLNP GEL+PPRIRPL +R+K +ELRKRKNGT+LRDG LSKKVLLKDGES EEN+M VDNR     + C + +  TN       EH+SAS IP 
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGSTN-------EHDSASAIPL

Query:  SNN---ATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKEN
        SNN   ATK GEQSNH+LEE P  ++QLK  HPD+TP LELKQDKTM++HKCIQ+GD+ NKKEGE+GR DNEENRG FICPGSPSFR+YFVEETQDDKE 
Subjt:  SNN---ATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKEN

Query:  VEMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
        VEMKDA GM DVSHKKSPS DSVES+TSAK  EGQENK +KKGKK TTFNRV+SKKRPV VGVKNLLNVKSCYHLSCSGNDRA+LLARKAEA
Subjt:  VEMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA

A0A1S3CLT6 uncharacterized protein LOC103502290 | e value: 2.7e-110 | % identity: 77.32
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS
        MGCGNSKLNPEGEL+PPRIRPLL+R+KF+ELRKRKNGT+LRDG LSKKVLLK+GES EEN+M VDNR     + C + +  T      +EH+SAS IP S
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS

Query:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV
        N   NATK+GEQSN +LEE P  ++QLKT HPD+TP LELKQDKTM++HKCIQ+GD+ NKKEGE+GR DNEENRG  ICPGSPSFR YFVEETQDDKE V
Subjt:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV

Query:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
        EMKDA GM DVSHKKSPSHDSVES+TSAKS EGQENKV+KKGKKGTTFNRVISK+RPV VGVKNLLNVKSCYHLSCSGNDRA+LLARKAEA
Subjt:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA

A0A5D3BHA4 ABC transporter F family member 4-like | e value: 2.7e-110 | % identity: 77.32
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS
        MGCGNSKLNPEGEL+PPRIRPLL+R+KF+ELRKRKNGT+LRDG LSKKVLLK+GES EEN+M VDNR     + C + +  T      +EH+SAS IP S
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGES-EENSMLVDNR----KSMCCSHEGST------NEHDSASAIPLS

Query:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV
        N   NATK+GEQSN +LEE P  ++QLKT HPD+TP LELKQDKTM++HKCIQ+GD+ NKKEGE+GR DNEENRG  ICPGSPSFR YFVEETQDDKE V
Subjt:  N---NATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDD-NKKEGEEGRADNEENRG-FICPGSPSFRVYFVEETQDDKENV

Query:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
        EMKDA GM DVSHKKSPSHDSVES+TSAKS EGQENKV+KKGKKGTTFNRVISK+RPV VGVKNLLNVKSCYHLSCSGNDRA+LLARKAEA
Subjt:  EMKDA-GMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA

A0A6J1E3W4 uncharacterized protein LOC111429712 | e value: 1.5e-71 | % identity: 59.66
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGE-SEENSML-VDNRKSMCCSHEGST-----------NEHDSASAIP
        MGCGNSKL PEGE I P IRPLL R+KF E RKRKNGT+LR+  LSKKVLLK+GE  EENS+L V NR S+  SH+G T           +EHDS +A  
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGE-SEENSML-VDNRKSMCCSHEGST-----------NEHDSASAIP

Query:  LSNNATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHK-CIQQGDD-NKKEGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVE
                    NHLLE+             D    +ELK++KTM++ +  +++GDD NK+EGEEGR DNE+NR  ICPGSPSFRVYFVE+T +DK+NVE
Subjt:  LSNNATKDGEQSNHLLEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHK-CIQQGDD-NKKEGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVE

Query:  MKDAG-MEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CYHLSCSGNDRASLLARKAE
        M D G MED S KKSPS DSVESS+S KS EGQENK +KKGKKGTT NR  S++RPVGVG+K+LLNV + CYHLSC+GNDR + LARKAE
Subjt:  MKDAG-MEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CYHLSCSGNDRASLLARKAE

A0A6J1JK15 uncharacterized protein LOC111485806 isoform X2 | e value: 3.7e-67 | % identity: 58.87
Query:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGE-SEENSML-VDNRKSMCCSHEGSTNEHDSASAIPLSNNATKDGEQ
        MGCGNSKL PEGE I P IRPLL R+KF E RKRKNGT+LRD  LSKKVLL +GE  EENS+L V NR  +  SH+G T                   +Q
Subjt:  MGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGE-SEENSML-VDNRKSMCCSHEGSTNEHDSASAIPLSNNATKDGEQ

Query:  SNHLLE-EVPLTNVQLKTIHP-DETPDLELKQDKTMDD-HKCIQQGDDNKK-EGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVEMKDAG-MED
         NH  E + P  N  L+   P D    ++LK+DKTM++ HK +++GDD  K EGEEGR DNE+NR  ICPGSPSFRVYFVE+T ++K+NVEM D G MED
Subjt:  SNHLLE-EVPLTNVQLKTIHP-DETPDLELKQDKTMDD-HKCIQQGDDNKK-EGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVEMKDAG-MED

Query:  VSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CYHLSCSGNDRASLLARKAEA
         S KKSPS DSVES++S KS E QE K +KKGKKGTT NR  S+KRPVGVG+K+LLNV + CYHLSC+GNDR + L  KAE+
Subjt:  VSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CYHLSCSGNDRASLLARKAEA

SwissProt top hits (e value, % identity, alignment)
Q09704 Exosome complex component rrp4 | e value: 1.8e-10 | % identity: 29.76
Query:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGIQ---------------------------------------------LERGQLLTISPYLVKRRKQHFH
        + +V PKRW++DI   Q+AVLMLSS+NLP GIQ                                             L  G  L + P LV R K H +
Subjt:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGIQ---------------------------------------------LERGQLLTISPYLVKRRKQHFH

Query:  HLEQYGIDIILGCNGFIWIGEHVDPTEDMMIEDQVNKSEQKGSKFEGTFGNQMQE-RVYTPLE-TRQNICRTANAIRVL-----SILGFIITVEVIMETR
         L   G+DIIL  NG++W+ +H    E+      + + E++ S  E  + N+  E   YT L  +R +IC    A R L     SI  F  +  V    +
Subjt:  HLEQYGIDIILGCNGFIWIGEHVDPTEDMMIEDQVNKSEQKGSKFEGTFGNQMQE-RVYTPLE-TRQNICRTANAIRVL-----SILGFIITVEVIMETR

Query:  DFRLP
        D  +P
Subjt:  DFRLP

Q13868 Exosome complex component RRP4 | e value: 2.4e-07 | % identity: 32.77
Query:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGI---------------------------------------------QLERGQLLTISPYLVKRRKQHFH
        + +V  KRW+++ +   D+VL+LSSMNLP G                                              +L +G L+ +SP LVKR+K HFH
Subjt:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGI---------------------------------------------QLERGQLLTISPYLVKRRKQHFH

Query:  HLEQYGIDIILGCNGFIWI
         L   G  +ILG NGFIWI
Subjt:  HLEQYGIDIILGCNGFIWI

Q2KID0 Exosome complex component RRP4 | e value: 1.4e-07 | % identity: 31.25
Query:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGI---------------------------------------------QLERGQLLTISPYLVKRRKQHFH
        + +V  KRW+++ +   D+VL+LSSMNLP G                                              +L +G L+ +SP LVKR+K HFH
Subjt:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGI---------------------------------------------QLERGQLLTISPYLVKRRKQHFH

Query:  HLEQYGIDIILGCNGFIWIGEHVDPTED
         L   G  +ILG NGFIW+    +  ED
Subjt:  HLEQYGIDIILGCNGFIWIGEHVDPTED

Q8VBV3 Exosome complex component RRP4 | e value: 1.9e-07 | % identity: 33.07
Query:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGI---------------------------------------------QLERGQLLTISPYLVKRRKQHFH
        + +V  KRW+++ +   D+VL+LSSMNLP G                                              +L +G L+ +SP LVKR+K HFH
Subjt:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGI---------------------------------------------QLERGQLLTISPYLVKRRKQHFH

Query:  HLEQYGIDIILGCNGFIWI---GEHVD
         L   G  +ILG NGFIWI    EH D
Subjt:  HLEQYGIDIILGCNGFIWI---GEHVD

Q9ZVT7 Exosome complex component RRP4 homolog | e value: 1.0e-37 | % identity: 45.83
Query:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGIQ---------------------------------------------LERGQLLTISPYLVKRRKQHFH
        +I+VA KRWR++++++QD VLMLSSMN+PDGIQ                                             LE+GQLL + PYLVKR K HFH
Subjt:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGIQ---------------------------------------------LERGQLLTISPYLVKRRKQHFH

Query:  HLEQYGIDIILGCNGFIWIGEHVDPTEDMMIEDQVNKSEQKGSKFEGTFGNQMQERVYTPLETRQNICRTANAIRVLSILGFIITVEVIMET
        ++E  GID+I+GCNGFIW+GEHV+  + M I+DQ ++ E   S   G      +E+ + PLETRQ ICR  NAIRVLS LGF +T+EVIMET
Subjt:  HLEQYGIDIILGCNGFIWIGEHVDPTEDMMIEDQVNKSEQKGSKFEGTFGNQMQERVYTPLETRQNICRTANAIRVLSILGFIITVEVIMET

Arabidopsis top hits (e value, % identity, alignment)
AT1G03360.1 ribosomal RNA processing 4 | e value: 7.2e-39 | % identity: 45.83
Query:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGIQ---------------------------------------------LERGQLLTISPYLVKRRKQHFH
        +I+VA KRWR++++++QD VLMLSSMN+PDGIQ                                             LE+GQLL + PYLVKR K HFH
Subjt:  LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGIQ---------------------------------------------LERGQLLTISPYLVKRRKQHFH

Query:  HLEQYGIDIILGCNGFIWIGEHVDPTEDMMIEDQVNKSEQKGSKFEGTFGNQMQERVYTPLETRQNICRTANAIRVLSILGFIITVEVIMET
        ++E  GID+I+GCNGFIW+GEHV+  + M I+DQ ++ E   S   G      +E+ + PLETRQ ICR  NAIRVLS LGF +T+EVIMET
Subjt:  HLEQYGIDIILGCNGFIWIGEHVDPTEDMMIEDQVNKSEQKGSKFEGTFGNQMQERVYTPLETRQNICRTANAIRVLSILGFIITVEVIMET

AT5G50830.1 unknown protein | e value: 1.7e-08 | % identity: 29.1
Query:  MGCGNSKLN-------PEGELI--PPRIRPLLLRSKFMELRKRKNGTNLRDG-TLSKKVLLKDGESEENSMLVDNRKSMCCSHE-GSTNEHDSASAIPLS
        MGCG S+L         EG ++  P  IRP LLR +  E++KR +   L+   TLSKK LL+   SE+     +N  S+  S +     +H+      + 
Subjt:  MGCGNSKLN-------PEGELI--PPRIRPLLLRSKFMELRKRKNGTNLRDG-TLSKKVLLKDGESEENSMLVDNRKSMCCSHE-GSTNEHDSASAIPLS

Query:  NNATKDGEQSNHLLEEVPLTNVQLKTIHPD-------ETPDLELKQDKTMDDHKCI-----QQGDDNKK-EGEEGRADNEENRGFICPGSPSFRVYFVE-
               +    + EEV +   +    H D       E  ++  KQ++   D   +     ++GDD K  + +EG  +N + R  I PGSPSFRVY V+ 
Subjt:  NNATKDGEQSNHLLEEVPLTNVQLKTIHPD-------ETPDLELKQDKTMDDHKCI-----QQGDDNKK-EGEEGRADNEENRGFICPGSPSFRVYFVE-

Query:  ETQDDKENVEMKDAGMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CY-HLSCSGNDRASLLARKA
         + DD E  +++DA       +KS   +SV   T+    +G   K  KK ++G  F   + +        K L NV + CY    C GN  + L+  K+
Subjt:  ETQDDKENVEMKDAGMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CY-HLSCSGNDRASLLARKA

AT5G50830.2 unknown protein | e value: 4.1e-10 | % identity: 29.77
Query:  MGCGNSKLN-------PEGELI--PPRIRPLLLRSKFMELRKRKNGTNLRDG-TLSKKVLLKDGESEENSMLVDNRKSMCCSHE-GSTNEHDSASAIPLS
        MGCG S+L         EG ++  P  IRP LLR +  E++KR +   L+   TLSKK LL+   SE+     +N  S+  S +     +H+      + 
Subjt:  MGCGNSKLN-------PEGELI--PPRIRPLLLRSKFMELRKRKNGTNLRDG-TLSKKVLLKDGESEENSMLVDNRKSMCCSHE-GSTNEHDSASAIPLS

Query:  NNATKDGEQSNHLLEEVPLTNVQLKTIHPD-------ETPDLELKQDKTMDDHKCI-----QQGDDNKK-EGEEGRADNEENRGFICPGSPSFRVYFVE-
               +    + EEV +   +    H D       E  ++  KQ++   D   +     ++GDD K  + +EG  +N + R  I PGSPSFRVY V+ 
Subjt:  NNATKDGEQSNHLLEEVPLTNVQLKTIHPD-------ETPDLELKQDKTMDDHKCI-----QQGDDNKK-EGEEGRADNEENRGFICPGSPSFRVYFVE-

Query:  ETQDDKENVEMKDAGMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CY-HLSCSGNDRASLLARKA
         + DD E  +++DA       +KS   +SV  +T  K V+G   K  KK ++G  F   + +        K L NV + CY    C GN  + L+  K+
Subjt:  ETQDDKENVEMKDAGMEDVSHKKSPSHDSVESSTSAKSVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKS-CY-HLSCSGNDRASLLARKA


Sequences
CDS sequence
CTTATCCAGGTTGCTCCAAAACGTTGGAGATTGGATATACATTATAGCCAAGATGCTGTTTTGATGCTTTCTTCGATGAACTTGCCTGATGGTATTCAGCTTGAACGTGG
TCAGCTTCTAACAATTTCACCATATCTAGTAAAGAGACGCAAACAACACTTCCATCATTTGGAGCAGTATGGTATTGACATAATACTTGGATGCAATGGATTTATCTGGA
TTGGGGAGCATGTCGATCCTACAGAGGACATGATGATAGAAGATCAAGTTAATAAATCTGAACAAAAGGGTTCAAAATTTGAAGGAACTTTTGGTAACCAAATGCAAGAA
AGAGTATATACACCATTAGAGACACGACAGAATATATGCAGAACTGCAAATGCTATTCGAGTATTGTCTATCTTAGGTTTTATAATAACAGTAGAAGTCATAATGGAGAC
AAGGGATTTTAGATTACCACCACTTGGCCTGGATGAAGAAGACTCTGGATTTCTGTACCATTATTGGAAGCAGCAGCGGCAGCAGGAGGAAGACGAAGAAAACAGGACTG
TCACAATGCCATATGTCCTATTCTTGTTTCCAAAACAGTTGCGAATTCTCTCTTTCAAATTCTCTGTTTTTGTCTTTAAGCTTTCTTCATCTCATCCTTTTTTTTCTGTG
TCTCTGTTGTGCAACTCAAATGCAATGGGTTGTGGAAACTCCAAGCTTAATCCGGAAGGAGAATTGATTCCTCCCAGGATTCGCCCACTTCTTCTCCGGAGTAAATTTAT
GGAGTTGAGGAAACGTAAGAATGGAACCAATCTTAGAGATGGAACGCTGTCAAAGAAAGTGCTTTTGAAAGATGGAGAATCAGAAGAAAACTCCATGCTTGTTGATAACA
GGAAGAGTATGTGTTGTTCACATGAAGGCAGCACAAATGAACATGATTCAGCTTCAGCTATTCCCCTATCTAACAATGCAACCAAAGATGGGGAACAGAGCAATCATCTC
TTGGAGGAGGTGCCTCTGACAAACGTACAATTGAAAACTATCCACCCTGATGAAACTCCTGATCTTGAACTGAAACAAGACAAAACAATGGATGACCATAAATGTATCCA
ACAAGGAGATGATAACAAGAAAGAAGGAGAAGAGGGGAGGGCTGACAACGAAGAGAATCGCGGCTTCATCTGTCCTGGATCTCCCAGTTTCAGAGTTTATTTTGTTGAAG
AAACACAAGATGACAAAGAAAATGTTGAAATGAAAGATGCAGGTATGGAAGATGTCTCACACAAGAAGTCGCCAAGTCATGACAGCGTTGAGAGCTCCACTAGTGCAAAA
TCTGTCGAGGGCCAGGAGAACAAGGTGATGAAGAAAGGGAAAAAAGGAACCACTTTCAATAGGGTCATCAGTAAAAAAAGACCAGTTGGTGTTGGTGTGAAGAATCTGTT
GAATGTTAAATCTTGCTATCATTTAAGTTGTTCTGGCAATGACAGAGCCAGTCTTCTAGCTAGAAAAGCCGAAGCTTAA
mRNA sequence
CTTATCCAGGTTGCTCCAAAACGTTGGAGATTGGATATACATTATAGCCAAGATGCTGTTTTGATGCTTTCTTCGATGAACTTGCCTGATGGTATTCAGCTTGAACGTGG
TCAGCTTCTAACAATTTCACCATATCTAGTAAAGAGACGCAAACAACACTTCCATCATTTGGAGCAGTATGGTATTGACATAATACTTGGATGCAATGGATTTATCTGGA
TTGGGGAGCATGTCGATCCTACAGAGGACATGATGATAGAAGATCAAGTTAATAAATCTGAACAAAAGGGTTCAAAATTTGAAGGAACTTTTGGTAACCAAATGCAAGAA
AGAGTATATACACCATTAGAGACACGACAGAATATATGCAGAACTGCAAATGCTATTCGAGTATTGTCTATCTTAGGTTTTATAATAACAGTAGAAGTCATAATGGAGAC
AAGGGATTTTAGATTACCACCACTTGGCCTGGATGAAGAAGACTCTGGATTTCTGTACCATTATTGGAAGCAGCAGCGGCAGCAGGAGGAAGACGAAGAAAACAGGACTG
TCACAATGCCATATGTCCTATTCTTGTTTCCAAAACAGTTGCGAATTCTCTCTTTCAAATTCTCTGTTTTTGTCTTTAAGCTTTCTTCATCTCATCCTTTTTTTTCTGTG
TCTCTGTTGTGCAACTCAAATGCAATGGGTTGTGGAAACTCCAAGCTTAATCCGGAAGGAGAATTGATTCCTCCCAGGATTCGCCCACTTCTTCTCCGGAGTAAATTTAT
GGAGTTGAGGAAACGTAAGAATGGAACCAATCTTAGAGATGGAACGCTGTCAAAGAAAGTGCTTTTGAAAGATGGAGAATCAGAAGAAAACTCCATGCTTGTTGATAACA
GGAAGAGTATGTGTTGTTCACATGAAGGCAGCACAAATGAACATGATTCAGCTTCAGCTATTCCCCTATCTAACAATGCAACCAAAGATGGGGAACAGAGCAATCATCTC
TTGGAGGAGGTGCCTCTGACAAACGTACAATTGAAAACTATCCACCCTGATGAAACTCCTGATCTTGAACTGAAACAAGACAAAACAATGGATGACCATAAATGTATCCA
ACAAGGAGATGATAACAAGAAAGAAGGAGAAGAGGGGAGGGCTGACAACGAAGAGAATCGCGGCTTCATCTGTCCTGGATCTCCCAGTTTCAGAGTTTATTTTGTTGAAG
AAACACAAGATGACAAAGAAAATGTTGAAATGAAAGATGCAGGTATGGAAGATGTCTCACACAAGAAGTCGCCAAGTCATGACAGCGTTGAGAGCTCCACTAGTGCAAAA
TCTGTCGAGGGCCAGGAGAACAAGGTGATGAAGAAAGGGAAAAAAGGAACCACTTTCAATAGGGTCATCAGTAAAAAAAGACCAGTTGGTGTTGGTGTGAAGAATCTGTT
GAATGTTAAATCTTGCTATCATTTAAGTTGTTCTGGCAATGACAGAGCCAGTCTTCTAGCTAGAAAAGCCGAAGCTTAA
Protein sequence
LIQVAPKRWRLDIHYSQDAVLMLSSMNLPDGIQLERGQLLTISPYLVKRRKQHFHHLEQYGIDIILGCNGFIWIGEHVDPTEDMMIEDQVNKSEQKGSKFEGTFGNQMQE
RVYTPLETRQNICRTANAIRVLSILGFIITVEVIMETRDFRLPPLGLDEEDSGFLYHYWKQQRQQEEDEENRTVTMPYVLFLFPKQLRILSFKFSVFVFKLSSSHPFFSV
SLLCNSNAMGCGNSKLNPEGELIPPRIRPLLLRSKFMELRKRKNGTNLRDGTLSKKVLLKDGESEENSMLVDNRKSMCCSHEGSTNEHDSASAIPLSNNATKDGEQSNHL
LEEVPLTNVQLKTIHPDETPDLELKQDKTMDDHKCIQQGDDNKKEGEEGRADNEENRGFICPGSPSFRVYFVEETQDDKENVEMKDAGMEDVSHKKSPSHDSVESSTSAK
SVEGQENKVMKKGKKGTTFNRVISKKRPVGVGVKNLLNVKSCYHLSCSGNDRASLLARKAEA
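
The protein sequence above corresponds to a codon-by-codon reading of the CDS. As a minimal sketch, this can be checked with the standard genetic code. The helper name `translate` is hypothetical, unknown or ambiguous codons are rendered as `X` by assumption, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1 up to the first stop."""
    # Drop line breaks and other whitespace, and normalise case.
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X: unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("CTTATCCAGGTTGCTCCAAAACGTTGGAGA"))  # LIQVAPKRWR
print(translate("GCCGAAGCTTAA"))                    # AEA
```

The two outputs match the first ten and last three residues of the protein sequence; passing the full CDS text, line breaks included, translates the whole record the same way.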