; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC03G044290 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC03G044290
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionTHO complex subunit 4A
Genome locationCicolChr03:2119931..2123758
RNA-Seq ExpressionCcUC03G044290
SyntenyCcUC03G044290
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583468.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. sororia]8.7e-11793.42Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSHDMFVDHGAAYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NIVTPA+PAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

KAG7019223.1 THO complex subunit 4A [Cucurbita argyrosperma subsp. argyrosperma]1.9e-11693.42Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSHDMFVDHGAAYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NIVTPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAM+IN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_008457549.1 PREDICTED: THO complex subunit 4A [Cucumis melo]8.7e-11791.84Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFR RGGASSGPGPSRRFRNRGLNR  PYST+KAPETAWSHDMFVDHGAAYPS PPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEI+FSR +DALAAIKRYNNVQLDGKPMKLEIVG+NIVTPAVPA +N SFGN NGFPRGGR +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_022964653.1 THO complex subunit 4B-like [Cucurbita moschata]1.5e-11693.42Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSHDMFVDHG+AYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NIVTPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_038895300.1 THO complex subunit 4A-like [Benincasa hispida]6.3e-12396.3Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYS AKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVG+NIVTPAVPAS+N  FGNPNGFPRGGR LGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

TrEMBL top hitse value%identityAlignment
A0A0A0LUQ3 RRM domain-containing protein2.1e-11690.4Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFR RGGASSGPGPSRRFRNRGLNR  PYST+KAPETAWSHDMFVDHGAAYPS PPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGD+KRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGKPMKLEIVG+NIVTPAVPA +N SFGNPNGFPRGGR +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRG GRGR      GSGSGRG GEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A1S3C5R6 THO complex subunit 4A4.2e-11791.84Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFR RGGASSGPGPSRRFRNRGLNR  PYST+KAPETAWSHDMFVDHGAAYPS PPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEI+FSR +DALAAIKRYNNVQLDGKPMKLEIVG+NIVTPAVPA +N SFGN NGFPRGGR +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1G9Y5 THO complex subunit 4A-like isoform X22.3e-11087.65Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        M +PLD SLDDIIK NKK GSSNFRGRGGASSGP PSRRF NRGLNR APYS AKAPET WSHD+FVDHG AYPS P RAS IETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQ+DALAAIKRYNNVQLDGKPMKLEIVG+NIVTP +PAS+NP+FGN +GF RGGR LGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  G+GRG GEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1HLI4 THO complex subunit 4B-like7.2e-11793.42Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSHDMFVDHG+AYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NIVTPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1I3N4 THO complex subunit 4A-like1.6e-11693.42Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSHDMFVDHGAAYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NIVTPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRG GS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

SwissProt top hitse value%identityAlignment
B5FXN8 THO complex subunit 41.1e-4246.33Show/hide
Query:  MAEPLDMSLDDIIKKNKKP-----GSSNFRGRGGASSGPGPSR-----------RFRNR-------GLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQP
        MA+ +DMSLDDIIK N+       G    RGRGG + G GP R             RNR       G NRPAPYS  K     W HD+F D G       
Subjt:  MAEPLDMSLDDIIKKNKKP-----GSSNFRGRGGASSGPGPSR-----------RFRNR-------GLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQP

Query:  PRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNP
           + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA++ F R++DAL A+K+YN V LDG+PM +++V S I T   PA +  
Subjt:  PRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNP

Query:  SFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
           N  G  R   VLG   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  SFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

Q3T0I4 THO complex subunit 43.4e-3943.89Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----GSSNFRGRGGASSGPGPSRR-----------FRNR-----------GLNRPAPYSTAKAPETAWSHDMFVDHGAAYP
        MA+ +DMSLDDIIK N+      G    RGR G+  G G   +            RNR           G NRPAPYS  K     W HD+F D G    
Subjt:  MAEPLDMSLDDIIKKNKKP----GSSNFRGRGGASSGPGPSRR-----------FRNR-----------GLNRPAPYSTAKAPETAWSHDMFVDHGAAYP

Query:  SQPPRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPAS
              + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA++ F R++DAL A+K+YN V LDG+PM +++V S I T   PA 
Subjt:  SQPPRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPAS

Query:  TNPSFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
        +     N  G  R     G   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  TNPSFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

Q6NQ72 THO complex subunit 4D3.4e-3941.38Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGS-------SNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAK----APETAWSHDMFVDHGAAYPSQPPRASAIETGTKL
        M+  L+M+LD+I+K+ K   S          RGRGG   G GP+RR       RP+ ++  K         W   +F D   A       AS +E GT+L
Subjt:  MAEPLDMSLDDIIKKNKKPGS-------SNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAK----APETAWSHDMFVDHGAAYPSQPPRASAIETGTKL

Query:  YVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPA-VPASTNPSFGNPNG----
        +V+NLD GV+NEDI+ELFSE+G+++RY+I+YDK+GR  GTAE+V+ R+SDA  A+K+YNNV LDG+PM+LEI+G N  + A +    N +    NG    
Subjt:  YVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPA-VPASTNPSFGNPNG----

Query:  ---FPRGGRVLGRNRGGGRGRGP----------------------------GRGGRGRGSGSGRGRGEK---LSAEDLDADLEKYHEEAM
             +GG   GR RGG  GRGP                            GRG  GRG G GRG G+K    SA DLD DLE YH +AM
Subjt:  ---FPRGGRVLGRNRGGGRGRGP----------------------------GRGGRGRGSGSGRGRGEK---LSAEDLDADLEKYHEEAM

Q8L719 THO complex subunit 4B2.9e-6255.33Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHDMFVDH---GAAYPSQPPR----
        M+  LDMSLDDIIK N+KP          G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +D+F       AA+          
Subjt:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHDMFVDH---GAAYPSQPPR----

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVP-------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAE+VFSR+ DALAA+KRYNNVQLDGK MK+EIVG+N+  PA+P       
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVP-------

Query:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                 + N +F     GN NG  RG   G  +GR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

Q8L773 THO complex subunit 4A3.0e-6762.1Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +AKAPE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP-RGGRVL-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP-RGGRVL-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.0e-6355.33Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHDMFVDH---GAAYPSQPPR----
        M+  LDMSLDDIIK N+KP          G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +D+F       AA+          
Subjt:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHDMFVDH---GAAYPSQPPR----

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVP-------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAE+VFSR+ DALAA+KRYNNVQLDGK MK+EIVG+N+  PA+P       
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVP-------

Query:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                 + N +F     GN NG  RG   G  +GR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G02530.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.2e-6355.71Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHDMFVDH---GAAYPSQPPR----
        M+  LDMSLDDIIK N+KP          G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +D+F       AA+          
Subjt:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHDMFVDH---GAAYPSQPPR----

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVP-------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAE+VFSR+ DALAA+KRYNNVQLDGK MK+EIVG+N+  PA+P       
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVP-------

Query:  --------ASTNPSF-----GNPNGFPRG-GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                 + N +F     GN NG  RG G  +GR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  --------ASTNPSF-----GNPNGFPRG-GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.1e-6862.1Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +AKAPE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP-RGGRVL-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP-RGGRVL-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein7.5e-6661.29Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +  APE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP-RGGRVL-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP-RGGRVL-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein2.8e-6861.85Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +AKAPE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP--RGGRVL-GR
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P  RGG+   G+
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFP--RGGRVL-GR

Query:  NRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  NRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGCCTCTCGACATGAGCTTAGATGATATCATCAAGAAGAACAAGAAACCCGGATCTTCAAACTTCAGAGGTCGTGGCGGAGCTTCTTCTGGACCAGGTCCTTC
TCGCCGCTTTCGCAATCGCGGTCTTAATAGACCAGCGCCCTATTCTACTGCCAAGGCGCCCGAGACGGCTTGGTCACACGACATGTTTGTAGATCACGGTGCGGCATATC
CTTCACAGCCTCCACGGGCCTCTGCTATTGAAACTGGCACCAAGCTTTATGTTTCTAATTTGGATTATGGTGTCTCCAACGAGGACATCAAGGAACTGTTTTCTGAAGTC
GGTGATCTCAAACGATATTCTATCAATTATGATAAAAGTGGGAGATCAAAGGGAACAGCAGAAATTGTTTTTTCACGACAATCAGATGCTCTTGCTGCTATAAAGAGATA
TAACAATGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATCGTGGGATCGAACATCGTGACACCAGCTGTGCCTGCATCTACAAATCCCAGTTTTGGGAATCCAAATG
GATTTCCGAGAGGTGGACGTGTACTGGGTCGAAACCGGGGTGGTGGACGAGGACGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGGAGTGGGAGTGGCAGAGGTCGTGGA
GAGAAGTTATCAGCCGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCGATGCAGATCAATTAA
mRNA sequenceShow/hide mRNA sequence
AGAAAAGAAAAAAGAGAGACCGTATTTTGATACTTGAAGTTCTTCGCAATTGGTCTCTTCCCCGGCTGCTCACAACATCACGTCGTTTCGTCGGAACCCTAGAAATTGTC
TGTCCTCTCTGAATCTCTGATCCACCACCAATGGCAGAGCCTCTCGACATGAGCTTAGATGATATCATCAAGAAGAACAAGAAACCCGGATCTTCAAACTTCAGAGGTCG
TGGCGGAGCTTCTTCTGGACCAGGTCCTTCTCGCCGCTTTCGCAATCGCGGTCTTAATAGACCAGCGCCCTATTCTACTGCCAAGGCGCCCGAGACGGCTTGGTCACACG
ACATGTTTGTAGATCACGGTGCGGCATATCCTTCACAGCCTCCACGGGCCTCTGCTATTGAAACTGGCACCAAGCTTTATGTTTCTAATTTGGATTATGGTGTCTCCAAC
GAGGACATCAAGGAACTGTTTTCTGAAGTCGGTGATCTCAAACGATATTCTATCAATTATGATAAAAGTGGGAGATCAAAGGGAACAGCAGAAATTGTTTTTTCACGACA
ATCAGATGCTCTTGCTGCTATAAAGAGATATAACAATGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATCGTGGGATCGAACATCGTGACACCAGCTGTGCCTGCAT
CTACAAATCCCAGTTTTGGGAATCCAAATGGATTTCCGAGAGGTGGACGTGTACTGGGTCGAAACCGGGGTGGTGGACGAGGACGTGGTCCTGGAAGAGGAGGGCGTGGA
CGTGGGAGTGGGAGTGGCAGAGGTCGTGGAGAGAAGTTATCAGCCGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCGATGCAGATCAATTAAATCATTTG
GTGTCATTTTCTGGGGGCTTGATATCCCTAACTTCATTAGGTGCTTGATGATAAACGTGATAAGGACACGGTTTTCTATTTTGTGAGTCATATCATGAAGCCCTGACCTT
GAGAGAACTGCTAGACCTTGCTAGGATGTGGAGTTTGGGCTTGAATTGTTGTTTGTTGCAGTAAACTATAAAGAGGTGTTCATATATGCTTGAATTTGTAATCTCCCGTT
ACTTATCTATTTTAATCTGCTCCAATCCTATTTATTTTCCATGAAAGTTGCACTGGAACTGGGAAAGTCGCTCACACTCCTTGTCTACATGGAAGGATTTGGGTGCATGA
CATGGGAAGTATTGGTCCTAACAAAATCAACCACATCTTCTGGTGTTGAGGCCAAAGTTCGGAATTTCTACTTTATGGCATTGCAGCGATTGGTTTTATGAATTAAAAGT
AATGTGTTAAGATCTACTTGTTGGCATGGAGAGGGAGGGAGGGGTGTGGAGATGATTACAAACTGTTCAGGAGTTGCCATTGACTTTTGAGGCCCCTGAACCATAGAGTT
AGAATTGGTATGCAATTAGAAATCCATGGTGCTTTTACCTTTGTTTCACTAGTGTGTATGGCTTTTACGTTCCTGTGGTTGTAGTATGACTACTATAGGATGCAGTCTTG
TTTCTTCTGCTAATTTTGTTTGAGATTATTCCAGA
Protein sequenceShow/hide protein sequence
MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHDMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSNEDIKELFSEV
GDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNIVTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRG
EKLSAEDLDADLEKYHEEAMQIN