; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC03G052000 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC03G052000
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionTHO complex subunit 4A
Genome locationCiama_Chr03:2286870..2289779
RNA-Seq ExpressionCaUC03G052000
SyntenyCaUC03G052000
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583468.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. sororia]5.7e-11692.59Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSH+MFVDHGAAYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NI+TPA+PAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_008457549.1 PREDICTED: THO complex subunit 4A [Cucumis melo]5.7e-11691.02Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFR RGGASSGPGPSRRFRNRGLNR  PYST+KAPETAWSH+MFVDHGAAYPS PPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEI+FSR +DALAAIKRYNNVQLDGKPMKLEIVG+NI+TPAVPA +N SFGN NGFPRGGR +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_022964653.1 THO complex subunit 4B-like [Cucurbita moschata]9.7e-11692.59Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSH+MFVDHG+AYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NI+TPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_023520096.1 THO complex subunit 4A-like [Cucurbita pepo subsp. pepo]9.7e-11692.59Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSH+MFVDHGAAYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NI+TPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  GS RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_038895300.1 THO complex subunit 4A-like [Benincasa hispida]4.1e-12295.47Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYS AKAPETAWSH+MFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVG+NI+TPAVPAS+N  FGNPNGFPRGGR LGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

TrEMBL top hitse value%identityAlignment
A0A0A0LUQ3 RRM domain-containing protein1.4e-11589.6Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFR RGGASSGPGPSRRFRNRGLNR  PYST+KAPETAWSH+MFVDHGAAYPS PPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGD+KRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGKPMKLEIVG+NI+TPAVPA +N SFGNPNGFPRGGR +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRG GRGR      GSGSGRG GEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A1S3C5R6 THO complex subunit 4A2.7e-11691.02Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFR RGGASSGPGPSRRFRNRGLNR  PYST+KAPETAWSH+MFVDHGAAYPS PPRASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEI+FSR +DALAAIKRYNNVQLDGKPMKLEIVG+NI+TPAVPA +N SFGN NGFPRGGR +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1G9Y5 THO complex subunit 4A-like isoform X21.5e-10986.83Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        M +PLD SLDDIIK NKK GSSNFRGRGGASSGP PSRRF NRGLNR APYS AKAPET WSH++FVDHG AYPS P RAS IETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQ+DALAAIKRYNNVQLDGKPMKLEIVG+NI+TP +PAS+NP+FGN +GF RGGR LGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  G+GRG GEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1HLI4 THO complex subunit 4B-like4.7e-11692.59Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSH+MFVDHG+AYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NI+TPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1I3N4 THO complex subunit 4A-like1.0e-11592.59Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIK NKK GSSNFRGRGGASSGPGPSRRFRNRGLNR APYSTA+APETAWSH+MFVDHGAAYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQ+DALAAIKRYNNVQLDGK MKLEIVG+NI+TPAVPAS N +FGNPNGF RGG VLGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRG GS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

SwissProt top hitse value%identityAlignment
B5FXN8 THO complex subunit 43.3e-4245.95Show/hide
Query:  MAEPLDMSLDDIIKKNKKP-----GSSNFRGRGGASSGPGPSR-----------RFRNR-------GLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQP
        MA+ +DMSLDDIIK N+       G    RGRGG + G GP R             RNR       G NRPAPYS  K     W H++F D G       
Subjt:  MAEPLDMSLDDIIKKNKKP-----GSSNFRGRGGASSGPGPSR-----------RFRNR-------GLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQP

Query:  PRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNP
           + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA++ F R++DAL A+K+YN V LDG+PM +++V S I T   PA +  
Subjt:  PRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNP

Query:  SFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
           N  G  R   VLG   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  SFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

Q3T0I4 THO complex subunit 49.9e-3943.51Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----GSSNFRGRGGASSGPGPSRR-----------FRNR-----------GLNRPAPYSTAKAPETAWSHEMFVDHGAAYP
        MA+ +DMSLDDIIK N+      G    RGR G+  G G   +            RNR           G NRPAPYS  K     W H++F D G    
Subjt:  MAEPLDMSLDDIIKKNKKP----GSSNFRGRGGASSGPGPSRR-----------FRNR-----------GLNRPAPYSTAKAPETAWSHEMFVDHGAAYP

Query:  SQPPRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPAS
              + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA++ F R++DAL A+K+YN V LDG+PM +++V S I T   PA 
Subjt:  SQPPRASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPAS

Query:  TNPSFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
        +     N  G  R     G   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  TNPSFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

Q6NQ72 THO complex subunit 4D4.5e-3941.38Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGS-------SNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAK----APETAWSHEMFVDHGAAYPSQPPRASAIETGTKL
        M+  L+M+LD+I+K+ K   S          RGRGG   G GP+RR       RP+ ++  K         W   +F D   A       AS +E GT+L
Subjt:  MAEPLDMSLDDIIKKNKKPGS-------SNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAK----APETAWSHEMFVDHGAAYPSQPPRASAIETGTKL

Query:  YVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPA-VPASTNPSFGNPNG----
        +V+NLD GV+NEDI+ELFSE+G+++RY+I+YDK+GR  GTAE+V+ R+SDA  A+K+YNNV LDG+PM+LEI+G N  + A +    N +    NG    
Subjt:  YVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPA-VPASTNPSFGNPNG----

Query:  ---FPRGGRVLGRNRGGGRGRGP----------------------------GRGGRGRGSGSGRGRGEK---LSAEDLDADLEKYHEEAM
             +GG   GR RGG  GRGP                            GRG  GRG G GRG G+K    SA DLD DLE YH +AM
Subjt:  ---FPRGGRVLGRNRGGGRGRGP----------------------------GRGGRGRGSGSGRGRGEK---LSAEDLDADLEKYHEEAM

Q8L719 THO complex subunit 4B8.4e-6254.98Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHEMFVDH---GAAYPSQPPR----
        M+  LDMSLDDIIK N+KP          G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +++F       AA+          
Subjt:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHEMFVDH---GAAYPSQPPR----

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVP-------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAE+VFSR+ DALAA+KRYNNVQLDGK MK+EIVG+N+  PA+P       
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVP-------

Query:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                 + N +F     GN NG  RG   G  +GR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

Q8L773 THO complex subunit 4A8.6e-6761.69Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +AKAPE+ W H+MF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP-RGGRVL-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP-RGGRVL-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein5.9e-6354.98Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHEMFVDH---GAAYPSQPPR----
        M+  LDMSLDDIIK N+KP          G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +++F       AA+          
Subjt:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHEMFVDH---GAAYPSQPPR----

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVP-------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAE+VFSR+ DALAA+KRYNNVQLDGK MK+EIVG+N+  PA+P       
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVP-------

Query:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                 + N +F     GN NG  RG   G  +GR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  --------ASTNPSF-----GNPNGFPRG---GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G02530.2 RNA-binding (RRM/RBD/RNP motifs) family protein3.5e-6355.36Show/hide
Query:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHEMFVDH---GAAYPSQPPR----
        M+  LDMSLDDIIK N+KP          G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +++F       AA+          
Subjt:  MAEPLDMSLDDIIKKNKKP----------GSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYS----TAKAPETAWSHEMFVDH---GAAYPSQPPR----

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVP-------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAE+VFSR+ DALAA+KRYNNVQLDGK MK+EIVG+N+  PA+P       
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVP-------

Query:  --------ASTNPSF-----GNPNGFPRG-GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                 + N +F     GN NG  RG G  +GR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  --------ASTNPSF-----GNPNGFPRG-GRVLGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein6.1e-6861.69Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +AKAPE+ W H+MF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP-RGGRVL-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP-RGGRVL-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein2.2e-6560.89Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +  APE+ W H+MF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP-RGGRVL-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP-RGGRVL-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein8.0e-6861.45Show/hide
Query:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I KN+K        RG G+ SGPGP+RR   NR   R APY +AKAPE+ W H+MF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKKNKKPGSSNFRGRG-GASSGPGPSRRFR-NRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP--RGGRVL-GR
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAE+V+SR+ DALAA+K+YN+VQLDGKPMK+EIVG+N+ T A P S  P+ GN NG P  RGG+   G+
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFP--RGGRVL-GR

Query:  NRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  NRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGCCTCTCGACATGAGCTTGGATGATATCATCAAGAAGAACAAGAAACCCGGATCTTCAAACTTCAGAGGTCGTGGCGGAGCTTCTTCTGGACCAGGTCCTTC
TCGCCGCTTTCGCAATCGCGGTCTTAATAGACCAGCGCCCTATTCTACTGCCAAGGCGCCCGAGACGGCTTGGTCACACGAAATGTTTGTAGATCACGGTGCGGCATATC
CTTCACAGCCTCCACGGGCCTCTGCTATTGAAACTGGCACCAAGCTTTATGTTTCTAATTTGGATTATGGTGTCTCCAACGAGGACATCAAGGAACTCTTTTCTGAAGTT
GGTGATCTCAAACGATATTCTATCAATTATGATAAAAGTGGGAGATCAAAGGGAACAGCAGAAATTGTTTTTTCACGACAATCAGATGCTCTTGCTGCTATAAAGAGATA
TAACAATGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATCGTGGGATCTAACATCTTGACACCAGCTGTGCCTGCATCTACAAATCCCAGTTTTGGGAATCCAAATG
GATTTCCGAGAGGTGGACGTGTACTGGGTCGAAACCGGGGTGGTGGACGAGGACGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGGAGTGGGAGTGGCAGAGGTCGTGGA
GAGAAGTTATCAGCTGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCGATGCAGATCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAGCCTCTCGACATGAGCTTGGATGATATCATCAAGAAGAACAAGAAACCCGGATCTTCAAACTTCAGAGGTCGTGGCGGAGCTTCTTCTGGACCAGGTCCTTC
TCGCCGCTTTCGCAATCGCGGTCTTAATAGACCAGCGCCCTATTCTACTGCCAAGGCGCCCGAGACGGCTTGGTCACACGAAATGTTTGTAGATCACGGTGCGGCATATC
CTTCACAGCCTCCACGGGCCTCTGCTATTGAAACTGGCACCAAGCTTTATGTTTCTAATTTGGATTATGGTGTCTCCAACGAGGACATCAAGGAACTCTTTTCTGAAGTT
GGTGATCTCAAACGATATTCTATCAATTATGATAAAAGTGGGAGATCAAAGGGAACAGCAGAAATTGTTTTTTCACGACAATCAGATGCTCTTGCTGCTATAAAGAGATA
TAACAATGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATCGTGGGATCTAACATCTTGACACCAGCTGTGCCTGCATCTACAAATCCCAGTTTTGGGAATCCAAATG
GATTTCCGAGAGGTGGACGTGTACTGGGTCGAAACCGGGGTGGTGGACGAGGACGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGGAGTGGGAGTGGCAGAGGTCGTGGA
GAGAAGTTATCAGCTGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCGATGCAGATCAATTAA
Protein sequenceShow/hide protein sequence
MAEPLDMSLDDIIKKNKKPGSSNFRGRGGASSGPGPSRRFRNRGLNRPAPYSTAKAPETAWSHEMFVDHGAAYPSQPPRASAIETGTKLYVSNLDYGVSNEDIKELFSEV
GDLKRYSINYDKSGRSKGTAEIVFSRQSDALAAIKRYNNVQLDGKPMKLEIVGSNILTPAVPASTNPSFGNPNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGSGSGRGRG
EKLSAEDLDADLEKYHEEAMQIN