; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027567 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027567
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTHO complex subunit 4A
Genome locationscaffold11:3532255..3537734
RNA-Seq ExpressionSpg027567
SyntenySpg027567
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583468.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. sororia]5.4e-10482.82Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+  +                   T  +     +HGAAYPSQPARAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+N NFGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGF RGG  +GRNRGGGRGRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

KAG7019223.1 THO complex subunit 4A [Cucurbita argyrosperma subsp. argyrosperma]2.0e-10382.44Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+  +                   T  +     +HGAAYPSQPARAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+N NFGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGF RGG  +GRNRGGGRGRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAM+IN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_008457549.1 PREDICTED: THO complex subunit 4A [Cucumis melo]1.8e-10482.2Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFR RGGASSGPGPSRRFRNRGLNRA PYS +K  +                   T  +     +HGAAYPS P RAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE++FSR ADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PA SNA+FGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGFPRGGRAMGRNRGGGRGRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_022964653.1 THO complex subunit 4B-like [Cucurbita moschata]1.6e-10382.44Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+  +                   T  +     +HG+AYPSQPARAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+N NFGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGF RGG  +GRNRGGGRGRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_038895300.1 THO complex subunit 4A-like [Benincasa hispida]5.5e-10985.11Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS AK  +                   T  +     +HGAAYPSQP RAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQ+DALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PASSNA FGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGFPRGGRA+GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

TrEMBL top hitse value%identityAlignment
A0A0A0LUQ3 RRM domain-containing protein4.9e-10380.67Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFR RGGASSGPGPSRRFRNRGLNRA PYS +K  +                   T  +     +HGAAYPS P RAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGD+KRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PA SNA+FGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGFPRGGRAMGRNRGGGRGRGPGRG GRGR      GSGSGRG GEKLSAEDLDADL+KYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A1S3C5R6 THO complex subunit 4A8.9e-10582.2Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFR RGGASSGPGPSRRFRNRGLNRA PYS +K  +                   T  +     +HGAAYPS P RAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE++FSR ADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PA SNA+FGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGFPRGGRAMGRNRGGGRGRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1G9Y5 THO complex subunit 4A-like isoform X24.6e-10181.3Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        M +PLD SLDDIIKNNKKSGSSNFRGRGGASSGP PSRRF NRGLNRAAPYS AK  +     P S +L +               +HG AYPS PARAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
         IETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVG NIVTP LPASSN NFGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         +GF RGGRA+GRNRGGGRGRGPGRGGRGR  G+GRG GEKLSAEDLDADLEKYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1HLI4 THO complex subunit 4B-like7.6e-10482.44Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+  +                   T  +     +HG+AYPSQPARAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+N NFGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGF RGG  +GRNRGGGRGRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1I3N4 THO complex subunit 4A-like1.7e-10382.44Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+  +                   T  +     +HGAAYPSQPARAS
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARAS

Query:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN
        AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+N NFGN
Subjt:  AIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGN

Query:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         NGF RGG  +GRNRGGGRGRGPGRGGRG GS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  LNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

SwissProt top hitse value%identityAlignment
B5FXN8 THO complex subunit 45.2e-3342.31Show/hide
Query:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQ
        MA+ +DMSLDDIIK N+       G    RGRGG + G GP R     G     P     V   G           P+P S          +    + S 
Subjt:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQ

Query:  PARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSN
            + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA+V F R+ADAL A+K+YN V LDG+PM +++V + I T   PA S 
Subjt:  PARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSN

Query:  ANFGNLNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
            N  G  R    +G   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  ANFGNLNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

Q6NQ72 THO complex subunit 4D4.5e-3740.85Show/hide
Query:  MAEPLDMSLDDIIKNNK--KSGSSNF-----RGRGGASSGPGPSRRFRNRGLNRAAPYSA-AKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAY
        M+  L+M+LD+I+K  K  +SG         RGRGG   G GP+RR          P +  A+      N P+    +LP  S                 
Subjt:  MAEPLDMSLDDIIKNNK--KSGSSNF-----RGRGGASSGPGPSRRFRNRGLNRAAPYSA-AKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAY

Query:  PSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA-LP
          + A AS +E GT+L+V+NLD GV+NEDI+ELFSE+G+++RY+I+YDK+GR  GTAEVV+ R++DA  A+K+YNNV LDG+PM+LEI+G N  + A L 
Subjt:  PSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA-LP

Query:  ASSNANFGNLNG-------FPRGGRAMGRNRGGGRGRGP----------------------------GRGGRGRGSGSGRGRGEK---LSAEDLDADLEK
           N N   LNG         +GG   GR RGG  GRGP                            GRG  GRG G GRG G+K    SA DLD DLE 
Subjt:  ASSNANFGNLNG-------FPRGGRAMGRNRGGGRGRGP----------------------------GRGGRGRGSGSGRGRGEK---LSAEDLDADLEK

Query:  YHEEAM
        YH +AM
Subjt:  YHEEAM

Q8L719 THO complex subunit 4B4.2e-5954.18Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGA
        M+  LDMSLDDIIK+N+K           G +N  GRGG+ S  GPSRRF NR   R APYS    +    +            +    T  +  A  G 
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGA

Query:  AYPSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPAL
           +     S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+  PAL
Subjt:  AYPSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPAL

Query:  P-------------------ASSNANF-GNLNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
        P                    + N NF GN NG  RG   G  MGR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  P-------------------ASSNANF-GNLNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

Q8L773 THO complex subunit 4A1.1e-5654.89Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +AK                P  +      +  + +H +   S    
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF
         + IETGTKLY+SNLDYGV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN 
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF

Query:  GNLNGFP-RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GN NG P RGG+   G+ RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  GNLNGFP-RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

Q94EH8 THO complex subunit 4C2.1e-3439.61Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNI---PLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQ--
        M++ L+M+LD+I+K +K   S+  R     S G G SR+    G  R  P         G  +   PL++N     PSSS +            + +Q  
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNI---PLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQ--

Query:  -------PARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTP
                   S +E GT +Y++NLD GV+NEDI+EL++E+G+LKRY+I+YDK+GR  G+AEVV+ R++DA+ A+++YNNV LDG+PMKLEI+G N  T 
Subjt:  -------PARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTP

Query:  ALPASSNANFGNLNGFPRGGRAMGRN-RGG----GRGRGP-----------------GRGG-RGRGSGSGRGRGEK-----------LSAEDLDADLEKY
        + P ++  N   LNG  +    +G+  RGG    GRG GP                 GRGG RGRG G+G GRG K            SA DLD DLE Y
Subjt:  ALPASSNANFGNLNGFPRGGRAMGRN-RGG----GRGRGP-----------------GRGG-RGRGSGSGRGRGEK-----------LSAEDLDADLEKY

Query:  HEEAMQIN
        H EAM I+
Subjt:  HEEAMQIN

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein3.0e-6054.18Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGA
        M+  LDMSLDDIIK+N+K           G +N  GRGG+ S  GPSRRF NR   R APYS    +    +            +    T  +  A  G 
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGA

Query:  AYPSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPAL
           +     S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+  PAL
Subjt:  AYPSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPAL

Query:  P-------------------ASSNANF-GNLNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
        P                    + N NF GN NG  RG   G  MGR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  P-------------------ASSNANF-GNLNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G02530.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.7e-6054.55Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGA
        M+  LDMSLDDIIK+N+K           G +N  GRGG+ S  GPSRRF NR   R APYS    +    +            +    T  +  A  G 
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGA

Query:  AYPSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPAL
           +     S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+  PAL
Subjt:  AYPSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPAL

Query:  P-------------------ASSNANF-GNLNGFPRG-GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
        P                    + N NF GN NG  RG G  MGR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  P-------------------ASSNANF-GNLNGFPRG-GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein8.1e-5854.89Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +AK                P  +      +  + +H +   S    
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF
         + IETGTKLY+SNLDYGV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN 
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF

Query:  GNLNGFP-RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GN NG P RGG+   G+ RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  GNLNGFP-RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein6.2e-5854.51Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +A     G ++                  +  + +H +   S    
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF
         + IETGTKLY+SNLDYGV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN 
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF

Query:  GNLNGFP-RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GN NG P RGG+   G+ RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  GNLNGFP-RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein1.1e-5754.68Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +AK                P  +      +  + +H +   S    
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPAR

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF
         + IETGTKLY+SNLDYGV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN 
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANF

Query:  GNLNGFP--RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GN NG P  RGG+   G+ RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  GNLNGFP--RGGRAM-GRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGCCTCTCGACATGAGCTTAGACGATATCATCAAGAACAACAAGAAATCCGGATCCTCAAATTTCAGGGGTCGTGGCGGAGCTTCTTCTGGACCAGGTCCCTC
TCGCCGCTTTCGCAATCGCGGTCTTAATAGAGCAGCGCCCTATTCTGCAGCCAAGGTGAAGGATTTGGGTTTTAACATCCCGCTTTCGCTGAATCTTACTCTACCATCAC
CATCCTCTAGCACTACTACTACTACTACTACTACTGCCAATCATGGTGCGGCATATCCTTCACAGCCTGCACGGGCCTCTGCTATCGAAACCGGCACGAAGCTTTATGTT
TCGAATTTGGATTATGGTGTTTCCAACGAGGATATCAAGGAACTGTTTTCTGAAGTTGGCGATCTAAAACGGTATTCTATAAATTATGATAAAAGTGGGAGATCAAAGGG
AACAGCAGAAGTAGTTTTTTCACGACAAGCAGATGCTCTAGCTGCTATAAAGAGATACAACAACGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATTGTGGGAACGA
ACATCGTGACACCTGCTCTGCCTGCTTCTTCGAATGCCAATTTTGGAAATTTAAATGGATTTCCGAGAGGTGGACGTGCAATGGGCCGAAACCGGGGTGGTGGACGAGGT
CGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGCAGTGGCAGTGGTAGAGGTCGTGGTGAGAAGTTATCAGCTGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGA
AGCAATGCAGATCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAGCCTCTCGACATGAGCTTAGACGATATCATCAAGAACAACAAGAAATCCGGATCCTCAAATTTCAGGGGTCGTGGCGGAGCTTCTTCTGGACCAGGTCCCTC
TCGCCGCTTTCGCAATCGCGGTCTTAATAGAGCAGCGCCCTATTCTGCAGCCAAGGTGAAGGATTTGGGTTTTAACATCCCGCTTTCGCTGAATCTTACTCTACCATCAC
CATCCTCTAGCACTACTACTACTACTACTACTACTGCCAATCATGGTGCGGCATATCCTTCACAGCCTGCACGGGCCTCTGCTATCGAAACCGGCACGAAGCTTTATGTT
TCGAATTTGGATTATGGTGTTTCCAACGAGGATATCAAGGAACTGTTTTCTGAAGTTGGCGATCTAAAACGGTATTCTATAAATTATGATAAAAGTGGGAGATCAAAGGG
AACAGCAGAAGTAGTTTTTTCACGACAAGCAGATGCTCTAGCTGCTATAAAGAGATACAACAACGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATTGTGGGAACGA
ACATCGTGACACCTGCTCTGCCTGCTTCTTCGAATGCCAATTTTGGAAATTTAAATGGATTTCCGAGAGGTGGACGTGCAATGGGCCGAAACCGGGGTGGTGGACGAGGT
CGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGCAGTGGCAGTGGTAGAGGTCGTGGTGAGAAGTTATCAGCTGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGA
AGCAATGCAGATCAATTAA
Protein sequenceShow/hide protein sequence
MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKVKDLGFNIPLSLNLTLPSPSSSTTTTTTTTANHGAAYPSQPARASAIETGTKLYV
SNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSNANFGNLNGFPRGGRAMGRNRGGGRG
RGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN