; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G21570 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G21570
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTHO complex subunit 4A-like
Genome locationClcChr02:33770830..33772868
RNA-Seq ExpressionClc02G21570
SyntenyClc02G21570
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035419.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-10890.09Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAA+RTPYSAPKAPETTWQHDMF D G+GFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVG   A NPFEN NGAPRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA
        GRG GRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA

XP_004143092.1 THO complex subunit 4A [Cucumis sativus]3.6e-11493.51Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPKAPETTWQHDMFAD  SGF VQ GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA
         GFGRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA

XP_008448384.1 PREDICTED: THO complex subunit 4A-like isoform X1 [Cucumis melo]2.8e-11493.94Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFAD  SGF  Q GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA
        RGFGRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA

XP_022947495.1 THO complex subunit 4A-like [Cucurbita moschata]3.9e-10890.09Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAA+RTPYSAPKAPETTWQHDMF D G+GFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVG   A NPFEN NGAPRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA
        GRG GRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA

XP_038901991.1 THO complex subunit 4A-like [Benincasa hispida]3.9e-11694.81Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPKAPETTWQHDMFAD GSGF  Q GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPFENLNGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA
        RGFGRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA

TrEMBL top hitse value%identityAlignment
A0A0A0KBU2 RRM domain-containing protein1.7e-11493.51Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPKAPETTWQHDMFAD  SGF VQ GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA
         GFGRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA

A0A1S3BK70 THO complex subunit 4A-like isoform X11.3e-11493.94Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFAD  SGF  Q GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA
        RGFGRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLENFWA

A0A5A7V158 THO complex subunit 4A-like isoform X17.1e-10894.47Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFAD  SGF  Q GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKV
        RGFGRGRGRGRGPSEK+
Subjt:  RGFGRGRGRGRGPSEKV

A0A6J1G721 THO complex subunit 4A-like1.9e-10890.09Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAA+RTPYSAPKAPETTWQHDMF D G+GFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVG   A NPFEN NGAPRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA
        GRG GRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA

A0A6J1L1M6 THO complex subunit 4A-like5.4e-10889.66Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAA+RTPYSAPKAPETTWQHDMF D G+GFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVG   A NPFEN NG PRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA
        GRG GRGRGRGRGPSEKVSAEDLDADLE + A
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLENFWA

SwissProt top hitse value%identityAlignment
O08583 THO complex subunit 48.0e-4041.98Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRAASR--------TPYSAPKAPETTWQHDMFADSGSGFPV
        MA  +DMSLDDIIK N+  R G   GRGRG                    G GP+R  P  A            PYS PK     WQHD+F    SGF  
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRAASR--------TPYSAPKAPETTWQHDMFADSGSGFPV

Query:  QAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT
          G  + ++TG KL +SNLD+GVS+ DI+ELF+E G +K+  +HYD+SGRS GTA+V F R+ DA+ A+K+YN V LDG+PM I++V + I        T
Subjt:  QAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT

Query:  AANPFENLN-GAPRRQQGR-----GGPPSRQRGRGFGRGRGRGRGPSEKVSAEDLDADLENF
           P +++N G   R +G      GG     RG   GRGRG GR   +++SAE+LDA L+ +
Subjt:  AANPFENLN-GAPRRQQGR-----GGPPSRQRGRGFGRGRGRGRGPSEKVSAEDLDADLENF

Q3T0I4 THO complex subunit 46.1e-4042.8Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRAASR---------TPYSAPKAPETTWQHDMFADSGSGFP
        MA  +DMSLDDIIK N+  R G   GRGRG                    G GP+R  P  A             PYS PK     WQHD+F    SGF 
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRAASR---------TPYSAPKAPETTWQHDMFADSGSGFP

Query:  VQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLT
           G  + ++TG KL +SNLD+GVS+ DI+ELF+E G +K+  +HYD+SGRS GTA+V F R+ DA+ A+K+YN V LDG+PM I++V + I        
Subjt:  VQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLT

Query:  TAANPFENLN-GAPRRQQGR----GGPPSRQRGRGFGRGRGRGRGPSEK--VSAEDLDADLENF
        T   P +++N G   R +G     GG  +R+  RG  RGRGRG G S K  +SAE+LDA L+ +
Subjt:  TAANPFENLN-GAPRRQQGR----GGPPSRQRGRGFGRGRGRGRGPSEK--VSAEDLDADLENF

Q6NQ72 THO complex subunit 4D2.7e-4042.21Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGN---SRGRGR-----GSGPGPVRRFP---NRAASRTPYSAP--KAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKL
        M+  L+M+LD+I+K  K +RSG    SRGRGR     G G GP RR P   N   S    + P  +     WQ  +F D      ++A  AS ++ GT+L
Subjt:  MAAPLDMSLDDIIKNNKKSRSGN---SRGRGR-----GSGPGPVRRFP---NRAASRTPYSAP--KAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKL

Query:  YISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPA----------VGL------
        +++NLD GV+NEDI+ELFSE+GE++RY IHYDK+GR  GTAEVV+ RR DA  A+KKYNNV LDG+PM++EI+G N S+ A           GL      
Subjt:  YISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPA----------VGL------

Query:  --------------------------TTAANPFENLNGAPRRQQGRGGPPSRQRGRGFGRGRGRGRGPSEK---VSAEDLDADLENFWA
                                   +   P  N  G   R  GRGG  +R RG G GRGRG GRG  +K    SA DLD DLE++ A
Subjt:  --------------------------TTAANPFENLNGAPRRQQGRGGPPSRQRGRGFGRGRGRGRGPSEK---VSAEDLDADLENFWA

Q8L719 THO complex subunit 4B4.1e-6054.58Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAP----KAPETTWQHDMFADS-------GSGFPVQAGR
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR  +RT PYS P    +A +  WQ+D+FA         G       G 
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAP----KAPETTWQHDMFADS-------GSGFPVQAGR

Query:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG-LTTAAN
         S+I+TGTKLYISNLDYGVSNEDIKELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEIVGTN+S PA+  L TA  
Subjt:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG-LTTAAN

Query:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLENF
        PF          EN NG          R +GRGG   R RG GFG          RGRG     GRG  E VSAEDLDA+L+ +
Subjt:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLENF

Q8L773 THO complex subunit 4A2.0e-6763.37Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY + KAPE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP-RRQQGRGGP
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A      AN   N NGAP R  QGRGG 
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP-RRQQGRGGP

Query:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF
          +QRG         G GRGR  G+GP+EK+SAEDLDADL+ +
Subjt:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.9e-6154.58Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAP----KAPETTWQHDMFADS-------GSGFPVQAGR
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR  +RT PYS P    +A +  WQ+D+FA         G       G 
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAP----KAPETTWQHDMFADS-------GSGFPVQAGR

Query:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG-LTTAAN
         S+I+TGTKLYISNLDYGVSNEDIKELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEIVGTN+S PA+  L TA  
Subjt:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG-LTTAAN

Query:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLENF
        PF          EN NG          R +GRGG   R RG GFG          RGRG     GRG  E VSAEDLDA+L+ +
Subjt:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLENF

AT5G02530.2 RNA-binding (RRM/RBD/RNP motifs) family protein8.4e-6154.61Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAP----KAPETTWQHDMFADS-------GSGFPVQAGR
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR  +RT PYS P    +A +  WQ+D+FA         G       G 
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAP----KAPETTWQHDMFADS-------GSGFPVQAGR

Query:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG-LTTAAN
         S+I+TGTKLYISNLDYGVSNEDIKELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEIVGTN+S PA+  L TA  
Subjt:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG-LTTAAN

Query:  PF----------ENLNGAPRRQ-----QGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLENF
        PF          EN NG          +GRGG   R RG GFG          RGRG     GRG  E VSAEDLDA+L+ +
Subjt:  PF----------ENLNGAPRRQ-----QGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLENF

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.4e-6863.37Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY + KAPE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP-RRQQGRGGP
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A      AN   N NGAP R  QGRGG 
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP-RRQQGRGGP

Query:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF
          +QRG         G GRGR  G+GP+EK+SAEDLDADL+ +
Subjt:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein1.0e-6662.96Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY +  APE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP-RRQQGRGGP
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A      AN   N NGAP R  QGRGG 
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP-RRQQGRGGP

Query:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF
          +QRG         G GRGR  G+GP+EK+SAEDLDADL+ +
Subjt:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein1.9e-6863.11Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY + KAPE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKAPETTWQHDMFADSGSGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP--RRQQGRGG
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A      AN   N NGAP  R  QGRGG
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAP--RRQQGRGG

Query:  PPSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF
           +QRG         G GRGR  G+GP+EK+SAEDLDADL+ +
Subjt:  PPSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLENF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCCTTTGGATATGAGTCTTGATGATATCATAAAGAACAATAAAAAATCCAGATCCGGTAATTCTAGAGGTCGCGGAAGAGGTTCTGGACCCGGTCCCGTCCG
TCGATTTCCCAATCGCGCCGCTAGTCGCACACCTTATTCCGCTCCCAAGGCGCCGGAGACGACGTGGCAGCACGATATGTTCGCCGATTCGGGTTCTGGATTCCCTGTGC
AAGCTGGGAGAGCCTCCGCCATTCAGACTGGGACGAAGCTTTACATATCTAATTTGGATTACGGCGTTTCTAACGAAGATATTAAGGAACTTTTTTCTGAAGTTGGTGAG
ATGAAACGTTACGGGATCCACTATGACAAGAGTGGGAGATCCAAGGGAACAGCAGAAGTAGTTTTCTCAAGACGACTAGATGCTGTTGCGGCCGTCAAGAAGTACAACAA
CGTTCAGCTTGATGGAAAACCAATGAAGATAGAGATCGTTGGAACTAATATTTCCACGCCTGCTGTTGGTCTTACCACTGCTGCGAACCCTTTCGAAAATTTAAATGGGG
CTCCTAGAAGGCAGCAAGGTAGGGGTGGTCCACCATCACGTCAACGTGGTCGTGGTTTTGGAAGAGGGCGTGGGCGGGGAAGAGGTCCAAGTGAAAAAGTATCCGCTGAA
GATCTTGATGCTGACTTGGAAAACTTTTGGGCTTTTCTATTGTTTAGCTTTCGTATGATGACTTCAGCAAGCCCGATCACATCTGCGAGGATGACATTGGTTGGAATAAA
ACACCAGCTGCAAATTATACAACAAATGAGTTGGGGTTCCCCGTTGAGGGAACTTCAACTCACTAAAACTAATATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCCTTTGGATATGAGTCTTGATGATATCATAAAGAACAATAAAAAATCCAGATCCGGTAATTCTAGAGGTCGCGGAAGAGGTTCTGGACCCGGTCCCGTCCG
TCGATTTCCCAATCGCGCCGCTAGTCGCACACCTTATTCCGCTCCCAAGGCGCCGGAGACGACGTGGCAGCACGATATGTTCGCCGATTCGGGTTCTGGATTCCCTGTGC
AAGCTGGGAGAGCCTCCGCCATTCAGACTGGGACGAAGCTTTACATATCTAATTTGGATTACGGCGTTTCTAACGAAGATATTAAGGAACTTTTTTCTGAAGTTGGTGAG
ATGAAACGTTACGGGATCCACTATGACAAGAGTGGGAGATCCAAGGGAACAGCAGAAGTAGTTTTCTCAAGACGACTAGATGCTGTTGCGGCCGTCAAGAAGTACAACAA
CGTTCAGCTTGATGGAAAACCAATGAAGATAGAGATCGTTGGAACTAATATTTCCACGCCTGCTGTTGGTCTTACCACTGCTGCGAACCCTTTCGAAAATTTAAATGGGG
CTCCTAGAAGGCAGCAAGGTAGGGGTGGTCCACCATCACGTCAACGTGGTCGTGGTTTTGGAAGAGGGCGTGGGCGGGGAAGAGGTCCAAGTGAAAAAGTATCCGCTGAA
GATCTTGATGCTGACTTGGAAAACTTTTGGGCTTTTCTATTGTTTAGCTTTCGTATGATGACTTCAGCAAGCCCGATCACATCTGCGAGGATGACATTGGTTGGAATAAA
ACACCAGCTGCAAATTATACAACAAATGAGTTGGGGTTCCCCGTTGAGGGAACTTCAACTCACTAAAACTAATATATAA
Protein sequenceShow/hide protein sequence
MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKAPETTWQHDMFADSGSGFPVQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGE
MKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPFENLNGAPRRQQGRGGPPSRQRGRGFGRGRGRGRGPSEKVSAE
DLDADLENFWAFLLFSFRMMTSASPITSARMTLVGIKHQLQIIQQMSWGSPLRELQLTKTNI