; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001454 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001454
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTHO complex subunit 4A-like
Genome locationChr09:17283332..17284918
RNA-Seq ExpressionHG10001454
SyntenyHG10001454
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035419.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-11693.7Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAANRTPYSAPKAPETTWQHDMF D GAGFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVGP AAVNPFEN NGAPRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
        GRG GRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

XP_004143092.1 THO complex subunit 4A [Cucumis sativus]4.4e-12195.78Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFAD  +GF VQ GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGP AAVNPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
         GFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

XP_008448384.1 PREDICTED: THO complex subunit 4A-like isoform X1 [Cucumis melo]2.8e-12095.36Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPKAPETTWQHDMFAD  +GF  Q GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGP AAVNPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
        RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

XP_022947495.1 THO complex subunit 4A-like [Cucurbita moschata]3.2e-11693.7Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAANRTPYSAPKAPETTWQHDMF D GAGFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVGP AAVNPFEN NGAPRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
        GRG GRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

XP_038901991.1 THO complex subunit 4A-like [Benincasa hispida]4.7e-12397.05Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFAD G+GF  Q GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGP AAVNPFENLNGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
        RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

TrEMBL top hitse value%identityAlignment
A0A0A0KBU2 RRM domain-containing protein2.1e-12195.78Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFAD  +GF VQ GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGP AAVNPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
         GFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

A0A1S3BK70 THO complex subunit 4A-like isoform X11.4e-12095.36Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPKAPETTWQHDMFAD  +GF  Q GRASAIQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG
        IKELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGP AAVNPFEN NGAPRRQQGRGGPPSRQRG
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRG

Query:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
        RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

A0A6J1D4W8 THO complex subunit 4A-like6.8e-11289.71Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSR-GRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNE
        MAAPLDMSLDDIIKNNKKSRSGNSR GRGR SGPGPVRRFPNRAANRTPY+APKAPET WQHDMF D  +GF VQAGRASAIQTGTKLYISNLDYGVSNE
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSR-GRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNE

Query:  DIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR
        DIKELFSEVG+MK Y IHYDKSGRSKGTAEVVFSRR+DAVAAVKKYNNVQLDGKPMKIEIVGTNI+TPA  P AAV PFENLNG PRRQQGRGG PSRQR
Subjt:  DIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR

Query:  -----GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
             GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  -----GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

A0A6J1G721 THO complex subunit 4A-like1.6e-11693.7Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAANRTPYSAPKAPETTWQHDMF D GAGFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVGP AAVNPFEN NGAPRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
        GRG GRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

A0A6J1L1M6 THO complex subunit 4A-like4.6e-11693.28Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAANRTPYSAPKAPETTWQHDMF D GAGFPVQAGRAS+IQTGTKLYISNLDYGVSNED
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNED

Query:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-
        IKELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVGP AAVNPFEN NG PRRQQGRGG PSRQR 
Subjt:  IKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQR-

Query:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
        GRG GRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
Subjt:  GRGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

SwissProt top hitse value%identityAlignment
Q3T0I4 THO complex subunit 47.9e-4144.53Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRA--------ANR-TPYSAPKAPETTWQHDMFADSGAGFP
        MA  +DMSLDDIIK N+  R G   GRGRG                    G GP+R  P  A         NR  PYS PK     WQHD+F DSG    
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRA--------ANR-TPYSAPKAPETTWQHDMFADSGAGFP

Query:  VQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPI
           G  + ++TG KL +SNLD+GVS+ DI+ELF+E G +K+  +HYD+SGRS GTA+V F R+ DA+ A+K+YN V LDG+PM I++V + I T    P 
Subjt:  VQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPI

Query:  AAVNPFENLNGAPRRQQGR----GGPPSRQRGRGFGRGRGRGRGPSEK--VSAEDLDADLEKYHA
         +VN      G   R +G     GG  +R+  RG  RGRGRG G S K  +SAE+LDA L+ Y+A
Subjt:  AAVNPFENLNGAPRRQQGR----GGPPSRQRGRGFGRGRGRGRGPSEK--VSAEDLDADLEKYHA

Q6NQ72 THO complex subunit 4D1.0e-4343.34Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGN---SRGRGR-----GSGPGPVRRFPNRAANRTPYS------APKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTK
        M+  L+M+LD+I+K  K +RSG    SRGRGR     G G GP RR P  A N  P S        +     WQ  +F D      ++A  AS ++ GT+
Subjt:  MAAPLDMSLDDIIKNNKKSRSGN---SRGRGR-----GSGPGPVRRFPNRAANRTPYS------APKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTK

Query:  LYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAV-----------------
        L+++NLD GV+NEDI+ELFSE+GE++RY IHYDK+GR  GTAEVV+ RR DA  A+KKYNNV LDG+PM++EI+G N S+ A                  
Subjt:  LYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAV-----------------

Query:  ----------------------GPIAAVN---PFENLNGAPRRQQGRGGPPSRQRGRGFGRGRGRGRGPSEK---VSAEDLDADLEKYHAESM
                              GP   V+   P  N  G   R  GRGG  +R RG G GRGRG GRG  +K    SA DLD DLE YHA++M
Subjt:  ----------------------GPIAAVN---PFENLNGAPRRQQGRGGPPSRQRGRGFGRGRGRGRGPSEK---VSAEDLDADLEKYHAESM

Q8L719 THO complex subunit 4B6.2e-6254.48Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAANRT-PYSAP----KAPETTWQHDMFADS-------GAGFPVQAGR
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR   RT PYS P    +A +  WQ+D+FA         G       G 
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAANRT-PYSAP----KAPETTWQHDMFADS-------GAGFPVQAGR

Query:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVN-
         S+I+TGTKLYISNLDYGVSNEDIKELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEIVGTN+S PA+  +A    
Subjt:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVN-

Query:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLEKYHAESMQ
        PF          EN NG          R +GRGG   R RG GFG          RGRG     GRG  E VSAEDLDA+L+KYH E+M+
Subjt:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLEKYHAESMQ

Q8L773 THO complex subunit 4A1.4e-6962.95Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR + R+ PY + KAPE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP-RRQQGRGGP
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A       N   N NGAP R  QGRGG 
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP-RRQQGRGGP

Query:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
          +QRG         G GRGR  G+GP+EK+SAEDLDADL+KYH+  M+ N
Subjt:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

Q94EH8 THO complex subunit 4C1.6e-4141.31Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSR----------GRGRGS---------GPGPVRRFPNRAANRTPYS-------APKAPETTW--QHDMFADSGAGFP
        M+  L+M+LD+I+K +K  RS  +R          GRGRG          G GPVRR P  A N  P S       A +     W  Q+D++ ++     
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSR----------GRGRGS---------GPGPVRRFPNRAANRTPYS-------APKAPETTW--QHDMFADSGAGFP

Query:  VQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPI
        ++A   S ++ GT +YI+NLD GV+NEDI+EL++E+GE+KRY IHYDK+GR  G+AEVV+ RR DA+ A++KYNNV LDG+PMK+EI+G N  +    P+
Subjt:  VQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPI

Query:  AAVNPFENLNGAPRRQ-------------QGRGGPPSRQR-------------GRGFGRGRGRGRG--------------PSEKVSAEDLDADLEKYHAE
        AA      LNG  +R              +GRG  PS +R             GRG  RGRGRG G              P EK SA DLD DLE YHAE
Subjt:  AAVNPFENLNGAPRRQ-------------QGRGGPPSRQR-------------GRGFGRGRGRGRG--------------PSEKVSAEDLDADLEKYHAE

Query:  SMQIN
        +M I+
Subjt:  SMQIN

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein4.4e-6354.48Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAANRT-PYSAP----KAPETTWQHDMFADS-------GAGFPVQAGR
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR   RT PYS P    +A +  WQ+D+FA         G       G 
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAANRT-PYSAP----KAPETTWQHDMFADS-------GAGFPVQAGR

Query:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVN-
         S+I+TGTKLYISNLDYGVSNEDIKELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEIVGTN+S PA+  +A    
Subjt:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVN-

Query:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLEKYHAESMQ
        PF          EN NG          R +GRGG   R RG GFG          RGRG     GRG  E VSAEDLDA+L+KYH E+M+
Subjt:  PF----------ENLNG-------APRRQQGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLEKYHAESMQ

AT5G02530.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.3e-6254.51Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAANRT-PYSAP----KAPETTWQHDMFADS-------GAGFPVQAGR
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR   RT PYS P    +A +  WQ+D+FA         G       G 
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAANRT-PYSAP----KAPETTWQHDMFADS-------GAGFPVQAGR

Query:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVN-
         S+I+TGTKLYISNLDYGVSNEDIKELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEIVGTN+S PA+  +A    
Subjt:  ASAIQTGTKLYISNLDYGVSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVN-

Query:  PF----------ENLNGAPRRQ-----QGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLEKYHAESMQ
        PF          EN NG          +GRGG   R RG GFG          RGRG     GRG  E VSAEDLDA+L+KYH E+M+
Subjt:  PF----------ENLNGAPRRQ-----QGRGGPPSRQRGRGFG----------RGRG----RGRGPSEKVSAEDLDADLEKYHAESMQ

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein9.9e-7162.95Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR + R+ PY + KAPE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP-RRQQGRGGP
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A       N   N NGAP R  QGRGG 
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP-RRQQGRGGP

Query:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
          +QRG         G GRGR  G+GP+EK+SAEDLDADL+KYH+  M+ N
Subjt:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein7.1e-6962.55Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR + R+ PY +  APE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP-RRQQGRGGP
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A       N   N NGAP R  QGRGG 
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP-RRQQGRGGP

Query:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
          +QRG         G GRGR  G+GP+EK+SAEDLDADL+KYH+  M+ N
Subjt:  PSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein1.3e-7062.7Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR + R+ PY + KAPE+TW HDMF+D       ++GR+SA I+TGTKLYISNLDYG
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAANRT-PYSAPKAPETTWQHDMFADSGAGFPVQAGRASA-IQTGTKLYISNLDYG

Query:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP--RRQQGRGG
        V NEDIKELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A       N   N NGAP  R  QGRGG
Subjt:  VSNEDIKELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAP--RRQQGRGG

Query:  PPSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN
           +QRG         G GRGR  G+GP+EK+SAEDLDADL+KYH+  M+ N
Subjt:  PPSRQRG--------RGFGRGRGRGRGPSEKVSAEDLDADLEKYHAESMQIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCCTTTGGATATGAGTCTTGATGATATCATAAAGAACAACAAAAAATCCAGATCCGGTAATTCTAGAGGTCGCGGAAGAGGTTCTGGACCCGGTCCCGTCCG
TCGATTTCCCAATCGCGCCGCTAATCGCACACCTTATTCTGCTCCCAAGGCGCCGGAGACGACGTGGCAGCACGATATGTTCGCCGATTCGGGTGCTGGATTCCCTGTGC
AAGCTGGGCGAGCCTCTGCTATTCAGACTGGGACCAAGCTTTACATATCTAATTTGGATTACGGTGTTTCTAATGAAGATATTAAGGAACTTTTTTCTGAAGTTGGTGAG
ATGAAACGCTACGGAATTCACTATGACAAGAGTGGGAGATCTAAGGGAACCGCAGAAGTAGTTTTCTCACGACGACTAGATGCTGTTGCGGCCGTCAAGAAGTACAACAA
CGTTCAGCTTGATGGAAAACCAATGAAGATAGAGATCGTTGGAACTAATATTTCCACACCTGCTGTTGGTCCTATCGCTGCTGTGAACCCTTTCGAAAATTTAAATGGGG
CTCCAAGAAGGCAGCAAGGTAGGGGTGGTCCACCATCACGTCAACGTGGTCGTGGTTTTGGAAGGGGGCGTGGCCGAGGAAGAGGTCCAAGTGAAAAAGTATCCGCCGAA
GATCTTGATGCTGACTTGGAAAAGTATCATGCTGAGTCTATGCAGATAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCCTTTGGATATGAGTCTTGATGATATCATAAAGAACAACAAAAAATCCAGATCCGGTAATTCTAGAGGTCGCGGAAGAGGTTCTGGACCCGGTCCCGTCCG
TCGATTTCCCAATCGCGCCGCTAATCGCACACCTTATTCTGCTCCCAAGGCGCCGGAGACGACGTGGCAGCACGATATGTTCGCCGATTCGGGTGCTGGATTCCCTGTGC
AAGCTGGGCGAGCCTCTGCTATTCAGACTGGGACCAAGCTTTACATATCTAATTTGGATTACGGTGTTTCTAATGAAGATATTAAGGAACTTTTTTCTGAAGTTGGTGAG
ATGAAACGCTACGGAATTCACTATGACAAGAGTGGGAGATCTAAGGGAACCGCAGAAGTAGTTTTCTCACGACGACTAGATGCTGTTGCGGCCGTCAAGAAGTACAACAA
CGTTCAGCTTGATGGAAAACCAATGAAGATAGAGATCGTTGGAACTAATATTTCCACACCTGCTGTTGGTCCTATCGCTGCTGTGAACCCTTTCGAAAATTTAAATGGGG
CTCCAAGAAGGCAGCAAGGTAGGGGTGGTCCACCATCACGTCAACGTGGTCGTGGTTTTGGAAGGGGGCGTGGCCGAGGAAGAGGTCCAAGTGAAAAAGTATCCGCCGAA
GATCTTGATGCTGACTTGGAAAAGTATCATGCTGAGTCTATGCAGATAAATTAG
Protein sequenceShow/hide protein sequence
MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAANRTPYSAPKAPETTWQHDMFADSGAGFPVQAGRASAIQTGTKLYISNLDYGVSNEDIKELFSEVGE
MKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGPIAAVNPFENLNGAPRRQQGRGGPPSRQRGRGFGRGRGRGRGPSEKVSAE
DLDADLEKYHAESMQIN