; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G045750 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G045750
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionTHO complex subunit 4A-like
Genome locationCiama_Chr02:33488362..33491379
RNA-Seq ExpressionCaUC02G045750
SyntenyCaUC02G045750
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061942.1 THO complex subunit 4A-like isoform X1 [Cucumis melo var. makuwa]6.0e-8752.23Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPK                 APETTWQHDMFAD  S F  Q GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPRSFLGVYIVTTVKISSLWFIHECYISELTVYDFWNTLGGSMRLYVFLVKSVHYALQICIMHLYLKQNFKDLCNDYALVFKYLYYELSLALLTS
        +N NGAPR   G                                GG                                                      
Subjt:  DNLNGAPRSFLGVYIVTTVKISSLWFIHECYISELTVYDFWNTLGGSMRLYVFLVKSVHYALQICIMHLYLKQNFKDLCNDYALVFKYLYYELSLALLTS

Query:  ILYNQWYYLYHNDVPFHTETGSKVGVVHHHVNVVVVLEEGVGGEEVQVKKYPLKILMLTWKTFGLFYCLAFPDHICEDDIGWNKTTAANYTTNELGFPVE
                      P   + G   G              G G      +K  L  L+L        +  + PDHIC+ DI WNK  AANYTTNEL FP+E
Subjt:  ILYNQWYYLYHNDVPFHTETGSKVGVVHHHVNVVVVLEEGVGGEEVQVKKYPLKILMLTWKTFGLFYCLAFPDHICEDDIGWNKTTAANYTTNELGFPVE

Query:  GTST
        GTST
Subjt:  GTST

KAG7035419.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-8680.09Show/hide
Query:  FLFNYSLNMAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQA
        FLF++SLNMAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAA+RTPYSAPK                 APETTWQHDMF D G+ FPVQA
Subjt:  FLFNYSLNMAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQA

Query:  GRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG
        GRAS+IQTGTKLYISNLDYGVSNEDIK     ELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVG
Subjt:  GRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG

Query:  LTTAANPFDNLNGAPR
           A NPF+N NGAPR
Subjt:  LTTAANPFDNLNGAPR

XP_004143092.1 THO complex subunit 4A [Cucumis sativus]6.6e-8683.17Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPK                 APETTWQHDMFAD  S F VQ GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPR
        +N NGAPR
Subjt:  DNLNGAPR

XP_008448388.1 PREDICTED: THO complex subunit 4A-like isoform X2 [Cucumis melo]6.6e-8683.25Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPK                 APETTWQHDMFAD  S F  Q GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPRS
        +N NGAPRS
Subjt:  DNLNGAPRS

XP_038901991.1 THO complex subunit 4A-like [Benincasa hispida]2.7e-8784.13Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPK                 APETTWQHDMFAD GS F  Q GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPR
        +NLNGAPR
Subjt:  DNLNGAPR

TrEMBL top hitse value%identityAlignment
A0A0A0KBU2 RRM domain-containing protein3.2e-8683.17Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAA+RTPYSAPK                 APETTWQHDMFAD  S F VQ GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPR
        +N NGAPR
Subjt:  DNLNGAPR

A0A1S3BIZ4 THO complex subunit 4A-like isoform X23.2e-8683.25Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPK                 APETTWQHDMFAD  S F  Q GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPRS
        +N NGAPRS
Subjt:  DNLNGAPRS

A0A1S3BK70 THO complex subunit 4A-like isoform X19.3e-8683.17Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPK                 APETTWQHDMFAD  S F  Q GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPR
        +N NGAPR
Subjt:  DNLNGAPR

A0A5A7V158 THO complex subunit 4A-like isoform X12.9e-8752.23Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPK                 APETTWQHDMFAD  S F  Q GRASAIQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKR+GIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVG T A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPRSFLGVYIVTTVKISSLWFIHECYISELTVYDFWNTLGGSMRLYVFLVKSVHYALQICIMHLYLKQNFKDLCNDYALVFKYLYYELSLALLTS
        +N NGAPR   G                                GG                                                      
Subjt:  DNLNGAPRSFLGVYIVTTVKISSLWFIHECYISELTVYDFWNTLGGSMRLYVFLVKSVHYALQICIMHLYLKQNFKDLCNDYALVFKYLYYELSLALLTS

Query:  ILYNQWYYLYHNDVPFHTETGSKVGVVHHHVNVVVVLEEGVGGEEVQVKKYPLKILMLTWKTFGLFYCLAFPDHICEDDIGWNKTTAANYTTNELGFPVE
                      P   + G   G              G G      +K  L  L+L        +  + PDHIC+ DI WNK  AANYTTNEL FP+E
Subjt:  ILYNQWYYLYHNDVPFHTETGSKVGVVHHHVNVVVVLEEGVGGEEVQVKKYPLKILMLTWKTFGLFYCLAFPDHICEDDIGWNKTTAANYTTNELGFPVE

Query:  GTST
        GTST
Subjt:  GTST

A0A6J1G721 THO complex subunit 4A-like4.3e-8380.29Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT
        MAAPLDMSLDDIIKNNKKSRSG SRGRGRGSGPGPVRR PNRAA+RTPYSAPK                 APETTWQHDMF D G+ FPVQAGRAS+IQT
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASAIQT

Query:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF
        GTKLYISNLDYGVSNEDIK     ELFSEVG+MKRYGIHYDKSGRSKGTAEVVFSRR DA AAVKKYNNVQLDGKPMKIEIVGTNI+T AVG   A NPF
Subjt:  GTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTAANPF

Query:  DNLNGAPR
        +N NGAPR
Subjt:  DNLNGAPR

SwissProt top hitse value%identityAlignment
Q3T0I4 THO complex subunit 41.4e-3041.2Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRAASR---------TPYSAPKVCFRVLSRFSSFEFWIEAP
        MA  +DMSLDDIIK N+  R G   GRGRG                    G GP+R  P  A             PYS PK                + P
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRGRGRG-------------------SGPGPVRRFPNRAASR---------TPYSAPKVCFRVLSRFSSFEFWIEAP

Query:  ETTWQHDMFADSGSAFPVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQL
        +  WQHD+F DSG       G  + ++TG KL +SNLD+GVS+ DI     QELF+E G +K+  +HYD+SGRS GTA+V F R+ DA+ A+K+YN V L
Subjt:  ETTWQHDMFADSGSAFPVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQL

Query:  DGKPMKIEIVGTNIST
        DG+PM I++V + I T
Subjt:  DGKPMKIEIVGTNIST

Q6NQ72 THO complex subunit 4D3.2e-3546.23Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGN---SRGRGR-----GSGPGPVRRFPNRAASR-TPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQ
        M+  L+M+LD+I+K  K +RSG    SRGRGR     G G GP RR P    +R + ++  K   RV S               WQ  +F D      ++
Subjt:  MAAPLDMSLDDIIKNNKKSRSGN---SRGRGR-----GSGPGPVRRFPNRAASR-TPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQ

Query:  AGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPA
        A  AS ++ GT+L+++NLD GV+NEDI+     ELFSE+GE++RY IHYDK+GR  GTAEVV+ RR DA  A+KKYNNV LDG+PM++EI+G N S+ A
Subjt:  AGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPA

Q8L719 THO complex subunit 4B4.7e-5056.82Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGS--
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR  +RT PYS P     +  +        +A +  WQ+D+FA   S  
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGS--

Query:  -AF----PVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEI
         AF        G  S+I+TGTKLYISNLDYGVSNEDIK     ELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEI
Subjt:  -AF----PVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEI

Query:  VGTNISTPAVG-LTTAANPF
        VGTN+S PA+  L TA  PF
Subjt:  VGTNISTPAVG-LTTAANPF

Q8L773 THO complex subunit 4A2.6e-5359.43Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY + K                 APE+TW HDMF+D       ++GR+S
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS

Query:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT
        A I+TGTKLYISNLDYGV NEDIK     ELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A     
Subjt:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT

Query:  AANPFDNLNGAP
         AN   N NGAP
Subjt:  AANPFDNLNGAP

Q94EH8 THO complex subunit 4C1.4e-3040.57Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSR----------GRGRGS---------GPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTW--QHD
        M+  L+M+LD+I+K +K  RS  +R          GRGRG          G GPVRR P    +R             S FS  +         W  Q+D
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSR----------GRGRGS---------GPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTW--QHD

Query:  MFADSGSAFPVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKI
        ++ ++     ++A   S ++ GT +YI+NLD GV+NEDI+     EL++E+GE+KRY IHYDK+GR  G+AEVV+ RR DA+ A++KYNNV LDG+PMK+
Subjt:  MFADSGSAFPVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKI

Query:  EIVGTNISTPAV
        EI+G N  +  V
Subjt:  EIVGTNISTPAV

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein3.3e-5156.82Show/hide
Query:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGS--
        M+  LDMSLDDIIK+N+K      RG            G GS  GP RRF NR  +RT PYS P     +  +        +A +  WQ+D+FA   S  
Subjt:  MAAPLDMSLDDIIKNNKKSRSGNSRG-----------RGRGSGPGPVRRFPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGS--

Query:  -AF----PVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEI
         AF        G  S+I+TGTKLYISNLDYGVSNEDIK     ELFSEVG++KRYGIHYD+SGRSKGTAEVVFSRR DA+AAVK+YNNVQLDGK MKIEI
Subjt:  -AF----PVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEI

Query:  VGTNISTPAVG-LTTAANPF
        VGTN+S PA+  L TA  PF
Subjt:  VGTNISTPAVG-LTTAANPF

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.9e-5459.43Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY + K                 APE+TW HDMF+D       ++GR+S
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS

Query:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT
        A I+TGTKLYISNLDYGV NEDIK     ELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A     
Subjt:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT

Query:  AANPFDNLNGAP
         AN   N NGAP
Subjt:  AANPFDNLNGAP

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein9.3e-5459.72Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASA
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R   SAP               +  APE+TW HDMF+D       ++GR+SA
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRTPYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRASA

Query:  -IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTA
         I+TGTKLYISNLDYGV NEDIK     ELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A      
Subjt:  -IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTTA

Query:  ANPFDNLNGAP
        AN   N NGAP
Subjt:  ANPFDNLNGAP

AT5G59950.4 RNA-binding (RRM/RBD/RNP motifs) family protein1.9e-5459.43Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY + K                 APE+TW HDMF+D       ++GR+S
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS

Query:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT
        A I+TGTKLYISNLDYGV NEDIK     ELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A     
Subjt:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT

Query:  AANPFDNLNGAP
         AN   N NGAP
Subjt:  AANPFDNLNGAP

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein1.1e-5459.35Show/hide
Query:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS
        M+  LDMSLDD+I  N+KSR  +G +RG G GSGPGP RR  PNR ++R+ PY + K                 APE+TW HDMF+D       ++GR+S
Subjt:  MAAPLDMSLDDIIKNNKKSR--SGNSRGRGRGSGPGPVRR-FPNRAASRT-PYSAPKVCFRVLSRFSSFEFWIEAPETTWQHDMFADSGSAFPVQAGRAS

Query:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT
        A I+TGTKLYISNLDYGV NEDIK     ELF+EVGE+KRY +H+D+SGRSKGTAEVV+SRR DA+AAVKKYN+VQLDGKPMKIEIVGTN+ T A     
Subjt:  A-IQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKPMKIEIVGTNISTPAVGLTT

Query:  AANPFDNLNGAPRS
         AN   N NGAP S
Subjt:  AANPFDNLNGAPRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTTCCGCCCCAATCTCTCTCTCCCCAAACCCTTCCCTATCGACGAATCTGCGAAACCCTAATTCCCTTTTCTTCTCACTCTTTTCTTTTCGACTGCTTCTTCACTGTTTT
TCTCTTCAATTACTCTTTAAACATGGCCGCTCCTTTGGATATGAGTCTTGATGATATCATAAAGAACAATAAAAAATCCAGATCCGGTAATTCTAGAGGTCGCGGAAGAG
GTTCTGGACCCGGTCCCGTCCGTCGATTTCCCAATCGCGCCGCTAGTCGCACACCTTATTCCGCTCCCAAGGTGTGCTTTCGAGTTTTATCCCGATTCTCCTCTTTCGAA
TTCTGGATTGAAGCGCCGGAGACGACGTGGCAGCACGATATGTTCGCCGATTCGGGTTCTGCATTCCCTGTGCAAGCTGGGAGAGCCTCCGCCATTCAGACTGGGACGAA
GCTTTACATATCTAATTTGGATTACGGCGTTTCTAACGAAGATATTAAGTGGCGGTGTTTGCAGGAACTTTTTTCTGAAGTTGGTGAGATGAAACGTTACGGGATCCACT
ATGACAAGAGTGGGAGATCCAAGGGAACGGCAGAAGTAGTTTTCTCAAGACGACTAGATGCTGTTGCGGCCGTCAAGAAGTACAACAACGTTCAGCTTGATGGAAAACCA
ATGAAGATAGAGATCGTTGGAACTAATATTTCCACGCCTGCTGTTGGTCTTACCACTGCTGCGAACCCTTTCGACAATTTAAATGGGGCTCCTAGAAGTTTCTTGGGAGT
GTATATTGTTACAACTGTTAAAATTTCTTCCCTTTGGTTTATTCATGAATGCTATATTAGTGAACTTACTGTATATGACTTTTGGAATACTTTAGGTGGTAGTATGCGGT
TGTATGTCTTCTTGGTGAAATCTGTGCATTATGCATTACAAATATGCATTATGCATTTGTACTTGAAACAAAATTTTAAAGATTTGTGTAATGATTATGCCCTTGTATTC
AAATATCTTTATTATGAACTCAGTTTGGCTTTGTTGACATCCATTTTATATAATCAGTGGTACTATCTATATCATAATGATGTTCCTTTTCATACGGAAACAGGCAGCAA
GGTAGGGGTGGTCCACCATCACGTCAACGTGGTCGTGGTTTTGGAAGAGGGCGTGGGCGGGGAAGAGGTCCAAGTGAAAAAGTATCCGCTGAAGATCTTGATGCTGACTT
GGAAAACTTTTGGGCTTTTCTATTGTTTAGCTTTCCCCGATCACATCTGCGAGGATGACATTGGTTGGAATAAAACAACAGCTGCAAATTATACAACAAATGAGTTGGGG
TTCCCCGTTGAGGGAACTTCAACTCACTAA
mRNA sequenceShow/hide mRNA sequence
CTTCCGCCCCAATCTCTCTCTCCCCAAACCCTTCCCTATCGACGAATCTGCGAAACCCTAATTCCCTTTTCTTCTCACTCTTTTCTTTTCGACTGCTTCTTCACTGTTTT
TCTCTTCAATTACTCTTTAAACATGGCCGCTCCTTTGGATATGAGTCTTGATGATATCATAAAGAACAATAAAAAATCCAGATCCGGTAATTCTAGAGGTCGCGGAAGAG
GTTCTGGACCCGGTCCCGTCCGTCGATTTCCCAATCGCGCCGCTAGTCGCACACCTTATTCCGCTCCCAAGGTGTGCTTTCGAGTTTTATCCCGATTCTCCTCTTTCGAA
TTCTGGATTGAAGCGCCGGAGACGACGTGGCAGCACGATATGTTCGCCGATTCGGGTTCTGCATTCCCTGTGCAAGCTGGGAGAGCCTCCGCCATTCAGACTGGGACGAA
GCTTTACATATCTAATTTGGATTACGGCGTTTCTAACGAAGATATTAAGTGGCGGTGTTTGCAGGAACTTTTTTCTGAAGTTGGTGAGATGAAACGTTACGGGATCCACT
ATGACAAGAGTGGGAGATCCAAGGGAACGGCAGAAGTAGTTTTCTCAAGACGACTAGATGCTGTTGCGGCCGTCAAGAAGTACAACAACGTTCAGCTTGATGGAAAACCA
ATGAAGATAGAGATCGTTGGAACTAATATTTCCACGCCTGCTGTTGGTCTTACCACTGCTGCGAACCCTTTCGACAATTTAAATGGGGCTCCTAGAAGTTTCTTGGGAGT
GTATATTGTTACAACTGTTAAAATTTCTTCCCTTTGGTTTATTCATGAATGCTATATTAGTGAACTTACTGTATATGACTTTTGGAATACTTTAGGTGGTAGTATGCGGT
TGTATGTCTTCTTGGTGAAATCTGTGCATTATGCATTACAAATATGCATTATGCATTTGTACTTGAAACAAAATTTTAAAGATTTGTGTAATGATTATGCCCTTGTATTC
AAATATCTTTATTATGAACTCAGTTTGGCTTTGTTGACATCCATTTTATATAATCAGTGGTACTATCTATATCATAATGATGTTCCTTTTCATACGGAAACAGGCAGCAA
GGTAGGGGTGGTCCACCATCACGTCAACGTGGTCGTGGTTTTGGAAGAGGGCGTGGGCGGGGAAGAGGTCCAAGTGAAAAAGTATCCGCTGAAGATCTTGATGCTGACTT
GGAAAACTTTTGGGCTTTTCTATTGTTTAGCTTTCCCCGATCACATCTGCGAGGATGACATTGGTTGGAATAAAACAACAGCTGCAAATTATACAACAAATGAGTTGGGG
TTCCCCGTTGAGGGAACTTCAACTCACTAAAACTAATATATAAGCATTGAACTTGTCAAGACTAACGTGCTAAAAGATGAACAGTCTTCATAGCTATGGGATCCGACTTG
AGTCAATTGATTTAAAGAAACTTCCAAGATGAATTTGTTGCCTTGGTCTATGGGATCGTCATCTTTGCCTGCGTTTTATAGTAAAAATGCCTGCTAACAGGTGATATCAT
TATTTTAAAGTAATAGTTGTTCAAACATGTGCAACGTGGTGGTATTCTACTGCTGTTCTTGTGGAGAAAATCCGCAGAGCAATTTGCCGCAACTTGAAGGCCGGTACGCA
TTTGAGGTACCATACCATCACTGTTGAGGTCTGTAGCAAGTTTTTCAATTACTGCTTGCGTGCTAGTCTTGCAATATGATCTCCTTCCTTAGATGATGGATGGCATCATG
AGATTTTGGCCGATTGACGTGCGGTAAAAACTTGGTCTCTCCAGGATCATATACGTATGTGTTTTCATTTGCTTGAATATCTGAAAACAACCTGCCTATGAATTGTGTTC
CCTTGCGTTGCTTACAGATTATAGATTATATGTGGGTATGTCAATTTCAGCTTCTTCGTACTTTCAAACTCCCGATCGACTAAGCTTGGACCAGTATTATGATGGTTTAG
TGTTATTAAAATGTATAGCTTAATTTTTTTTAATTATTTGTAGATGCTGTAATGTGCTAGTGTGATTTATAGACGATAGGGCTTGTATAGTGGTGTAATTTCCGTAATAT
ACATCCAGCGTTGCGTAACTGCTCTGATTAATCCATTCTGAATATCACGAACTCTATATGTATTCTATAAAATCTTTCGAAGTTAAATGTGAATAAAATATCAGCTTGAA
ACTCTTAA
Protein sequenceShow/hide protein sequence
LPPQSLSPQTLPYRRICETLIPFSSHSFLFDCFFTVFLFNYSLNMAAPLDMSLDDIIKNNKKSRSGNSRGRGRGSGPGPVRRFPNRAASRTPYSAPKVCFRVLSRFSSFE
FWIEAPETTWQHDMFADSGSAFPVQAGRASAIQTGTKLYISNLDYGVSNEDIKWRCLQELFSEVGEMKRYGIHYDKSGRSKGTAEVVFSRRLDAVAAVKKYNNVQLDGKP
MKIEIVGTNISTPAVGLTTAANPFDNLNGAPRSFLGVYIVTTVKISSLWFIHECYISELTVYDFWNTLGGSMRLYVFLVKSVHYALQICIMHLYLKQNFKDLCNDYALVF
KYLYYELSLALLTSILYNQWYYLYHNDVPFHTETGSKVGVVHHHVNVVVVLEEGVGGEEVQVKKYPLKILMLTWKTFGLFYCLAFPDHICEDDIGWNKTTAANYTTNELG
FPVEGTSTH