; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021434 (gene) of Snake gourd v1 genome

Gene IDTan0021434
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTHO complex subunit 4A
Genome locationLG01:35069802..35074893
RNA-Seq ExpressionTan0021434
SyntenyTan0021434
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583468.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. sororia]7.0e-11492.62Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRG ASSGPGPSRRF NRGLNRAAPYS A+APETAWSHDMFVDHGAAYPS PAR+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVT PA+PAS+N NFGN NGF RGG VLGRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGRG+ S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

KAG7019223.1 THO complex subunit 4A [Cucurbita argyrosperma subsp. argyrosperma]1.6e-11392.62Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRG ASSGPGPSRRF NRGLNRAAPYS A+APETAWSHDMFVDHGAAYPS PAR+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPAS+N NFGN NGF RGG VLGRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGRG+ S RGRGEKLSAEDLDADLEKYHEEAM+IN
Subjt:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_008457549.1 PREDICTED: THO complex subunit 4A [Cucumis melo]3.1e-11491.06Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFR RG ASSGPGPSRRF NRGLNRA PYS +KAPETAWSHDMFVDHGAAYPSHP R+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE++FSR ADALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPA SNA+FGN+NGFPRGGR +GRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGR--GNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGR  G+GSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  RGRGPGRGGRGR--GNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_022964653.1 THO complex subunit 4B-like [Cucurbita moschata]1.2e-11392.62Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRG ASSGPGPSRRF NRGLNRAAPYS A+APETAWSHDMFVDHG+AYPS PAR+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPAS+N NFGN NGF RGG VLGRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGRG+ S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_038895300.1 THO complex subunit 4A-like [Benincasa hispida]3.0e-11794.26Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRG ASSGPGPSRRF NRGLNRAAPYS AKAPETAWSHDMFVDHGAAYPS P R+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQ+DALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPASSNA FGN NGFPRGGR LGRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGRG+GSGRGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

TrEMBL top hitse value%identityAlignment
A0A0A0LUQ3 RRM domain-containing protein3.2e-11289.24Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFR RG ASSGPGPSRRF NRGLNRA PYS +KAPETAWSHDMFVDHGAAYPSHP R+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGD+KRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPA SNA+FGN NGFPRGGR +GRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRG-GRGR------GNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRG GRGR      G+GSGRG GEKLSAEDLDADL+KYHEEAMQIN
Subjt:  RGRGPGRG-GRGR------GNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A1S3C5R6 THO complex subunit 4A1.5e-11491.06Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFR RG ASSGPGPSRRF NRGLNRA PYS +KAPETAWSHDMFVDHGAAYPSHP R+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE++FSR ADALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPA SNA+FGN+NGFPRGGR +GRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGR--GNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGR  G+GSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  RGRGPGRGGRGR--GNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1G9Y5 THO complex subunit 4A-like isoform X26.6e-11089.75Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        M +PLD SLDDIIKNNKKSGSSNFRGRG ASSGP PSRRF NRGLNRAAPYS AKAPET WSHD+FVDHG AYPSHPAR+S IETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGK MKLEIVG NIVT P +PASSN NFGN +GF RGGR LGRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGRGN  GRG GEKLSAEDLDADLEKYHEEAMQIN
Subjt:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1HLI4 THO complex subunit 4B-like5.7e-11492.62Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRG ASSGPGPSRRF NRGLNRAAPYS A+APETAWSHDMFVDHG+AYPS PAR+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPAS+N NFGN NGF RGG VLGRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRGRG+ S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1I3N4 THO complex subunit 4A-like7.5e-11492.62Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRG ASSGPGPSRRF NRGLNRAAPYS A+APETAWSHDMFVDHGAAYPS PAR+SAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVT PAVPAS+N NFGN NGF RGG VLGRNRGGG
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGG

Query:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGRGPGRGGRG G+ S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  RGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

SwissProt top hitse value%identityAlignment
B5FXN8 THO complex subunit 45.0e-3846.15Show/hide
Query:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGRASSGPGPSR------RFG-----NR-------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHP
        MA+ +DMSLDDIIK N+       G    RGRG  + G GP R      R G     NR       G NR APYS  K     W HD+F D G       
Subjt:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGRASSGPGPSR------RFG-----NR-------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHP

Query:  ARSSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSN
           + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA+V F R+ADAL A+K+YN V LDG+ M +++V + I T    PA S 
Subjt:  ARSSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSN

Query:  ANFGNYNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYH
            N  G  R   VLG   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  ANFGNYNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYH

Q3T0I4 THO complex subunit 43.2e-3744.87Show/hide
Query:  MAEPLDMSLDDIIKNNK--KSGSSNFRGRGRASS----------------GPGPSRR--------FGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYP
        MA+ +DMSLDDIIK N+  + G    RGRGRA S                G GP R          G  G NR APYS  K     W HD+F D G    
Subjt:  MAEPLDMSLDDIIKNNK--KSGSSNFRGRGRASS----------------GPGPSRR--------FGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYP

Query:  SHPARSSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPA
              + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA+V F R+ADAL A+K+YN V LDG+ M +++V + I T    PA
Subjt:  SHPARSSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPA

Query:  SSNANFGNYNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYH
         S     N  G  R     G   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  SSNANFGNYNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYH

Q6NQ72 THO complex subunit 4D1.4e-3741.78Show/hide
Query:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGRASS--GPGPSRRFGNRGLNRAAPYS------AAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGT
        M+  L+M+LD+I+K  K +     G S  RGRGR     G GP+RR G   +N A P S        +     W   +F D   A     A +S +E GT
Subjt:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGRASS--GPGPSRRFGNRGLNRAAPYS------AAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGT

Query:  KLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNG--
        +L+V+NLD GV+NEDI+ELFSE+G+++RY+I+YDK+GR  GTAEVV+ R++DA  A+K+YNNV LDG+ M+LEI+G N  +   +    N N    NG  
Subjt:  KLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNG--

Query:  -----FPRGGRVLGRNRGGGRGRGP----------------------------GRGGRGRGNGSGRGRGEK---LSAEDLDADLEKYHEEAM
               +GG   GR RGG  GRGP                            GRG  GRG G GRG G+K    SA DLD DLE YH +AM
Subjt:  -----FPRGGRVLGRNRGGGRGRGP----------------------------GRGGRGRGNGSGRGRGEK---LSAEDLDADLEKYHEEAM

Q8L719 THO complex subunit 4B8.4e-6255.67Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYS----AAKAPETAWSHDMFVDH---GAAYPSHPAR----
        M+  LDMSLDDIIK+N+K           G +N  GRG + S  GPSRRF NR   R APYS      +A +  W +D+F       AA+  H       
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYS----AAKAPETAWSHDMFVDH---GAAYPSHPAR----

Query:  SSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVP------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+ + PA+P      
Subjt:  SSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVP------

Query:  -------------ASSNANF-GNYNGFPRG---GRVLGRNRGGGRGRGPGRGGRG-RGNG----SGRGRGEKLSAEDLDADLEKYHEEAMQ
                      + N NF GN+NG  RG   G  +GR RGGG G G  RGGRG RG G     GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  -------------ASSNANF-GNYNGFPRG---GRVLGRNRGGGRGRGPGRGGRG-RGNG----SGRGRGEKLSAEDLDADLEKYHEEAMQ

Q8L773 THO complex subunit 4A1.6e-6562.8Show/hide
Query:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY
        M+  LDMSLDD+I  N+KS  G+   RG G + SGPGP+RR   NR   R+APY +AKAPE+ W HDMF D    + S   RSSA IETGTKLY+SNLDY
Subjt:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY

Query:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP-RGGRVL-G
        GV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGK MK+EIVGTN+ T  A P+   AN GN NG P RGG+   G
Subjt:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP-RGGRVL-G

Query:  RNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        + RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein6.0e-6355.67Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYS----AAKAPETAWSHDMFVDH---GAAYPSHPAR----
        M+  LDMSLDDIIK+N+K           G +N  GRG + S  GPSRRF NR   R APYS      +A +  W +D+F       AA+  H       
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYS----AAKAPETAWSHDMFVDH---GAAYPSHPAR----

Query:  SSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVP------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+ + PA+P      
Subjt:  SSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVP------

Query:  -------------ASSNANF-GNYNGFPRG---GRVLGRNRGGGRGRGPGRGGRG-RGNG----SGRGRGEKLSAEDLDADLEKYHEEAMQ
                      + N NF GN+NG  RG   G  +GR RGGG G G  RGGRG RG G     GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  -------------ASSNANF-GNYNGFPRG---GRVLGRNRGGGRGRGPGRGGRG-RGNG----SGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G02530.2 RNA-binding (RRM/RBD/RNP motifs) family protein3.5e-6356.06Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYS----AAKAPETAWSHDMFVDH---GAAYPSHPAR----
        M+  LDMSLDDIIK+N+K           G +N  GRG + S  GPSRRF NR   R APYS      +A +  W +D+F       AA+  H       
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYS----AAKAPETAWSHDMFVDH---GAAYPSHPAR----

Query:  SSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVP------
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+ + PA+P      
Subjt:  SSAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVP------

Query:  -------------ASSNANF-GNYNGFPRG-GRVLGRNRGGGRGRGPGRGGRG-RGNG----SGRGRGEKLSAEDLDADLEKYHEEAMQ
                      + N NF GN+NG  RG G  +GR RGGG G G  RGGRG RG G     GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  -------------ASSNANF-GNYNGFPRG-GRVLGRNRGGGRGRGPGRGGRG-RGNG----SGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.2e-6662.8Show/hide
Query:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY
        M+  LDMSLDD+I  N+KS  G+   RG G + SGPGP+RR   NR   R+APY +AKAPE+ W HDMF D    + S   RSSA IETGTKLY+SNLDY
Subjt:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY

Query:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP-RGGRVL-G
        GV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGK MK+EIVGTN+ T  A P+   AN GN NG P RGG+   G
Subjt:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP-RGGRVL-G

Query:  RNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        + RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein4.1e-6462Show/hide
Query:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY
        M+  LDMSLDD+I  N+KS  G+   RG G + SGPGP+RR   NR   R+APY +  APE+ W HDMF D    + S   RSSA IETGTKLY+SNLDY
Subjt:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY

Query:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP-RGGRVL-G
        GV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGK MK+EIVGTN+ T  A P+   AN GN NG P RGG+   G
Subjt:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP-RGGRVL-G

Query:  RNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        + RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein1.5e-6662.55Show/hide
Query:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY
        M+  LDMSLDD+I  N+KS  G+   RG G + SGPGP+RR   NR   R+APY +AKAPE+ W HDMF D    + S   RSSA IETGTKLY+SNLDY
Subjt:  MAEPLDMSLDDIIKNNKKS--GSSNFRGRGRASSGPGPSRRFG-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSA-IETGTKLYVSNLDY

Query:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP--RGGRVL-
        GV NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGK MK+EIVGTN+ T  A P+   AN GN NG P  RGG+   
Subjt:  GVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFP--RGGRVL-

Query:  GRNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        G+ RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  GRNRGGGRGRGPGRGGRGRGNGSGRGRGEKLSAEDLDADLEKYHEEAMQIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGCCTCTCGACATGAGCTTAGACGATATCATCAAGAACAACAAGAAATCCGGATCCTCAAACTTCAGGGGTCGTGGCAGAGCTTCTTCTGGACCAGGTCCCTC
TCGCCGCTTTGGCAATCGCGGTCTTAATAGAGCAGCGCCCTATTCTGCAGCCAAGGCGCCCGAGACGGCTTGGTCACACGACATGTTTGTAGATCATGGCGCGGCATATC
CTTCACATCCTGCACGGTCCTCTGCTATCGAAACTGGCACGAAGCTTTATGTTTCGAATTTGGATTATGGTGTTTCCAACGAGGATATCAAGGAACTGTTTTCTGAAGTT
GGCGATCTAAAACGGTATTCTATCAATTATGATAAAAGTGGGAGATCAAAGGGAACAGCAGAAGTAGTTTTCTCACGACAAGCTGATGCTCTAGCTGCTATAAAAAGATA
CAACAATGTTCAGCTAGATGGGAAGGCCATGAAGTTGGAGATTGTGGGAACAAACATCGTGACACCACCTGCTGTACCTGCTTCTTCAAATGCCAATTTTGGGAATTATA
ATGGATTTCCGAGAGGTGGACGTGTACTGGGCCGAAACCGGGGTGGTGGACGAGGTCGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGCAATGGCAGTGGTAGAGGTCGT
GGTGAGAAGTTATCAGCCGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCAATGCAGATCAATTAA
mRNA sequenceShow/hide mRNA sequence
CCCATTGCTCATAACATAAGATCGTTCCGTTGAAACCCTAGAATTCTCTCTCCTCTGTGAATCTCTGATTCTCTGATCCACGACCACCACTCTTCCGCCAGCACAATGGC
AGAGCCTCTCGACATGAGCTTAGACGATATCATCAAGAACAACAAGAAATCCGGATCCTCAAACTTCAGGGGTCGTGGCAGAGCTTCTTCTGGACCAGGTCCCTCTCGCC
GCTTTGGCAATCGCGGTCTTAATAGAGCAGCGCCCTATTCTGCAGCCAAGGCGCCCGAGACGGCTTGGTCACACGACATGTTTGTAGATCATGGCGCGGCATATCCTTCA
CATCCTGCACGGTCCTCTGCTATCGAAACTGGCACGAAGCTTTATGTTTCGAATTTGGATTATGGTGTTTCCAACGAGGATATCAAGGAACTGTTTTCTGAAGTTGGCGA
TCTAAAACGGTATTCTATCAATTATGATAAAAGTGGGAGATCAAAGGGAACAGCAGAAGTAGTTTTCTCACGACAAGCTGATGCTCTAGCTGCTATAAAAAGATACAACA
ATGTTCAGCTAGATGGGAAGGCCATGAAGTTGGAGATTGTGGGAACAAACATCGTGACACCACCTGCTGTACCTGCTTCTTCAAATGCCAATTTTGGGAATTATAATGGA
TTTCCGAGAGGTGGACGTGTACTGGGCCGAAACCGGGGTGGTGGACGAGGTCGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGCAATGGCAGTGGTAGAGGTCGTGGTGA
GAAGTTATCAGCCGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCAATGCAGATCAATTAAATCATTTGGTGTCATTTTCTGGGGCTTGATATTCATAACT
TCGTTAGGTGCTTGGTGATAAATGTGATAAGGACACCGTTTTCTATTTTGCGAGTCATATCATGAAGCCCTGACCTTGAGAGAACCGCTAGACCTTGCTATGATGTGGAG
TTTGGGCTTGAATTGTTGTTTGTTGCAGTATACTATAAAGAGGTGTTCATTAATACTTGAATTTGTAATCTCCCGTTACTTATCCATTTTGATCTGCTCCAATCCTATTT
ACTTTCCATGAAAGTTGAACTGGAACTGGGACAGTCGTTCACACTCCTGTACACATGGAAGGATTTGGGTGCATGACATGGGAAGTATTGATCCTAACAAAATCAACCAC
ATCTTCTGGTGTCAGAGGCCAAACTTCTGGTACACTGTTTCTAAATCATAATATTTCCTTGTTAGAATGATAGACATGGCTGACAATGTATTCTGTATATGATAATTCTT
AGTTTCTGACTGCAGAATTTCTGCTTTATTGCTTTGCAGTGATTGGTTTTTTGAATTAAAAGTAATGTGTTAAGATCTTCTACTTATTGGCATGGAGGGAGGGAGGGTGT
GGAGATGATTACCAAGTAAGTGTTCAGGAGTTGGCAATGACTATCGTGGCCCCTGAACCATAGAATTAGTATTGGTATGCAGTTAGAAATCAATGGAGCTTCTACTTTTG
TTTCAGCATTGTGTATGGCTTTTACTTTTCTGCGCTTGTGTGTGACTATTATAGGCTGCAGTCTCGTTTTTGCATATTTTGTTTGAGAACACTCCTGACATTTGCATTCT
ATAATTTGAAAGCAAGGCATATATCATGCAATAAATTTGGGCCTCTTTTCTGTTCATGTTAATTTTAATTTTAAAATCTTTGGAAAGCCTCCTTAATCATGCCTACTCTG
CCATTGACTCCTAGGGGAGAAACTCGAGTGCTGTTGGTCAGAGTAGGACCAACATTGATGCTGAGATATGATTTAGATTTTACAAAACTGGCCCATATTTGCTAGATTCT
GCCTCTACATTATTGTTATTATTCAGATTTGTACCTTTTCACTTTTGTTTGAAGTTTTCTGAAATTTAAATTTAGTTCCCTCACACTTGATGAAGAAATGAAAAATGATT
GATCATCATTTATTCAAGTTTGTCTAAAGTTGATTCACTGAGTCAACATAAGTGTAGAAATCAAGCCTTTTGCAATAGATTGACAGAAGTAGCAATATTACGGTGGAAAT
AATGGAAGCAAAGACATGTTTGAGCTTTACATCTTGCCTCATTTTGCATATCTCTTCATATGTGTAGATGGATTGGGAAACTTCCTTTACATATTGTGAAAAGGAAGTTT
TGGAAGCCATTGATTACCGGTGTCGGTGGGGGCATGCTATCTGAAGAACTTTTTATAATTCAGTTTATGGGATAGAAGTATGTATCGGTCTTTATTCCGAATGAAAGAGC
TTTGTAGGAAAAGGGCAATGTGAGAATCATGGTGCAGTAGCTGGAGCTTCCTTTCAAGTACAAACGAGGTTGTAACAACTAGGGGTGAAAACAGGTGGGGTTGGGTTTGC
TTTGTAGCAAGACCGACCCAATCCAAGTTGGTTTTAGATGAAAGAGTCTGACCCGACCAAACACTTTTCGAACAATTTCAAATGAAGCTCAGCTTACCCTCTCTCTAGGT
TGTCTCTTTTTGGAAGGTTAAGGAATATCGGTGGGTGGTTTGGATGGGGGTGCCCCTAGGAATGTATGCATTTTTGCAGCTTTGAAATTATTGGAAGTGCTTTTTTGTAT
GTCTTACAGATTCTTTAGATAGATGGGATTCTGGTTTATGCCTAGTGAGTGCTCCTTTAAAACTCTAACGTGTTTGACATAGCTAGCTGAATGGCAGATATCTCTCTCTA
ATGATAGTTGTAGCTTGCTCAACTGCCA
Protein sequenceShow/hide protein sequence
MAEPLDMSLDDIIKNNKKSGSSNFRGRGRASSGPGPSRRFGNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSHPARSSAIETGTKLYVSNLDYGVSNEDIKELFSEV
GDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKAMKLEIVGTNIVTPPAVPASSNANFGNYNGFPRGGRVLGRNRGGGRGRGPGRGGRGRGNGSGRGR
GEKLSAEDLDADLEKYHEEAMQIN