; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023144 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023144
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTHO complex subunit 4A
Genome locationchr7:44913611..44916895
RNA-Seq ExpressionLag0023144
SyntenyLag0023144
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025715 - Chromatin target of PRMT1 protein, C-terminal
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583468.1 THO complex subunit 4A, partial [Cucurbita argyrosperma subsp. sororia]1.5e-11693.83Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+APETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+  NFGN NGF RGG  +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

KAG7019223.1 THO complex subunit 4A [Cucurbita argyrosperma subsp. argyrosperma]5.7e-11693.42Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+APETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+  NFGN NGF RGG  +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAM+IN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_008457549.1 PREDICTED: THO complex subunit 4A [Cucumis melo]5.1e-11793.06Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFR RGGASSGPGPSRRFRNRGLNRA PYS +KAPETAWSHDMFVDHGAAYPS P RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE++FSR ADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PA S A+FGN NGFPRGGRAMGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_022964653.1 THO complex subunit 4B-like [Cucurbita moschata]4.3e-11693.42Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+APETAWSHDMFVDHG+AYPSQPARASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+  NFGN NGF RGG  +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

XP_038895300.1 THO complex subunit 4A-like [Benincasa hispida]1.5e-12196.3Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS AKAPETAWSHDMFVDHGAAYPSQP RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQ+DALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PASS A FGN NGFPRGGRA+GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

TrEMBL top hitse value%identityAlignment
A0A0A0LUQ3 RRM domain-containing protein1.4e-11591.2Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFR RGGASSGPGPSRRFRNRGLNRA PYS +KAPETAWSHDMFVDHGAAYPS P RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGD+KRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PA S A+FGN NGFPRGGRAMGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRG GRGR      GSGSGRG GEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRG-GRGR------GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A1S3C5R6 THO complex subunit 4A2.5e-11793.06Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFR RGGASSGPGPSRRFRNRGLNRA PYS +KAPETAWSHDMFVDHGAAYPS P RASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE++FSR ADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPA+PA S A+FGN NGFPRGGRAMGRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  GSGSGRGRGEKLSAEDLDADL+KYHEEAMQIN
Subjt:  GRGPGRGGRGR--GSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1G9Y5 THO complex subunit 4A-like isoform X23.1e-11290.95Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        M +PLD SLDDIIKNNKKSGSSNFRGRGGASSGP PSRRF NRGLNRAAPYS AKAPET WSHD+FVDHG AYPS PARAS IETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVG NIVTP LPASS  NFGNS+GF RGGRA+GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGR  G+GRG GEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1HLI4 THO complex subunit 4B-like2.1e-11693.42Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+APETAWSHDMFVDHG+AYPSQPARASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+  NFGN NGF RGG  +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRGRGS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

A0A6J1I3N4 THO complex subunit 4A-like4.7e-11693.42Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
        MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS A+APETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSN

Query:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR
        EDIKELFSEVGDLKRYSINYDKSGRSKGTAE+VFSRQADALAAIKRYNNVQLDGK MKLEIVGTNIVTPA+PAS+  NFGN NGF RGG  +GRNRGGGR
Subjt:  EDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGR

Query:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        GRGPGRGGRG GS S RGRGEKLSAEDLDADLEKYHEEAMQIN
Subjt:  GRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

SwissProt top hitse value%identityAlignment
B5FXN8 THO complex subunit 49.6e-4245.95Show/hide
Query:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGGASSGPGPSR-----------RFRNR-------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQP
        MA+ +DMSLDDIIK N+       G    RGRGG + G GP R             RNR       G NR APYS  K     W HD+F D G       
Subjt:  MAEPLDMSLDDIIKNNKKS-----GSSNFRGRGGASSGPGPSR-----------RFRNR-------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQP

Query:  ARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA
           + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA+V F R+ADAL A+K+YN V LDG+PM +++V + I T   PA S+ 
Subjt:  ARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA

Query:  NFGNSNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
           N  G  R    +G   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  NFGNSNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

O08583 THO complex subunit 42.6e-3945.21Show/hide
Query:  MAEPLDMSLDDIIKNNKKS----GSSNFRGRGGASSGPGPSRR-----------FRNR----------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYPS
        MA+ +DMSLDDIIK N+      G    RGR G+  G G + +            RNR          G NR APYS  K     W HD+F D G     
Subjt:  MAEPLDMSLDDIIKNNKKS----GSSNFRGRGGASSGPGPSRR-----------FRNR----------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYPS

Query:  QPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASS
             + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA+V F R+ADAL A+K+YN V LDG+PM +++V + I T   PA S
Subjt:  QPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASS

Query:  IANFGNSNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
        I    N  G  R  R  G   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  IANFGNSNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

Q3T0I4 THO complex subunit 42.0e-3944.27Show/hide
Query:  MAEPLDMSLDDIIKNNKKS----GSSNFRGRGGASSGPGPSRR-----------FRNR-----------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYP
        MA+ +DMSLDDIIK N+      G    RGR G+  G G   +            RNR           G NR APYS  K     W HD+F D G    
Subjt:  MAEPLDMSLDDIIKNNKKS----GSSNFRGRGGASSGPGPSRR-----------FRNR-----------GLNRAAPYSAAKAPETAWSHDMFVDHGAAYP

Query:  SQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPAS
              + +ETG KL VSNLD+GVS+ DI+ELF+E G LK+ +++YD+SGRS GTA+V F R+ADAL A+K+YN V LDG+PM +++V + I T   PA 
Subjt:  SQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPAS

Query:  SIANFGNSNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH
        S+    N  G  R   + G   GGG  RG   G RGRG G+GR   ++LSAE+LDA L+ Y+
Subjt:  SIANFGNSNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYH

Q8L719 THO complex subunit 4B2.2e-6256.36Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS----AAKAPETAWSHDMFVD-------HGAAYPSQPAR
        M+  LDMSLDDIIK+N+K           G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +D+F          G    +    
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS----AAKAPETAWSHDMFVD-------HGAAYPSQPAR

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA--
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+  PALP  + A  
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA--

Query:  -----------------NF-GNSNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                         NF GN NG  RG   G  MGR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  -----------------NF-GNSNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

Q8L773 THO complex subunit 4A3.5e-6863.31Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +AKAPE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP-RGGRAM-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN GNSNG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP-RGGRAM-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

Arabidopsis top hitse value%identityAlignment
AT5G02530.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.6e-6356.36Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS----AAKAPETAWSHDMFVD-------HGAAYPSQPAR
        M+  LDMSLDDIIK+N+K           G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +D+F          G    +    
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS----AAKAPETAWSHDMFVD-------HGAAYPSQPAR

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA--
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+  PALP  + A  
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA--

Query:  -----------------NF-GNSNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                         NF GN NG  RG   G  MGR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  -----------------NF-GNSNGFPRG---GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G02530.2 RNA-binding (RRM/RBD/RNP motifs) family protein9.2e-6456.75Show/hide
Query:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS----AAKAPETAWSHDMFVD-------HGAAYPSQPAR
        M+  LDMSLDDIIK+N+K           G +N  GRGG+ S  GPSRRF NR   R APYS      +A +  W +D+F          G    +    
Subjt:  MAEPLDMSLDDIIKNNKK----------SGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYS----AAKAPETAWSHDMFVD-------HGAAYPSQPAR

Query:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA--
         S+IETGTKLY+SNLDYGVSNEDIKELFSEVGDLKRY I+YD+SGRSKGTAEVVFSR+ DALAA+KRYNNVQLDGK MK+EIVGTN+  PALP  + A  
Subjt:  ASAIETGTKLYVSNLDYGVSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIA--

Query:  -----------------NF-GNSNGFPRG-GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ
                         NF GN NG  RG G  MGR RGGG G G  RGGR      GRGSG GRGR E +SAEDLDA+L+KYH+EAM+
Subjt:  -----------------NF-GNSNGFPRG-GRAMGRNRGGGRGRGPGRGGR------GRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQ

AT5G59950.1 RNA-binding (RRM/RBD/RNP motifs) family protein2.5e-6963.31Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +AKAPE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP-RGGRAM-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN GNSNG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP-RGGRAM-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.3 RNA-binding (RRM/RBD/RNP motifs) family protein8.9e-6762.5Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +  APE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP-RGGRAM-GRN
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN GNSNG P RGG+   G+ 
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP-RGGRAM-GRN

Query:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
        RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  RGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN

AT5G59950.5 RNA-binding (RRM/RBD/RNP motifs) family protein3.3e-6963.05Show/hide
Query:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG
        M+  LDMSLDD+I  N+KS       RG G+ SGPGP+RR   NR   R+APY +AKAPE+ W HDMF D    + S   R+SA IETGTKLY+SNLDYG
Subjt:  MAEPLDMSLDDIIKNNKKSGSSNFRGRG-GASSGPGPSRRFR-NRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASA-IETGTKLYVSNLDYG

Query:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP--RGGRAM-GR
        V NEDIKELF+EVG+LKRY++++D+SGRSKGTAEVV+SR+ DALAA+K+YN+VQLDGKPMK+EIVGTN+ T A P+   AN GNSNG P  RGG+   G+
Subjt:  VSNEDIKELFSEVGDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFP--RGGRAM-GR

Query:  NRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN
         RGGGRG G GRGG GRG   G+G  EK+SAEDLDADL+KYH   M+ N
Subjt:  NRGGGRGRGPGRGGRGRGSGSGRGRGEKLSAEDLDADLEKYHEEAMQIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGCCTCTCGACATGAGCTTAGACGATATCATCAAGAACAACAAGAAATCCGGATCCTCAAATTTCAGGGGTCGTGGCGGAGCTTCTTCTGGACCAGGTCCCTC
TCGCCGCTTTCGCAATCGCGGTCTTAATAGAGCAGCGCCCTATTCTGCAGCCAAGGCGCCCGAGACGGCTTGGTCACACGACATGTTTGTAGATCATGGTGCGGCATATC
CTTCACAGCCTGCACGGGCCTCTGCTATCGAAACTGGCACGAAGCTTTATGTTTCGAATTTGGATTATGGTGTTTCCAATGAGGATATCAAGGAACTGTTTTCTGAAGTT
GGCGATCTAAAACGGTATTCCATAAATTATGACAAAAGTGGGAGATCAAAGGGAACAGCAGAAGTAGTTTTTTCACGACAAGCAGATGCTTTAGCTGCTATAAAGAGATA
CAACAACGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATTGTGGGAACGAACATCGTGACACCTGCTCTGCCTGCTTCTTCGATTGCCAATTTTGGAAATTCAAATG
GATTTCCGAGAGGTGGACGTGCAATGGGCCGAAACCGGGGTGGTGGACGAGGTCGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGCAGTGGCAGTGGTAGAGGTCGTGGT
GAGAAGTTATCGGCCGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCAATGCAGATCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAGCCTCTCGACATGAGCTTAGACGATATCATCAAGAACAACAAGAAATCCGGATCCTCAAATTTCAGGGGTCGTGGCGGAGCTTCTTCTGGACCAGGTCCCTC
TCGCCGCTTTCGCAATCGCGGTCTTAATAGAGCAGCGCCCTATTCTGCAGCCAAGGCGCCCGAGACGGCTTGGTCACACGACATGTTTGTAGATCATGGTGCGGCATATC
CTTCACAGCCTGCACGGGCCTCTGCTATCGAAACTGGCACGAAGCTTTATGTTTCGAATTTGGATTATGGTGTTTCCAATGAGGATATCAAGGAACTGTTTTCTGAAGTT
GGCGATCTAAAACGGTATTCCATAAATTATGACAAAAGTGGGAGATCAAAGGGAACAGCAGAAGTAGTTTTTTCACGACAAGCAGATGCTTTAGCTGCTATAAAGAGATA
CAACAACGTTCAGCTAGATGGGAAACCCATGAAGTTGGAGATTGTGGGAACGAACATCGTGACACCTGCTCTGCCTGCTTCTTCGATTGCCAATTTTGGAAATTCAAATG
GATTTCCGAGAGGTGGACGTGCAATGGGCCGAAACCGGGGTGGTGGACGAGGTCGTGGTCCTGGAAGAGGAGGGCGTGGACGTGGCAGTGGCAGTGGTAGAGGTCGTGGT
GAGAAGTTATCGGCCGAAGATCTAGATGCTGATTTGGAGAAGTACCATGAAGAAGCAATGCAGATCAATTAA
Protein sequenceShow/hide protein sequence
MAEPLDMSLDDIIKNNKKSGSSNFRGRGGASSGPGPSRRFRNRGLNRAAPYSAAKAPETAWSHDMFVDHGAAYPSQPARASAIETGTKLYVSNLDYGVSNEDIKELFSEV
GDLKRYSINYDKSGRSKGTAEVVFSRQADALAAIKRYNNVQLDGKPMKLEIVGTNIVTPALPASSIANFGNSNGFPRGGRAMGRNRGGGRGRGPGRGGRGRGSGSGRGRG
EKLSAEDLDADLEKYHEEAMQIN