CuGenDBv2

MS026075 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS026075
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: U1 small nuclear ribonucleoprotein A isoform X2
Genome location: scaffold239:2010165..2017063
RNA-Seq Expression: MS026075
Synteny: MS026075
Gene Ontology terms: GO:0003729 - mRNA binding (molecular function)
InterPro domains: IPR000504 - RNA recognition motif domain
                  IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
                  IPR035979 - RNA-binding domain superfamily


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6604168.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. sororia]; e value: 6.5e-158; % identity: 87.92
Query:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES
        MDD+ S+Y PPPQ  PP QPSGLEAPHYPYYQVPPPPSSAP SQHYLAQHPPTFA+YGLPFLPHAAS NEVRTLFIAGLP+DVKPREIYNLFREFPGYES
Subjt:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES

Query:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT
        SHLRSPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVS+FSRSTPD GLGSTHMSGMGNSAYNT
Subjt:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT

Query:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF
        IGYPSAQSHGSFD+K+VNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQ FSRCPGFLKLKMQSTYGAPVAFVDF
Subjt:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF

Query:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY
        QDTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY

XP_022133116.1 protein WHI4 [Momordica charantia]; e value: 1.1e-168; % identity: 93.03
Query:  MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS
        MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS
Subjt:  MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS

Query:  HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI
        HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI
Subjt:  HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI

Query:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ
        GYPSAQSHGSFDNKSVNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ
Subjt:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ

Query:  DTACSTGALNHLQGSILYSSPPGEGMRLEY
        DTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  DTACSTGALNHLQGSILYSSPPGEGMRLEY

XP_022949757.1 protein WHI4 isoform X2 [Cucurbita moschata]; e value: 6.5e-158; % identity: 87.92
Query:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES
        MDD+ S+Y PPPQ  PP QPSGLEAPHYPYYQVPPPPSSAP SQHYLAQHPPTFA+YGLPFLPHAAS NEVRTLFIAGLP+DVKPREIYNLFREFPGYES
Subjt:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES

Query:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT
        SHLRSPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVS+FSRSTPD GLGSTHMSGMGNSAYNT
Subjt:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT

Query:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF
        IGYPSAQSHGSFD+K+VNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQ FSRCPGFLKLKMQSTYGAPVAFVDF
Subjt:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF

Query:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY
        QDTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY

XP_023543723.1 protein WHI4 isoform X2 [Cucurbita pepo subsp. pepo]; e value: 3.8e-158; % identity: 87.92
Query:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES
        MDD+ S+Y PPPQ  PP QPSGLEAPHYPYYQVPPPPSSAP SQHYLAQHPPTFA+YGLPFLPHAAS NEVRTLFIAGLPEDVKPREIYNLFREFPGYES
Subjt:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES

Query:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT
        SHLRSPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVS+FSRSTPD GLGSTHMSGMGNSAYNT
Subjt:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT

Query:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF
        IGYPSAQSHGSFD+K+VNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQEL+Q FSRCPGFLKLKMQSTYGAPVAFVDF
Subjt:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF

Query:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY
        QDTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY

XP_038881694.1 protein WHI4 isoform X2 [Benincasa hispida]; e value: 4.9e-158; % identity: 87.88
Query:  MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS
        MDDMAS+YP P   PPPQPSGLE PHYP+YQVPPPPSSAPPSQHYL+QHPPTFA+YGLPFLPHAAS NEVRTLFIAGLPEDVKPREIYNLFREFPGYESS
Subjt:  MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS

Query:  HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI
        HLR+PTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVS+FSRSTPD GLGSTHMSGMGNSAYNTI
Subjt:  HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI

Query:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ
        GYPSAQSHGSFDNK+VNDTVAANV+                       PQNPPCPTLFVANLGPSCTEQELIQ FSRCPGFLKLKMQSTYGAPVAFVDFQ
Subjt:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ

Query:  DTACSTGALNHLQGSILYSSPPGEGMRLEY
        DT CSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  DTACSTGALNHLQGSILYSSPPGEGMRLEY

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1BU41 protein WHI4; e value: 5.1e-169; % identity: 93.03
Query:  MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS
        MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS
Subjt:  MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESS

Query:  HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI
        HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI
Subjt:  HLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI

Query:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ
        GYPSAQSHGSFDNKSVNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ
Subjt:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ

Query:  DTACSTGALNHLQGSILYSSPPGEGMRLEY
        DTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  DTACSTGALNHLQGSILYSSPPGEGMRLEY

A0A6J1GCW9 U1 small nuclear ribonucleoprotein A isoform X1; e value: 4.5e-157; % identity: 87.65
Query:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES
        MDD+ S+Y PPPQ  PP QPSGLEAPHYPYYQVPPPPSSAP SQHYLAQHPPTFA+YGLPFLPHAAS NEVRTLFIAGLP+DVKPREIYNLFREFPGYES
Subjt:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES

Query:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPD-SGLGSTHMSGMGNSAYN
        SHLRSPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVS+FSRSTPD +GLGSTHMSGMGNSAYN
Subjt:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPD-SGLGSTHMSGMGNSAYN

Query:  TIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVD
        TIGYPSAQSHGSFD+K+VNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQ FSRCPGFLKLKMQSTYGAPVAFVD
Subjt:  TIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVD

Query:  FQDTACSTGALNHLQGSILYSSPPGEGMRLEY
        FQDTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  FQDTACSTGALNHLQGSILYSSPPGEGMRLEY

A0A6J1GD02 protein WHI4 isoform X2; e value: 3.1e-158; % identity: 87.92
Query:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES
        MDD+ S+Y PPPQ  PP QPSGLEAPHYPYYQVPPPPSSAP SQHYLAQHPPTFA+YGLPFLPHAAS NEVRTLFIAGLP+DVKPREIYNLFREFPGYES
Subjt:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES

Query:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT
        SHLRSPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVS+FSRSTPD GLGSTHMSGMGNSAYNT
Subjt:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT

Query:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF
        IGYPSAQSHGSFD+K+VNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQ FSRCPGFLKLKMQSTYGAPVAFVDF
Subjt:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF

Query:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY
        QDTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY

A0A6J1IMS7 U1 small nuclear ribonucleoprotein A isoform X2; e value: 1.0e-156; % identity: 87.65
Query:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES
        MDD+ S+Y PPPQ  PP QPSGLEAPHYPYYQVPPPPSSAP SQHYLAQHPPTFA+YGLPFLPHAAS NEVRTLFIAGLPEDVKPREIYNLFREFPGYES
Subjt:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES

Query:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPD-SGLGSTHMSGMGNSAYN
        SHLRSPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+FSR TPD +GLGSTHMSGMGNSAYN
Subjt:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPD-SGLGSTHMSGMGNSAYN

Query:  TIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVD
        TIGYPSAQSHGSFDNK+VNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQ FSRCPGFLKLKMQSTYGAPVAFVD
Subjt:  TIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVD

Query:  FQDTACSTGALNHLQGSILYSSPPGEGMRLEY
        FQDTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  FQDTACSTGALNHLQGSILYSSPPGEGMRLEY

A0A6J1IQG5 protein WHI4 isoform X1; e value: 7.0e-158; % identity: 87.92
Query:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES
        MDD+ S+Y PPPQ  PP QPSGLEAPHYPYYQVPPPPSSAP SQHYLAQHPPTFA+YGLPFLPHAAS NEVRTLFIAGLPEDVKPREIYNLFREFPGYES
Subjt:  MDDMASFYPPPPQ-APPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYES

Query:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT
        SHLRSPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+FSR TPD GLGSTHMSGMGNSAYNT
Subjt:  SHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNT

Query:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF
        IGYPSAQSHGSFDNK+VNDTVAANV                       TPQNPPCPTLFVANLGPSCTEQELIQ FSRCPGFLKLKMQSTYGAPVAFVDF
Subjt:  IGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDF

Query:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY
        QDTACSTGALNHLQGSILYSSPPGEGMRLEY
Subjt:  QDTACSTGALNHLQGSILYSSPPGEGMRLEY

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q6DH13 RNA-binding protein, mRNA-processing factor 2a; e value: 7.0e-14; % identity: 43.33
Query:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F  +  A  A +A+NG+ FD E    L ++ AK+N++  +++
Subjt:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR

Q6ZRY4 RNA-binding protein with multiple splicing 2; e value: 4.1e-14; % identity: 42.55
Query:  ASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR
        A   EVRTLF++GLP D+KPRE+Y LFR F GYE S ++   +  QP  F +F  +  A  A +A+NG+ FD E    L ++ AK+N++  +++
Subjt:  ASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR

Q8VC52 RNA-binding protein with multiple splicing 2; e value: 4.1e-14; % identity: 43.33
Query:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F +F  +  A  A +A+NG+ FD E    L ++ AK+N++  +++
Subjt:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR

Q9W6I1 RNA-binding protein with multiple splicing 2; e value: 9.1e-14; % identity: 43.33
Query:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F  +  A  A +A+NG+ FD E    L ++ AK+N++  +++
Subjt:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR

Q9YGP5 RNA-binding protein with multiple splicing 2; e value: 1.2e-13; % identity: 43.33
Query:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F ++  A  A +A+NG+ FD E    L ++ AK+N++  + +
Subjt:  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTR

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G42240.1 RNA-binding (RRM/RBD/RNP motifs) family protein; e value: 3.6e-74; % identity: 50.89
Query:  MDDMASFYPPPPQAPPPQPSGLEAPHY--PYYQVPPPPSSAP-PSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGY
        MDD+ ++Y                 HY  P    PPPP  +P P     + + PT  + G        + +EVRTLF+AGLPEDVKPREIYNLFREFPGY
Subjt:  MDDMASFYPPPPQAPPPQPSGLEAPHY--PYYQVPPPPSSAP-PSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGY

Query:  ESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAY
        E+SHLRS +   +PFAFAVFSD QSA+  MHA+NGMVFDLEK S L++DLAKSN +SKR+RT+D   G +   K+  ++ +T +SG GS    GM +SAY
Subjt:  ESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAY

Query:  NTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNP-----PCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGA
        NTIGY  AQS G            ANV            GR         T + P     PCPTLF+AN+GP+CTE ELIQ FSRC GFLKLK+Q TYG 
Subjt:  NTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNP-----PCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGA

Query:  PVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEY
        PVAFVDFQD +CS+ AL+ LQG++LYSS  GE +RL+Y
Subjt:  PVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEY

AT2G42240.2 RNA-binding (RRM/RBD/RNP motifs) family protein; e value: 6.6e-76; % identity: 50.14
Query:  MDDMASFYPPPPQAPPPQPSGLEAPHY--PYYQVPPPPSSAP-PSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGY
        MDD+ ++Y                 HY  P    PPPP  +P P     + + PT  + G        + +EVRTLF+AGLPEDVKPREIYNLFREFPGY
Subjt:  MDDMASFYPPPPQAPPPQPSGLEAPHY--PYYQVPPPPSSAP-PSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGY

Query:  ESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAY
        E+SHLRS +   +PFAFAVFSD QSA+  MHA+NGMVFDLEK S L++DLAKSN +SKR+RT+D   G +   K+  ++ +T +SG GS    GM +SAY
Subjt:  ESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAY

Query:  NTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNP-----PCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGA
        NTIGY  AQS G            ANV            GR         T + P     PCPTLF+AN+GP+CTE ELIQ FSRC GFLKLK+Q TYG 
Subjt:  NTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNP-----PCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGA

Query:  PVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEYPSTIFLFFTLY
        PVAFVDFQD +CS+ AL+ LQG++LYSS  GE +RL+YPS + + F  +
Subjt:  PVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEYPSTIFLFFTLY

AT2G42240.3 RNA-binding (RRM/RBD/RNP motifs) family protein; e value: 1.8e-73; % identity: 50.74
Query:  MDDMASFYPPPPQAPPPQPSGLEAPHY--PYYQVPPPPSSAP-PSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGY
        MDD+ ++Y                 HY  P    PPPP  +P P     + + PT  + G        + +EVRTLF+AGLPEDVKPREIYNLFREFPGY
Subjt:  MDDMASFYPPPPQAPPPQPSGLEAPHY--PYYQVPPPPSSAP-PSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGY

Query:  ESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAY
        E+SHLRS +   +PFAFAVFSD QSA+  MHA+NGMVFDLEK S L++DLAKSN +SKR+RT+D   G +   K+  ++ +T +SG GS    GM +SAY
Subjt:  ESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAY

Query:  NTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNP-----PCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGA
        NTIGY  AQS G            ANV            GR         T + P     PCPTLF+AN+GP+CTE ELIQ FSRC GFLKLK+Q TYG 
Subjt:  NTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNP-----PCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGA

Query:  PVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLE
        PVAFVDFQD +CS+ AL+ LQG++LYSS  GE +RL+
Subjt:  PVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLE

AT3G13700.1 RNA-binding (RRM/RBD/RNP motifs) family protein; e value: 3.4e-32; % identity: 35.26
Query:  APHYPY-----YQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLR---------------
        A H PY     YQ+   P   PP                LP L  A  P  + TLF++GLP DVK REI+NLFR   G+ES  L+               
Subjt:  APHYPY-----YQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLR---------------

Query:  ---SPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI
            P       AFA F+  + A+ AM+ +NG+ FD +  S L+++LA+SNSR K      ER GS     +   ++    S       S  G+S  + +
Subjt:  ---SPTQTTQPFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTI

Query:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ
             Q  G+ D+   NDT  +   S      P   G   L            C TLF+ANLGP+CTE EL Q  SR PGF  LK+++  G PVAF DF+
Subjt:  GYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQ

Query:  DTACSTGALNHLQGSILYSSPPGEGMRLE
        +   +T A+NHLQG++L SS  G GM +E
Subjt:  DTACSTGALNHLQGSILYSSPPGEGMRLE

AT3G13700.2 RNA-binding (RRM/RBD/RNP motifs) family protein; e value: 9.3e-38; % identity: 38.18
Query:  LAQHPP--TFATYGLPFLPH---------AASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMV
        +A H P   F  Y L   PH         A  P  + TLF++GLP DVK REI+NLFR   G+ES  L+   +  Q  AFA F+  + A+ AM+ +NG+ 
Subjt:  LAQHPP--TFATYGLPFLPH---------AASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSDQQSAIGAMHAVNGMV

Query:  FDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPV
        FD +  S L+++LA+SNSR K      ER GS     +   ++    S       S  G+S  + +     Q  G+ D+   NDT  +   S      P 
Subjt:  FDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKSVNDTVAANVVSAIYLVNPV

Query:  NFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEY
          G   L            C TLF+ANLGP+CTE EL Q  SR PGF  LK+++  G PVAF DF++   +T A+NHLQG++L SS  G GM +EY
Subjt:  NFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEY


Sequences
CDS sequence
ATGGACGACATGGCGAGTTTCTATCCGCCACCACCGCAAGCACCACCGCCGCAGCCTTCTGGTCTTGAAGCTCCTCACTATCCCTATTACCAGGTGCCTCCTCCTCCTTC
TTCAGCACCGCCGTCTCAGCATTACCTCGCTCAACACCCGCCCACCTTCGCTACCTATGGCTTACCGTTTTTACCTCACGCAGCGTCCCCTAATGAGGTCCGAACCCTAT
TCATAGCCGGCCTTCCTGAAGATGTCAAGCCCCGAGAAATCTACAATCTCTTCCGGGAGTTTCCGGGATACGAGTCCTCTCATCTTCGGAGCCCCACGCAGACGACGCAG
CCATTTGCATTTGCTGTATTCTCGGACCAGCAGTCTGCCATTGGTGCAATGCATGCAGTAAACGGCATGGTATTTGATCTTGAGAAGCAGTCAGTACTATACGTTGATTT
AGCTAAATCCAATTCAAGATCAAAGCGGACAAGGACAGAGGATGAAAGATATGGATCTGATAAGAAAGCTAAAGTATCTATGTTTTCAAGGAGTACACCTGATTCTGGTC
TTGGCAGCACTCACATGTCTGGAATGGGTAATTCTGCTTACAACACGATTGGTTATCCATCTGCACAAAGCCATGGAAGCTTTGATAACAAAAGTGTAAATGATACAGTG
GCTGCAAATGTGGTAAGCGCTATTTATTTGGTAAATCCTGTTAACTTTGGCAGAATAATTCTTCATAACTTCATTCCACAGACTCCTCAAAATCCCCCATGTCCAACACT
TTTTGTGGCAAATCTTGGGCCAAGTTGCACAGAGCAAGAGCTAATTCAAACTTTTTCGAGATGCCCGGGTTTTTTGAAACTGAAGATGCAGAGCACATATGGGGCCCCAG
TTGCTTTTGTTGATTTTCAGGATACTGCCTGTTCAACTGGAGCACTGAACCATCTGCAAGGTTCAATTCTGTACTCATCACCTCCTGGGGAGGGCATGCGATTGGAGTAT
CCTTCTACAATATTTTTATTCTTTACATTGTATTGCTAA
mRNA sequence
ATGGACGACATGGCGAGTTTCTATCCGCCACCACCGCAAGCACCACCGCCGCAGCCTTCTGGTCTTGAAGCTCCTCACTATCCCTATTACCAGGTGCCTCCTCCTCCTTC
TTCAGCACCGCCGTCTCAGCATTACCTCGCTCAACACCCGCCCACCTTCGCTACCTATGGCTTACCGTTTTTACCTCACGCAGCGTCCCCTAATGAGGTCCGAACCCTAT
TCATAGCCGGCCTTCCTGAAGATGTCAAGCCCCGAGAAATCTACAATCTCTTCCGGGAGTTTCCGGGATACGAGTCCTCTCATCTTCGGAGCCCCACGCAGACGACGCAG
CCATTTGCATTTGCTGTATTCTCGGACCAGCAGTCTGCCATTGGTGCAATGCATGCAGTAAACGGCATGGTATTTGATCTTGAGAAGCAGTCAGTACTATACGTTGATTT
AGCTAAATCCAATTCAAGATCAAAGCGGACAAGGACAGAGGATGAAAGATATGGATCTGATAAGAAAGCTAAAGTATCTATGTTTTCAAGGAGTACACCTGATTCTGGTC
TTGGCAGCACTCACATGTCTGGAATGGGTAATTCTGCTTACAACACGATTGGTTATCCATCTGCACAAAGCCATGGAAGCTTTGATAACAAAAGTGTAAATGATACAGTG
GCTGCAAATGTGGTAAGCGCTATTTATTTGGTAAATCCTGTTAACTTTGGCAGAATAATTCTTCATAACTTCATTCCACAGACTCCTCAAAATCCCCCATGTCCAACACT
TTTTGTGGCAAATCTTGGGCCAAGTTGCACAGAGCAAGAGCTAATTCAAACTTTTTCGAGATGCCCGGGTTTTTTGAAACTGAAGATGCAGAGCACATATGGGGCCCCAG
TTGCTTTTGTTGATTTTCAGGATACTGCCTGTTCAACTGGAGCACTGAACCATCTGCAAGGTTCAATTCTGTACTCATCACCTCCTGGGGAGGGCATGCGATTGGAGTAT
CCTTCTACAATATTTTTATTCTTTACATTGTATTGCTAA
Protein sequence
MDDMASFYPPPPQAPPPQPSGLEAPHYPYYQVPPPPSSAPPSQHYLAQHPPTFATYGLPFLPHAASPNEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQ
PFAFAVFSDQQSAIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSMFSRSTPDSGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKSVNDTV
AANVVSAIYLVNPVNFGRIILHNFIPQTPQNPPCPTLFVANLGPSCTEQELIQTFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEY
PSTIFLFFTLYC