; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G16870 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G16870
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionU1 small nuclear ribonucleoprotein A isoform X2
Genome locationClcChr01:29666525..29672364
RNA-Seq ExpressionClc01G16870
SyntenyClc01G16870
Gene Ontology termsGO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008440306.2 PREDICTED: protein WHI4 isoform X2 [Cucumis melo]4.5e-16489.58Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL
        MDDM SYYP PQPPPQPS LEP HYPYYQV PPPPSAPPSQHYLS HPPTFASYGLP LPH  SINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHL
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL

Query:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG
        RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+ SRSTPDP                   G
Subjt:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG

Query:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST
        LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACST
Subjt:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST

Query:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKK K
Subjt:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

XP_016899375.1 PREDICTED: U1 small nuclear ribonucleoprotein A isoform X1 [Cucumis melo]1.2e-16489.88Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL
        MDDM SYYP PQPPPQPS LEP HYPYYQV PPPPSAPPSQHYLS HPPTFASYGLP LPH  SINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHL
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL

Query:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG
        RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+ SRSTPDP                  AG
Subjt:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG

Query:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST
        LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACST
Subjt:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST

Query:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKK K
Subjt:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

XP_022977595.1 U1 small nuclear ribonucleoprotein A isoform X2 [Cucurbita maxima]2.7e-16489.38Show/hide
Query:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES
        MDD+ SYY   PQPQPP QPS LE  HYPYYQVPPPP SAP SQHYL+ HPPTFASYGLPFLPHAASINEVRTLFIAGLP+DVKPREIYNLFREFPGYES
Subjt:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES

Query:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA
        SHLR+PTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVS+FSR TPDP                 
Subjt:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA

Query:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
         AGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANV PQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
Subjt:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA

Query:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
Subjt:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

XP_038881693.1 protein WHI4 isoform X1 [Benincasa hispida]7.2e-17091.67Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL
        MDDMASYYPQPQPPPQPS LEP HYP+YQVPPPP SAPPSQHYLS HPPTFASYGLPFLPHAASINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHL
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL

Query:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG
        RTPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+FSRSTPDP                  AG
Subjt:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG

Query:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST
        LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDT CST
Subjt:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST

Query:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
Subjt:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

XP_038881694.1 protein WHI4 isoform X2 [Benincasa hispida]2.7e-16991.37Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL
        MDDMASYYPQPQPPPQPS LEP HYP+YQVPPPP SAPPSQHYLS HPPTFASYGLPFLPHAASINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHL
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL

Query:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG
        RTPTQTTQPFAFAVFSDQQSA+GAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+FSRSTPDP                   G
Subjt:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG

Query:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST
        LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDT CST
Subjt:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST

Query:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
Subjt:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

TrEMBL top hitse value%identityAlignment
A0A1S3B1E9 protein WHI4 isoform X22.2e-16489.58Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL
        MDDM SYYP PQPPPQPS LEP HYPYYQV PPPPSAPPSQHYLS HPPTFASYGLP LPH  SINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHL
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL

Query:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG
        RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+ SRSTPDP                   G
Subjt:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG

Query:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST
        LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACST
Subjt:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST

Query:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKK K
Subjt:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

A0A1S4DUH9 U1 small nuclear ribonucleoprotein A isoform X15.8e-16589.88Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL
        MDDM SYYP PQPPPQPS LEP HYPYYQV PPPPSAPPSQHYLS HPPTFASYGLP LPH  SINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHL
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHL

Query:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG
        RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+ SRSTPDP                  AG
Subjt:  RTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAG

Query:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST
        LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACST
Subjt:  LGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACST

Query:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKK K
Subjt:  GALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

A0A6J1GCW9 U1 small nuclear ribonucleoprotein A isoform X12.2e-16489.38Show/hide
Query:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES
        MDD+ SYY   PQPQPP QPS LE  HYPYYQVPPPP SAP SQHYL+ HPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES
Subjt:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES

Query:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA
        SHLR+PTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKR RTEDERYGSDKKAKVS+FSRSTPDP                 
Subjt:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA

Query:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
         AGLGSTHMSGMGNSAYNTIGYPSAQSHGSFD+KTVNDTVAANV PQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
Subjt:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA

Query:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
Subjt:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

A0A6J1IMS7 U1 small nuclear ribonucleoprotein A isoform X21.3e-16489.38Show/hide
Query:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES
        MDD+ SYY   PQPQPP QPS LE  HYPYYQVPPPP SAP SQHYL+ HPPTFASYGLPFLPHAASINEVRTLFIAGLP+DVKPREIYNLFREFPGYES
Subjt:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES

Query:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA
        SHLR+PTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVS+FSR TPDP                 
Subjt:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA

Query:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
         AGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANV PQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
Subjt:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA

Query:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
Subjt:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

A0A6J1IQG5 protein WHI4 isoform X14.9e-16489.09Show/hide
Query:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES
        MDD+ SYY   PQPQPP QPS LE  HYPYYQVPPPP SAP SQHYL+ HPPTFASYGLPFLPHAASINEVRTLFIAGLP+DVKPREIYNLFREFPGYES
Subjt:  MDDMASYY---PQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYES

Query:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA
        SHLR+PTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVS+FSR TPDP                 
Subjt:  SHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWA

Query:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
          GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANV PQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA
Subjt:  LAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTA

Query:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
        CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK
Subjt:  CSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKLK

SwissProt top hitse value%identityAlignment
Q6DH13 RNA-binding protein, mRNA-processing factor 2a3.6e-1539.84Show/hide
Query:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F  +  A  A +A+NG+ FD E    L ++ AK+N+               K
Subjt:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK

Query:  KAKVSLFSRSTPDPGHIILKAHF
         AK  L +   P   H  L AHF
Subjt:  KAKVSLFSRSTPDPGHIILKAHF

Q6ZRY4 RNA-binding protein with multiple splicing 22.1e-1539.84Show/hide
Query:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++   +  QP  F +F  +  A  A +A+NG+ FD E    L ++ AK+N+               K
Subjt:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK

Query:  KAKVSLFSRSTPDPGHIILKAHF
         AK  L +   P   H  L AHF
Subjt:  KAKVSLFSRSTPDPGHIILKAHF

Q8VC52 RNA-binding protein with multiple splicing 28.1e-1539.34Show/hide
Query:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F +F  +  A  A +A+NG+ FD E    L ++ AK+N+               K
Subjt:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK

Query:  KAKVSLFSRSTPDPGHIILKAH
         AK  L +   P   H  L AH
Subjt:  KAKVSLFSRSTPDPGHIILKAH

Q9W6I1 RNA-binding protein with multiple splicing 26.2e-1539.84Show/hide
Query:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F  +  A  A +A+NG+ FD E    L ++ AK+N+               K
Subjt:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK

Query:  KAKVSLFSRSTPDPGHIILKAHF
         AK  L +   P   H  L AHF
Subjt:  KAKVSLFSRSTPDPGHIILKAHF

Q9YGP5 RNA-binding protein with multiple splicing 24.8e-1539.84Show/hide
Query:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK
        EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F ++  A  A +A+NG+ FD E    L ++ AK+N+               K
Subjt:  EVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDK

Query:  KAKVSLFSRSTPDPGHIILKAHF
         AK  L +   P   H  L AHF
Subjt:  KAKVSLFSRSTPDPGHIILKAHF

Arabidopsis top hitse value%identityAlignment
AT2G42240.1 RNA-binding (RRM/RBD/RNP motifs) family protein9.7e-8052.68Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFA-SYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSH
        MDD+ +YY     P               VPPPPP        +SP P T A S  LP      + +EVRTLF+AGLP+DVKPREIYNLFREFPGYE+SH
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFA-SYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSH

Query:  LRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALA
        LR+ +   +PFAFAVFSD QSAV  MHA+NGMVFDLEK S L++DLAKSN +SKR RT+D   G +   K+  ++ +T                     +
Subjt:  LRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALA

Query:  GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACS
        G GS    GM +SAYNTIGY  AQS G   N       +        PCPTLF+AN+GP+CTE ELIQ+FSRC GFLKLK+Q TYG PVAFVDFQD +CS
Subjt:  GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACS

Query:  TGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKL
        + AL+ LQG++LYSS  GE +RL+YA+SRMGMRKK+
Subjt:  TGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKL

AT2G42240.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.6e-7451.69Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFA-SYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSH
        MDD+ +YY     P               VPPPPP        +SP P T A S  LP      + +EVRTLF+AGLP+DVKPREIYNLFREFPGYE+SH
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFA-SYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSH

Query:  LRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALA
        LR+ +   +PFAFAVFSD QSAV  MHA+NGMVFDLEK S L++DLAKSN +SKR RT+D   G +   K+  ++ +T                     +
Subjt:  LRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALA

Query:  GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACS
        G GS    GM +SAYNTIGY  AQS G   N       +        PCPTLF+AN+GP+CTE ELIQ+FSRC GFLKLK+Q TYG PVAFVDFQD +CS
Subjt:  GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACS

Query:  TGALNHLQGSILYSSPPGEGMRLEY
        + AL+ LQG++LYSS  GE +RL+Y
Subjt:  TGALNHLQGSILYSSPPGEGMRLEY

AT2G42240.3 RNA-binding (RRM/RBD/RNP motifs) family protein1.3e-7652.08Show/hide
Query:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFA-SYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSH
        MDD+ +YY     P               VPPPPP        +SP P T A S  LP      + +EVRTLF+AGLP+DVKPREIYNLFREFPGYE+SH
Subjt:  MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFA-SYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSH

Query:  LRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALA
        LR+ +   +PFAFAVFSD QSAV  MHA+NGMVFDLEK S L++DLAKSN +SKR RT+D   G +   K+  ++ +T                     +
Subjt:  LRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALA

Query:  GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACS
        G GS    GM +SAYNTIGY  AQS G   N       +        PCPTLF+AN+GP+CTE ELIQ+FSRC GFLKLK+Q TYG PVAFVDFQD +CS
Subjt:  GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACS

Query:  TGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKL
        + AL+ LQG++LYSS  GE +RL+   SRMGMRKK+
Subjt:  TGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKKL

AT3G13700.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.0e-3333.75Show/hide
Query:  HYPY-----YQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQ
        H PY     YQ+   P   PP    L+  P                   + TLF++GLP+DVK REI+NLFR   G+ES  L+   +  Q  AFA F+  
Subjt:  HYPY-----YQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQ

Query:  QSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTE------DERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAGLGSTHMSGMGNS
        + A+ AM+ +NG+ FD +  S L+++LA+SNSR K           D R     K++         DP  +    +               +       S
Subjt:  QSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTE------DERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAGLGSTHMSGMGNS

Query:  AYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILY
        A   +   S    G+                    C TLF+ANLGP+CTE EL Q+ SR PGF  LK+++  G PVAF DF++   +T A+NHLQG++L 
Subjt:  AYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILY

Query:  SSPPGEGMRLEYAKSRM
        SS  G GM +EYA+S+M
Subjt:  SSPPGEGMRLEYAKSRM

AT3G21215.1 RNA-binding (RRM/RBD/RNP motifs) family protein3.7e-3131.96Show/hide
Query:  SALEPLH--YPYYQVPPP--------PPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTT
        + + P H  +P    PPP        PP  PP  H+  P P    ++  P        +E+RT+FIAGLPDDVK RE+ NL R  PGYE+S +    +  
Subjt:  SALEPLH--YPYYQVPPP--------PPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQTT

Query:  QPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSN--------------SRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVL
        +P  FA+FS  Q A+ A   +  MVFD E +SV++ ++AK N               +SKR+RT     G D    V   S   P P  +    H     
Subjt:  QPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSN--------------SRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVL

Query:  HAWALAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDF
                   H   +       I  PS+              V    I  NPPC TLF+ NLG +  E+EL  + S  PGF ++K+       V F++F
Subjt:  HAWALAGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDF

Query:  QDTACSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKK
        +D   +T   ++LQG+++ SS    GMR++Y+K+  G RK+
Subjt:  QDTACSTGALNHLQGSILYSSPPGEGMRLEYAKSRMGMRKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGACATGGCGAGCTACTATCCACAACCGCAACCACCGCCACAGCCTTCTGCTCTTGAACCTCTTCACTATCCCTATTACCAGGTGCCTCCTCCTCCTCCT
TCAGCACCGCCGTCTCAGCATTACCTCTCTCCACACCCGCCCACCTTCGCTTCCTATGGCTTACCTTTTTTACCTCACGCAGCCTCCATCAATGAGGTCCGAACC
CTATTCATAGCTGGCCTTCCTGATGATGTCAAGCCCCGAGAAATTTACAATCTCTTCCGGGAGTTTCCGGGATACGAGTCCTCTCATCTTCGGACCCCCACGCAG
ACGACCCAGCCATTTGCATTTGCTGTATTCTCGGACCAGCAGTCTGCCGTTGGTGCAATGCATGCTGTAAATGGCATGGTTTTTGATCTTGAGAAGCAGTCAGTA
CTGTATGTTGATTTGGCTAAATCTAATTCACGATCAAAACGGATGAGGACAGAGGATGAAAGATATGGATCGGATAAGAAAGCTAAAGTATCTCTCTTTTCAAGG
AGTACTCCTGATCCTGGACACATTATTCTAAAAGCTCATTTTGCACATGTTTTGCACGCTTGGGCACTAGCAGGTCTTGGCAGCACTCACATGTCCGGAATGGGT
AATTCTGCTTACAACACGATTGGTTATCCATCTGCACAAAGCCATGGAAGCTTTGATAACAAAACTGTAAATGATACAGTGGCTGCAAATGTGATTCCTCAAAAT
CCCCCATGTCCAACACTTTTTGTGGCAAATCTAGGGCCAAGTTGCACCGAGCAAGAGCTTATTCAAATTTTTTCAAGATGCCCGGGCTTCTTAAAACTAAAGATG
CAGAGCACGTATGGGGCTCCAGTTGCTTTTGTTGATTTTCAGGATACTGCCTGCTCAACTGGAGCTCTGAACCATCTGCAAGGCTCAATTCTGTACTCATCACCT
CCTGGGGAGGGCATGCGATTGGAATACGCAAAATCACGAATGGGTATGCGAAAGAAATTGAAATGA
mRNA sequenceShow/hide mRNA sequence
AGGGCTTCCAGCTCTCTCTCTCTCCCTACTTCGTCAGAGACTCGGAGAGAACAGCACTCCGATGGACGACATGGCGAGCTACTATCCACAACCGCAACCACCGCC
ACAGCCTTCTGCTCTTGAACCTCTTCACTATCCCTATTACCAGGTGCCTCCTCCTCCTCCTTCAGCACCGCCGTCTCAGCATTACCTCTCTCCACACCCGCCCAC
CTTCGCTTCCTATGGCTTACCTTTTTTACCTCACGCAGCCTCCATCAATGAGGTCCGAACCCTATTCATAGCTGGCCTTCCTGATGATGTCAAGCCCCGAGAAAT
TTACAATCTCTTCCGGGAGTTTCCGGGATACGAGTCCTCTCATCTTCGGACCCCCACGCAGACGACCCAGCCATTTGCATTTGCTGTATTCTCGGACCAGCAGTC
TGCCGTTGGTGCAATGCATGCTGTAAATGGCATGGTTTTTGATCTTGAGAAGCAGTCAGTACTGTATGTTGATTTGGCTAAATCTAATTCACGATCAAAACGGAT
GAGGACAGAGGATGAAAGATATGGATCGGATAAGAAAGCTAAAGTATCTCTCTTTTCAAGGAGTACTCCTGATCCTGGACACATTATTCTAAAAGCTCATTTTGC
ACATGTTTTGCACGCTTGGGCACTAGCAGGTCTTGGCAGCACTCACATGTCCGGAATGGGTAATTCTGCTTACAACACGATTGGTTATCCATCTGCACAAAGCCA
TGGAAGCTTTGATAACAAAACTGTAAATGATACAGTGGCTGCAAATGTGATTCCTCAAAATCCCCCATGTCCAACACTTTTTGTGGCAAATCTAGGGCCAAGTTG
CACCGAGCAAGAGCTTATTCAAATTTTTTCAAGATGCCCGGGCTTCTTAAAACTAAAGATGCAGAGCACGTATGGGGCTCCAGTTGCTTTTGTTGATTTTCAGGA
TACTGCCTGCTCAACTGGAGCTCTGAACCATCTGCAAGGCTCAATTCTGTACTCATCACCTCCTGGGGAGGGCATGCGATTGGAATACGCAAAATCACGAATGGG
TATGCGAAAGAAATTGAAATGAATAAAACAATTCTTGCAGACGGGACAGGAGCCAAAAAGGAAAAATAGAAAAGAGAGACCCTCAGGCCATGTCCCTACATGACT
AAGGTCAGAGGTAGTGCGTTACTTGTACTCTTATGTAGATGAGTGAATTAACAAATATACCATGCCCATTTTTTTCATTGTATCAAATGCTGCCTTCTCTGTCAT
GAACACCAAATTTATTTTATATCTTCAGAGCCCCATTGAAT
Protein sequenceShow/hide protein sequence
MDDMASYYPQPQPPPQPSALEPLHYPYYQVPPPPPSAPPSQHYLSPHPPTFASYGLPFLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRTPTQ
TTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSLFSRSTPDPGHIILKAHFAHVLHAWALAGLGSTHMSGMG
NSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSP
PGEGMRLEYAKSRMGMRKKLK