CuGenDBv2

Carg07211 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg07211
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: U1 small nuclear ribonucleoprotein C
Genome location: Carg_Chr17:8329314..8329933
RNA-Seq Expression: Carg07211
Synteny: Carg07211
Gene Ontology terms:
GO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0000395 - mRNA 5'-splice site recognition (biological process)
GO:0000243 - commitment complex (cellular component)
GO:0005685 - U1 snRNP (cellular component)
GO:0071004 - U2-type prespliceosome (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0030619 - U1 snRNA binding (molecular function)
GO:0030627 - pre-mRNA 5'-splice site binding (molecular function)
InterPro domains: IPR017340 - U1 small nuclear ribonucleoprotein C


Homology
GenBank top hits (e value, % identity, alignment)
KAG7014402.1 U1 small nuclear ribonucleoprotein C, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.6e-85; identity: 100%)
Query:  MNALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMM
        MNALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMM
Subjt:  MNALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMM

Query:  APPGAPMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        APPGAPMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
Subjt:  APPGAPMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

XP_022953847.1 U1 small nuclear ribonucleoprotein C-like [Cucurbita moschata] (e value: 7.6e-83; identity: 99.4%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

XP_022991731.1 U1 small nuclear ribonucleoprotein C-like [Cucurbita maxima] (e value: 1.6e-80; identity: 97.62%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        APMPGQVNMLAR PPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSF T AQPSESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

XP_023548482.1 U1 small nuclear ribonucleoprotein C-like [Cucurbita pepo subsp. pepo] (e value: 9.3e-81; identity: 97.62%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPT GAPGYLPAPTMPPMM PPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        APMPGQVNMLARPPPPPAPIPGS PQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

XP_038899936.1 U1 small nuclear ribonucleoprotein C isoform X1 [Benincasa hispida] (e value: 1.8e-71; identity: 89.88%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAF QVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGL+PGIRPPVLPRP PGAPGYLPAPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        APMPGQVN+ AR PPPPAP+PGS PQPSS NGAP  A P MYQANPAAPGSGGY+SFTT AQPSE NH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K515 U1 small nuclear ribonucleoprotein C (e value: 6.8e-69; identity: 87.57%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAF QVGAA+NQHLLGQRPRLPVLPTPVMPGAAPGL+PGIRPPVLPRP PGAPGYLP PTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQP-SESNH
        AP+PGQVN+ +R PPPPAP+PGS PQPSS NGAP  AAP  YQANPAAPGSGGYDSFT+ AQP SESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQP-SESNH

A0A1S3BS92 U1 small nuclear ribonucleoprotein C (e value: 1.2e-68; identity: 87.57%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAF QVGAA+NQHLLGQRPRLPVLPTPV+PGAAPGL+PGIRPPVLPRP PGAPGYLP PTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQP-SESNH
        AP+PGQVN+ +R PPPPAPIPGS PQPSS NGAP  AAP  YQANPAAPGSGGYDSFT+ AQP SESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQP-SESNH

A0A5A7UNW2 U1 small nuclear ribonucleoprotein C (e value: 1.2e-68; identity: 87.57%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAF QVGAA+NQHLLGQRPRLPVLPTPV+PGAAPGL+PGIRPPVLPRP PGAPGYLP PTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQP-SESNH
        AP+PGQVN+ +R PPPPAPIPGS PQPSS NGAP  AAP  YQANPAAPGSGGYDSFT+ AQP SESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQP-SESNH

A0A6J1GPE6 U1 small nuclear ribonucleoprotein C (e value: 3.7e-83; identity: 99.4%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

A0A6J1JRL3 U1 small nuclear ribonucleoprotein C (e value: 7.7e-81; identity: 97.62%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        APMPGQVNMLAR PPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSF T AQPSESNH
Subjt:  APMPGQVNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

SwissProt top hits (e value, % identity, alignment)
C5XYW4 U1 small nuclear ribonucleoprotein C-2 (e value: 6.6e-29; identity: 53%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLG-----QRPRLPVLPTPVMPGAAP--GLLPGIRPPVLPRP----TPGAPGYLPA
        +ANVR+YYQQFEEQQTQSLIDQRIKEHLGQAAAF Q GA +NQH+L       RPRLP+LPTP MP   P   L+PG+RPP+LP P     PGAP  +P 
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLG-----QRPRLPVLPTPVMPGAAP--GLLPGIRPPVLPRP----TPGAPGYLPA

Query:  PTMPP-MMAPPGAPMPGQVNMLARPP------PPPAPIPGSTPQPSS-ANGAP---SAAAPLMYQANPAAPG---SGGYDSFTTTAQ-------PSESNH
        P  PP  M  PGAP PG +     PP        P P P + P P+S   GAP   SAA P +YQANP AP    SG   +  T  Q       PSE NH
Subjt:  PTMPP-MMAPPGAPMPGQVNMLARPP------PPPAPIPGSTPQPSS-ANGAP---SAAAPLMYQANPAAPG---SGGYDSFTTTAQ-------PSESNH

C5XZK6 U1 small nuclear ribonucleoprotein C-1 (e value: 2.5e-28; identity: 52.43%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLG-----QRPRLPVLPTPVMPGA---APG--LLPGIRPPVLPRP-----------
        +ANVR+YYQQFEEQQTQSLIDQRIKEHLGQAAAF Q GA +NQH+L       RPRLP+LPTP MP     APG  L+PG+RPP+LP P           
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLG-----QRPRLPVLPTPVMPGA---APG--LLPGIRPPVLPRP-----------

Query:  --TPGA-PGYLPAPTMPP-MMAPPGAPMPGQVNMLARPPPPPAPIPGSTPQPSSANGAP---SAAAPLMYQANPAAPG---SGGYDSFTTT-------AQ
           PGA PG +P P  PP  M  PGAP PG + M   P P P  +P   P  S   GAP   SAA P +YQ NP AP    SG   +  T        AQ
Subjt:  --TPGA-PGYLPAPTMPP-MMAPPGAPMPGQVNMLARPPPPPAPIPGSTPQPSSANGAP---SAAAPLMYQANPAAPG---SGGYDSFTTT-------AQ

Query:  PSESNH
        PSE NH
Subjt:  PSESNH

F6HQ26 U1 small nuclear ribonucleoprotein C (e value: 4.9e-48; identity: 68.11%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLG-----QRPRLPVLPTPVMP--GAAP-----GLLPGIRPPVLPRPTPGAPGYLP
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQ AAF QVGAAYNQHL+       RPRLPVLPTP MP  G+AP      L+PG+RPPVLPRP PGAPGY+P
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLG-----QRPRLPVLPTPVMP--GAAP-----GLLPGIRPPVLPRPTPGAPGYLP

Query:  APTMPPMMAPPGAP-MP-GQVNMLARPPP---PPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
        AP MP MMAPPGAP MP   +N L RPP    PPA +PGST  P+S  GAPS     MYQANPA P SGG+DSF   AQ  E+NH
Subjt:  APTMPPMMAPPGAP-MP-GQVNMLARPPP---PPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH

Q56XE4 U1 small nuclear ribonucleoprotein C (e value: 1.4e-26; identity: 51.5%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVR YYQQFEEQQTQSLIDQRIKEHLGQ   + QVGA +NQH+L  RPR P++   + PG+ P    G+RPPVLPRP     GY+P P +P MMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGS-------TPQPSSANGAP----SAAAPLMYQANPAAPGSGGYDS
        AP+P         PP  APIPG         P P    G P        P  Y  NPAAP SG +++
Subjt:  APMPGQVNMLARPPPPPAPIPGS-------TPQPSSANGAP----SAAAPLMYQANPAAPGSGGYDS

Arabidopsis top hits (e value, % identity, alignment)
AT4G03120.1 C2H2 and C2HC zinc fingers superfamily protein (e value: 9.8e-28; identity: 51.5%)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG
        +ANVR YYQQFEEQQTQSLIDQRIKEHLGQ   + QVGA +NQH+L  RPR P++   + PG+ P    G+RPPVLPRP     GY+P P +P MMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG

Query:  APMPGQVNMLARPPPPPAPIPGS-------TPQPSSANGAP----SAAAPLMYQANPAAPGSGGYDS
        AP+P         PP  APIPG         P P    G P        P  Y  NPAAP SG +++
Subjt:  APMPGQVNMLARPPPPPAPIPGS-------TPQPSSANGAP----SAAAPLMYQANPAAPGSGGYDS
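The % identity values listed with the hits above can be reproduced from an alignment's middle (match) line. The following is a minimal sketch, assuming that a letter in the midline marks an identical residue, that `+` and blank columns mark non-identical positions, and that identity is the number of identical columns divided by the alignment length. The function name `percent_identity` and the variable `MAXIMA_MIDLINE` are illustrative and not part of CuGenDB; the two strings are the midline rows of the Cucurbita maxima GenBank hit (XP_022991731.1).

```python
def percent_identity(midline_rows):
    """Percent identity from BLAST-style midline rows.

    Assumption: letters are identical columns; '+' and spaces are not.
    """
    midline = "".join(midline_rows)
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline rows of the Cucurbita maxima hit, copied from the alignment above.
MAXIMA_MIDLINE = [
    "+ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPG",
    "APMPGQVNMLAR PPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSF T AQPSESNH",
]

if __name__ == "__main__":
    print(round(percent_identity(MAXIMA_MIDLINE), 2))
```

Under these assumptions the result can be compared with the 97.62 reported for that hit. Midline rows that end in blank columns would need their trailing spaces preserved for the alignment length to be counted correctly.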


Sequences
CDS sequence
ATGAATGCACTCCAGGCAAACGTGCGATCATACTATCAGCAATTTGAGGAGCAACAAACCCAAAGTTTAATTGACCAGAGGATCAAAGAACATCTTGGTCAAGCCGCAGC
ATTCCATCAGGTTGGTGCAGCCTATAATCAGCATTTACTTGGCCAACGACCTCGTCTTCCTGTATTACCTACTCCTGTAATGCCGGGAGCTGCCCCGGGGTTATTGCCTG
GAATTAGGCCTCCAGTTTTGCCAAGACCAACTCCTGGTGCTCCAGGATATCTACCTGCTCCTACAATGCCACCTATGATGGCTCCGCCCGGTGCTCCTATGCCTGGCCAA
GTGAATATGCTTGCCAGGCCGCCGCCTCCTCCAGCACCAATTCCAGGGAGCACACCTCAACCTTCTTCAGCCAATGGCGCACCTTCAGCTGCTGCACCATTGATGTATCA
AGCAAATCCAGCAGCACCAGGAAGTGGTGGTTATGATAGCTTCACCACCACCGCTCAACCTTCTGAGTCTAACCATTAG
mRNA sequence
ATGAATGCACTCCAGGCAAACGTGCGATCATACTATCAGCAATTTGAGGAGCAACAAACCCAAAGTTTAATTGACCAGAGGATCAAAGAACATCTTGGTCAAGCCGCAGC
ATTCCATCAGGTTGGTGCAGCCTATAATCAGCATTTACTTGGCCAACGACCTCGTCTTCCTGTATTACCTACTCCTGTAATGCCGGGAGCTGCCCCGGGGTTATTGCCTG
GAATTAGGCCTCCAGTTTTGCCAAGACCAACTCCTGGTGCTCCAGGATATCTACCTGCTCCTACAATGCCACCTATGATGGCTCCGCCCGGTGCTCCTATGCCTGGCCAA
GTGAATATGCTTGCCAGGCCGCCGCCTCCTCCAGCACCAATTCCAGGGAGCACACCTCAACCTTCTTCAGCCAATGGCGCACCTTCAGCTGCTGCACCATTGATGTATCA
AGCAAATCCAGCAGCACCAGGAAGTGGTGGTTATGATAGCTTCACCACCACCGCTCAACCTTCTGAGTCTAACCATTAG
Protein sequence
MNALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPGAPMPGQ
VNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH
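
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the names `CODON_TABLE`, `translate`, `CDS` and `PROTEIN` are illustrative and not part of CuGenDB.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# CDS and protein sequence of Carg07211, copied from the record above.
CDS = (
    "ATGAATGCACTCCAGGCAAACGTGCGATCATACTATCAGCAATTTGAGGAGCAACAAACCCAAAGTTTAATTGACCAGAGGATCAAAGAACATCTTGGTCAAGCCGCAGC"
    "ATTCCATCAGGTTGGTGCAGCCTATAATCAGCATTTACTTGGCCAACGACCTCGTCTTCCTGTATTACCTACTCCTGTAATGCCGGGAGCTGCCCCGGGGTTATTGCCTG"
    "GAATTAGGCCTCCAGTTTTGCCAAGACCAACTCCTGGTGCTCCAGGATATCTACCTGCTCCTACAATGCCACCTATGATGGCTCCGCCCGGTGCTCCTATGCCTGGCCAA"
    "GTGAATATGCTTGCCAGGCCGCCGCCTCCTCCAGCACCAATTCCAGGGAGCACACCTCAACCTTCTTCAGCCAATGGCGCACCTTCAGCTGCTGCACCATTGATGTATCA"
    "AGCAAATCCAGCAGCACCAGGAAGTGGTGGTTATGATAGCTTCACCACCACCGCTCAACCTTCTGAGTCTAACCATTAG"
)
PROTEIN = (
    "MNALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLLPGIRPPVLPRPTPGAPGYLPAPTMPPMMAPPGAPMPGQ"
    "VNMLARPPPPPAPIPGSTPQPSSANGAPSAAAPLMYQANPAAPGSGGYDSFTTTAQPSESNH"
)

if __name__ == "__main__":
    # Report sequence lengths and whether the translation matches the record.
    print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The check only confirms internal consistency of the record; it says nothing about whether the gene model itself is complete.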