CuGenDBv2

MS013762 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013762
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: U1 small nuclear ribonucleoprotein C
Genome location: scaffold263:774079..775466
RNA-Seq Expression: MS013762
Synteny: MS013762
Gene Ontology terms:
GO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0000395 - mRNA 5'-splice site recognition (biological process)
GO:0000243 - commitment complex (cellular component)
GO:0005685 - U1 snRNP (cellular component)
GO:0071004 - U2-type prespliceosome (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0030619 - U1 snRNA binding (molecular function)
GO:0030627 - pre-mRNA 5'-splice site binding (molecular function)
InterPro domains: IPR017340 - U1 small nuclear ribonucleoprotein C
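
The genome location in the record above follows a `scaffold:start..end` pattern. The following is a minimal sketch of parsing such a string and computing the span, assuming 1-based inclusive coordinates (the record itself does not state the convention). The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("scaffold263:774079..775466")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span works out to 1388 bp.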


Homology
GenBank top hits (e value, % identity, alignment)
KAG7014402.1 U1 small nuclear ribonucleoprotein C, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value 4.3e-70, 87.86% identity)
Query:  LDALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMM
        ++ALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAF QVGAAYNQHLLGQRPRLPVLPTPVMPGAAP L+PGIRPPVLPRP PGAPGYLP PTMPPMM
Subjt:  LDALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMM

Query:  APPGAPMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGA-SMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
        APPGAPMPG VNM  RPPPP P P+PGSTPQP+S NGA S  AP MYQ NPAAPGSGGYDSFTT AQPSESNH
Subjt:  APPGAPMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGA-SMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH

XP_004148886.1 U1 small nuclear ribonucleoprotein C [Cucumis sativus] (e value 1.6e-69, 88.17% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPVMPGAAP LMPGIRPPVLPRP+PGAPGYLPTPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG

Query:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH
        AP+PG VN+P+RPPPPA  P+PGS PQP+STNGA + APS YQ NPAAPGSGGYDSFT+MAQP SESNH
Subjt:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH

XP_022150286.1 U1 small nuclear ribonucleoprotein C [Momordica charantia] (e value 4.9e-82, 98.8% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGA GYLPTPTMPPMMAPPGA
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA

Query:  PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
        PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
Subjt:  PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH

XP_038899936.1 U1 small nuclear ribonucleoprotein C isoform X1 [Benincasa hispida] (e value 5.1e-71, 89.29% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP LMPGIRPPVLPRP+PGAPGYLP PTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG

Query:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
        APMPG VN+P RPPPPA  PVPGS PQP+STNGA +  PSMYQ NPAAPGSGGY+SFTTMAQPSE NH
Subjt:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH

XP_038899937.1 U1 small nuclear ribonucleoprotein C isoform X2 [Benincasa hispida] (e value 5.1e-71, 89.82% identity)
Query:  ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA
        ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP LMPGIRPPVLPRP+PGAPGYLP PTMPPMMAPPGA
Subjt:  ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA

Query:  PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
        PMPG VN+P RPPPPA  PVPGS PQP+STNGA +  PSMYQ NPAAPGSGGY+SFTTMAQPSE NH
Subjt:  PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K515 U1 small nuclear ribonucleoprotein C (e value 8.0e-70, 88.17% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPVMPGAAP LMPGIRPPVLPRP+PGAPGYLPTPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG

Query:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH
        AP+PG VN+P+RPPPPA  P+PGS PQP+STNGA + APS YQ NPAAPGSGGYDSFT+MAQP SESNH
Subjt:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH

A0A1S3BS92 U1 small nuclear ribonucleoprotein C (e value 1.4e-69, 87.57% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PGAAP LMPGIRPPVLPRP+PGAPGYLPTPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG

Query:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH
        AP+PG VN+P+RPPPPA  P+PGS PQP+STNGA + APS YQ NPAAPGSGGYDSFT+MAQP SESNH
Subjt:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH

A0A5A7UNW2 U1 small nuclear ribonucleoprotein C (e value 1.4e-69, 87.57% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PGAAP LMPGIRPPVLPRP+PGAPGYLPTPTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG

Query:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH
        AP+PG VN+P+RPPPPA  P+PGS PQP+STNGA + APS YQ NPAAPGSGGYDSFT+MAQP SESNH
Subjt:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQP-SESNH

A0A6J1DB31 U1 small nuclear ribonucleoprotein C (e value 2.4e-82, 98.8% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGA GYLPTPTMPPMMAPPGA
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA

Query:  PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
        PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
Subjt:  PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH

A0A6J1GPE6 U1 small nuclear ribonucleoprotein C (e value 1.1e-68, 88.17% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAF QVGAAYNQHLLGQRPRLPVLPTPVMPGAAP L+PGIRPPVLPRP PGAPGYLP PTMPPMMAPPG
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAP-LMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPG

Query:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGA-SMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
        APMPG VNM  RPPPP P P+PGSTPQP+S NGA S  AP MYQ NPAAPGSGGYDSFTT AQPSESNH
Subjt:  APMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGA-SMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH

SwissProt top hits (e value, % identity, alignment)
C5XYW4 U1 small nuclear ribonucleoprotein C-2 (e value 1.7e-29, 53.5% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGA---APLMPGIRPPVLPRP----MPGAPGYLPT
        +ANVR+YYQQFEEQQTQSLIDQRIKEHLGQAAAF Q GA +NQH+L       RPRLP+LPTP MP     APLMPG+RPP+LP P     PGAP  +P 
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGA---APLMPGIRPPVLPRP----MPGAPGYLPT

Query:  PTMPP-MMAPPGAPMPGHVNMPTRPP-----PPAPVPVPGSTPQPTS-TNGA----SMGAPSMYQGNPAAPG---SGGYDSFTTMAQ-------PSESNH
        P  PP  M  PGAP PG +  P  PP       AP+P P + P PTS   GA    S   P++YQ NP AP    SG   +  T  Q       PSE NH
Subjt:  PTMPP-MMAPPGAPMPGHVNMPTRPP-----PPAPVPVPGSTPQPTS-TNGA----SMGAPSMYQGNPAAPG---SGGYDSFTTMAQ-------PSESNH

C5XZK6 U1 small nuclear ribonucleoprotein C-1 (e value 1.9e-28, 53.69% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMP------GAAPLMPGIRPPVLPRP-MPGAPGYLPT
        +ANVR+YYQQFEEQQTQSLIDQRIKEHLGQAAAF Q GA +NQH+L       RPRLP+LPTP MP        APLMPG+RPP+LP P +PG PG  PT
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMP------GAAPLMPGIRPPVLPRP-MPGAPGYLPT

Query:  ---PTMPP-MMAPPGAPMPGHVNMPTRPP-----PPAPVPVPGSTPQPTS-TNGA----SMGAPSMYQGNPAAPG---SGGYDSFTT-------MAQPSE
           P  PP  M  PGAP PG +  P  PP       AP+P P + P PTS   GA    S   P++YQ NP AP    SG   +  T        AQPSE
Subjt:  ---PTMPP-MMAPPGAPMPGHVNMPTRPP-----PPAPVPVPGSTPQPTS-TNGA----SMGAPSMYQGNPAAPG---SGGYDSFTT-------MAQPSE

Query:  SNH
         NH
Subjt:  SNH

F6HQ26 U1 small nuclear ribonucleoprotein C (e value 2.4e-47, 65.03% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGA--------APLMPGIRPPVLPRPMPGAPGYLP
        +ANVRSYYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAAYNQHL+       RPRLPVLPTP MP A        +PL+PG+RPPVLPRP+PGAPGY+P
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGA--------APLMPGIRPPVLPRPMPGAPGYLP

Query:  TPTMPPMMAPPGA---PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
         P MP MMAPPGA   PMP   ++P  P    P  VPGST  PTS    SM    MYQ NPA P SGG+DSF   AQ  E+NH
Subjt:  TPTMPPMMAPPGA---PMPGHVNMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH

Q56XE4 U1 small nuclear ribonucleoprotein C (e value 7.3e-28, 53.61% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA
        +ANVR YYQQFEEQQTQSLIDQRIKEHLGQ   +QQVGA +NQH+L  RPR P++   + PG+ P+  G+RPPVLPRPM    GY+P P +P MMAPPGA
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA

Query:  PM-PGHVNMPTRPPPPAPVP-----VPGSTPQPTSTNGA-----SMGAPSMYQGNPAAPGSGGYDS
        P+ P   N   RPP  AP+P      PG  P P    G       +  P  Y  NPAAP SG +++
Subjt:  PM-PGHVNMPTRPPPPAPVP-----VPGSTPQPTSTNGA-----SMGAPSMYQGNPAAPGSGGYDS

Arabidopsis top hits (e value, % identity, alignment)
AT4G03120.1 C2H2 and C2HC zinc fingers superfamily protein (e value 5.2e-29, 53.61% identity)
Query:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA
        +ANVR YYQQFEEQQTQSLIDQRIKEHLGQ   +QQVGA +NQH+L  RPR P++   + PG+ P+  G+RPPVLPRPM    GY+P P +P MMAPPGA
Subjt:  QANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGA

Query:  PM-PGHVNMPTRPPPPAPVP-----VPGSTPQPTSTNGA-----SMGAPSMYQGNPAAPGSGGYDS
        P+ P   N   RPP  AP+P      PG  P P    G       +  P  Y  NPAAP SG +++
Subjt:  PM-PGHVNMPTRPPPPAPVP-----VPGSTPQPTSTNGA-----SMGAPSMYQGNPAAPGSGGYDS
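
In BLAST-style alignments such as those above, the middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. Percent identity is conventionally the number of identical columns divided by the alignment length. The sketch below illustrates that calculation on a short excerpt; the `percent_identity` function is a hypothetical helper, and whether the values reported on this page were computed exactly this way is an assumption.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a residue letter."""
    if not midline:
        raise ValueError("empty midline")
    # Letters mark identical residues; '+' and ' ' mark non-identical columns.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# First ten columns of the XP_022150286.1 alignment midline shown above.
midline = "+ANVRSYYQQ"
print(round(percent_identity(midline), 2))
```

When applying this to full alignments, each midline segment should be padded to the width of its query line first, since trailing spaces (which count as mismatches) are easily lost in copied text.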


Sequences
CDS sequence
CTGGATGCACTCCAGGCAAACGTGCGATCATACTACCAGCAATTTGAGGAGCAACAAACCCAAAGTTTAATTGACCAGAGGATCAAGGAACATCTCGGTCAAGCAGCAGC
ATTCCAGCAGGTTGGTGCAGCCTATAATCAACATTTACTTGGTCAAAGACCTCGTCTTCCTGTACTACCTACTCCTGTAATGCCAGGAGCTGCACCGTTAATGCCGGGCA
TTAGGCCTCCAGTTTTGCCAAGACCAATGCCTGGTGCTCCAGGATATCTACCTACTCCTACAATGCCACCTATGATGGCCCCACCGGGAGCTCCTATGCCTGGCCACGTG
AACATGCCTACCAGGCCGCCTCCTCCAGCACCAGTGCCAGTTCCAGGTAGCACACCGCAACCAACTTCGACTAATGGTGCCTCGATGGGCGCACCCTCAATGTATCAAGG
AAATCCAGCAGCACCAGGAAGCGGAGGGTATGATAGTTTTACCACAATGGCTCAACCTTCTGAGTCTAATCAT
mRNA sequence
CTGGATGCACTCCAGGCAAACGTGCGATCATACTACCAGCAATTTGAGGAGCAACAAACCCAAAGTTTAATTGACCAGAGGATCAAGGAACATCTCGGTCAAGCAGCAGC
ATTCCAGCAGGTTGGTGCAGCCTATAATCAACATTTACTTGGTCAAAGACCTCGTCTTCCTGTACTACCTACTCCTGTAATGCCAGGAGCTGCACCGTTAATGCCGGGCA
TTAGGCCTCCAGTTTTGCCAAGACCAATGCCTGGTGCTCCAGGATATCTACCTACTCCTACAATGCCACCTATGATGGCCCCACCGGGAGCTCCTATGCCTGGCCACGTG
AACATGCCTACCAGGCCGCCTCCTCCAGCACCAGTGCCAGTTCCAGGTAGCACACCGCAACCAACTTCGACTAATGGTGCCTCGATGGGCGCACCCTCAATGTATCAAGG
AAATCCAGCAGCACCAGGAAGCGGAGGGTATGATAGTTTTACCACAATGGCTCAACCTTCTGAGTCTAATCAT
Protein sequence
LDALQANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPLMPGIRPPVLPRPMPGAPGYLPTPTMPPMMAPPGAPMPGHV
NMPTRPPPPAPVPVPGSTPQPTSTNGASMGAPSMYQGNPAAPGSGGYDSFTTMAQPSESNH
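
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below builds the standard codon table and translates the first 30 nucleotides of the CDS; the `translate` helper and variable names are hypothetical, and the use of the standard table (rather than an alternative one) for this annotation is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 30 nucleotides of the CDS sequence listed above.
cds_prefix = "CTGGATGCACTCCAGGCAAACGTGCGATCA"
print(translate(cds_prefix))  # LDALQANVRS
```

The translated prefix matches the first ten residues of the protein sequence listed above; passing the full CDS (line breaks included) to `translate` allows the same comparison over the whole sequence.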