CuGenDBv2

Tan0004102 (gene) of Snake gourd v1 genome

Gene ID: Tan0004102
Organism: Trichosanthes anguina (Snake gourd v1)
Description: U1 small nuclear ribonucleoprotein C
Genome location: LG06:7597251..7601323
RNA-Seq Expression: Tan0004102
Synteny: Tan0004102
Gene Ontology terms:
GO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0000395 - mRNA 5'-splice site recognition (biological process)
GO:0000243 - commitment complex (cellular component)
GO:0005685 - U1 snRNP (cellular component)
GO:0071004 - U2-type prespliceosome (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0030619 - U1 snRNA binding (molecular function)
GO:0030627 - pre-mRNA 5'-splice site binding (molecular function)
InterPro domains:
IPR000690 - Matrin/U1-C, C2H2-type zinc finger
IPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR013085 - U1-C, C2H2-type zinc finger
IPR017340 - U1 small nuclear ribonucleoprotein C
IPR036236 - Zinc finger C2H2 superfamily
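
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and using a hypothetical helper named `parse_location`, the length of the gene region could be derived like this:

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("LG06:7597251..7601323")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span_bp = end - start + 1
print(seqid, start, end, span_bp)
```

Under that assumption the region spans 4073 bp on LG06. This is longer than the mRNA listed in the record, which would be consistent with introns in the genomic sequence.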


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593155.1 U1 small nuclear ribonucleoprotein C, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.4e-95; % identity: 93.33)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGG PGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN
        IRPPV PRPVPGAPGYLP PTMPPMMAPPGA MPGQVN+P+R PPPAPIPGS PQPSSTNGAPL AP  YQANPAAPGSGGYDSFTTMAQPSESN
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN

XP_004148886.1 U1 small nuclear ribonucleoprotein C [Cucumis sativus] (e value: 1.2e-95; % identity: 92.39)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPVMPG APGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH
        IRPPVLPRP+PGAPGYLP PTMPPMMAPPGAP+PGQVN+P+R PPPAP+PGSAPQPSSTNGAPLAAP  YQANPAAPGSGGYDSFT+MAQP SESNH
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH

XP_008451425.1 PREDICTED: U1 small nuclear ribonucleoprotein C [Cucumis melo] (e value: 2.0e-95; % identity: 92.39)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PG APGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH
        IRPPVLPRP+PGAPGYLP PTMPPMMAPPGAP+PGQVN+P+R PPPAPIPGSAPQPSSTNGAPLAAP  YQANPAAPGSGGYDSFT+MAQP SESNH
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH

XP_023004664.1 U1 small nuclear ribonucleoprotein C [Cucurbita maxima] (e value: 1.2e-95; % identity: 93.85)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN
        IRPPV PRPVPGAPGYLP PTMPPMMAPPGA MPGQVN+P+R PPPAPIPGS  QPSSTNGAPL APP YQANPAAPGSGGYDSFTTMAQPSESN
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN

XP_038899936.1 U1 small nuclear ribonucleoprotein C isoform X1 [Benincasa hispida] (e value: 1.5e-98; % identity: 94.9)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPG APGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESNH
        IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVN+PAR PPPAP+PGSAPQPSSTNGAPLA P MYQANPAAPGSGGY+SFTTMAQPSE NH
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESNH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K515 U1 small nuclear ribonucleoprotein C (e value: 5.7e-96; % identity: 92.39)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPVMPG APGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH
        IRPPVLPRP+PGAPGYLP PTMPPMMAPPGAP+PGQVN+P+R PPPAP+PGSAPQPSSTNGAPLAAP  YQANPAAPGSGGYDSFT+MAQP SESNH
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH

A0A1S3BS92 U1 small nuclear ribonucleoprotein C (e value: 9.7e-96; % identity: 92.39)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PG APGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH
        IRPPVLPRP+PGAPGYLP PTMPPMMAPPGAP+PGQVN+P+R PPPAPIPGSAPQPSSTNGAPLAAP  YQANPAAPGSGGYDSFT+MAQP SESNH
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH

A0A5A7UNW2 U1 small nuclear ribonucleoprotein C (e value: 9.7e-96; % identity: 92.39)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PG APGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH
        IRPPVLPRP+PGAPGYLP PTMPPMMAPPGAP+PGQVN+P+R PPPAPIPGSAPQPSSTNGAPLAAP  YQANPAAPGSGGYDSFT+MAQP SESNH
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQP-SESNH

A0A6J1H7P8 U1 small nuclear ribonucleoprotein C (e value: 1.8e-94; % identity: 92.82)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN
        IRPPV PRP+PGAPGYLP PTMPPMMAPPGA MPGQVN+P+R PPPAPIPGS PQ SSTNGAPL AP  YQANPAAPGSGGYDSFTTMAQPSESN
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN

A0A6J1KWY5 U1 small nuclear ribonucleoprotein C (e value: 5.7e-96; % identity: 93.85)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGL+PG
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN
        IRPPV PRPVPGAPGYLP PTMPPMMAPPGA MPGQVN+P+R PPPAPIPGS  QPSSTNGAPL APP YQANPAAPGSGGYDSFTTMAQPSESN
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESN

SwissProt top hits (e value, % identity, alignment)
A8XW44 U1 small nuclear ribonucleoprotein C (e value: 6.4e-20; % identity: 45.58)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MP+YYCDYCDT+LTHDSPSVRK HN G KHK NVR +YQ++ E Q Q L+DQ       +A A  ++  A  +  +G  P  PV   P+M GG PG+   
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAP-GYLPAPTMPPMMAPPGAPMPGQVNLPARLPPP
          P + PRP PG P G+  AP + P   PP   + G   +P  +P P
Subjt:  IRPPVLPRPVPGAP-GYLPAPTMPPMMAPPGAPMPGQVNLPARLPPP

C5XYW4 U1 small nuclear ribonucleoprotein C-2 (e value: 2.0e-50; % identity: 59.23)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGGAP
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR+YYQQFEEQQTQ+LIDQRIKEHLGQAAAF Q GA +NQH+L       RPRLP+LPTP MP G P
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGGAP

Query:  --GLLPGIRPPVLPRP-VPGAPGYLPAPTMPPMMAPPGA-PMPGQVNLPARLPPPAPIPGSAPQ-----------PSSTNGAP------LAAPP-MYQAN
           L+PG+RPP+LP P VPG PG    PTMP   APPG+ P PG    P  +P P   PGS P            P  T+G P       AAPP +YQAN
Subjt:  --GLLPGIRPPVLPRP-VPGAPGYLPAPTMPPMMAPPGA-PMPGQVNLPARLPPPAPIPGSAPQ-----------PSSTNGAP------LAAPP-MYQAN

Query:  PAAPG---SGGYDSFTTMAQ-------PSESNH
        P AP    SG   +  T  Q       PSE NH
Subjt:  PAAPG---SGGYDSFTTMAQ-------PSESNH

C5XZK6 U1 small nuclear ribonucleoprotein C-1 (e value: 3.5e-50; % identity: 58.23)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGG--
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR+YYQQFEEQQTQ+LIDQRIKEHLGQAAAF Q GA +NQH+L       RPRLP+LPTP MP G  
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGG--

Query:  -APG--LLPGIRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAP---MPGQVNLPARLPPPAPIPGSAPQ-----------PSSTNGAP------LAAPP-M
         APG  L+PG+RPP+L  P PG PGY   P  PP M  PGAP   MP     P  +P P   PGS P            P  T+G P       AAPP +
Subjt:  -APG--LLPGIRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAP---MPGQVNLPARLPPPAPIPGSAPQ-----------PSSTNGAP------LAAPP-M

Query:  YQANPAAPG---SGGYDSFTT-------MAQPSESNH
        YQ NP AP    SG   +  T        AQPSE NH
Subjt:  YQANPAAPG---SGGYDSFTT-------MAQPSESNH

F6HQ26 U1 small nuclear ribonucleoprotein C (e value: 6.3e-68; % identity: 70.23)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMP--GG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ AAFQQVGAAYNQHL+       RPRLPVLPTP MP  G 
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMP--GG

Query:  AP-----GLLPGIRPPVLPRPVPGAPGYLPAPTMPPMMAPPGA---PMPGQVNLPARLPP----PAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGG
        AP      L+PG+RPPVLPRPVPGAPGY+PAP MP MMAPPGA   PMP   +LP   PP    P  +PGS   P+S     +   PMYQANPA P SGG
Subjt:  AP-----GLLPGIRPPVLPRPVPGAPGYLPAPTMPPMMAPPGA---PMPGQVNLPARLPP----PAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGG

Query:  YDSFTTMAQPSESNH
        +DSF   AQ  E+NH
Subjt:  YDSFTTMAQPSESNH

Q56XE4 U1 small nuclear ribonucleoprotein C (e value: 1.3e-49; % identity: 61.42)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQ+LIDQRIKEHLGQ   +QQVGA +NQH+L  RPR P++   + PG  P    G
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPM-PGQVNLPARLPPPAPIPGS-------APQPSSTNGAP-----LAAPPMYQANPAAPGSGGYDS
        +RPPVLPRP+    GY+P P +P MMAPPGAP+ P   N   R P  APIPG        AP P    G P     L  PP Y  NPAAP SG +++
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPM-PGQVNLPARLPPPAPIPGS-------APQPSSTNGAP-----LAAPPMYQANPAAPGSGGYDS

Arabidopsis top hits (e value, % identity, alignment)
AT4G03120.1 C2H2 and C2HC zinc fingers superfamily protein (e value: 9.4e-51; % identity: 61.42)
Query:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG
        MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQ+LIDQRIKEHLGQ   +QQVGA +NQH+L  RPR P++   + PG  P    G
Subjt:  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPG

Query:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPM-PGQVNLPARLPPPAPIPGS-------APQPSSTNGAP-----LAAPPMYQANPAAPGSGGYDS
        +RPPVLPRP+    GY+P P +P MMAPPGAP+ P   N   R P  APIPG        AP P    G P     L  PP Y  NPAAP SG +++
Subjt:  IRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPM-PGQVNLPARLPPPAPIPGS-------APQPSSTNGAP-----LAAPPMYQANPAAPGSGGYDS
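
The unlabeled middle line of each alignment block above uses the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a percent identity relates to that line (the function name `alignment_stats` is hypothetical, and the input here is only the first 20 columns of the A8XW44 alignment, not the full hit):

```python
def alignment_stats(midline: str) -> tuple[float, float]:
    """Return (% identity, % positives) for one BLAST-style midline.

    Letters count as identities, '+' as conservative substitutions,
    and every character (including spaces) as one alignment column.
    """
    columns = len(midline)
    if columns == 0:
        raise ValueError("empty midline")
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return 100.0 * identical / columns, 100.0 * positive / columns

# First 20 alignment columns of the A8XW44 hit, copied from the record.
midline = "MP+YYCDYCDT+LTHDSPSV"
identity, positives = alignment_stats(midline)
print(f"{identity:.1f}% identity, {positives:.1f}% positives")
```

The values reported in the record are computed over each full alignment, so reproducing them would require concatenating all midline segments of a hit with their original column alignment, trailing spaces included.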


Sequences
CDS sequence
ATGCCTCGGTATTATTGTGACTATTGTGACACATATCTGACCCATGATTCTCCATCTGTGAGGAAGCAGCATAATGCAGGCTACAAACATAAGGCAAACGTGCGATCATA
CTATCAGCAATTTGAGGAGCAACAAACCCAAAATTTAATTGACCAGAGGATCAAAGAACATCTGGGTCAAGCAGCAGCATTCCAGCAGGTTGGTGCAGCCTACAATCAGC
ATTTACTCGGCCAAAGACCTCGTCTTCCTGTACTACCTACTCCTGTAATGCCGGGAGGTGCCCCGGGATTACTGCCCGGAATTAGGCCTCCAGTTTTGCCAAGACCAGTT
CCTGGTGCTCCAGGATATCTACCTGCTCCTACAATGCCACCCATGATGGCCCCGCCGGGAGCTCCTATGCCCGGCCAAGTGAACCTTCCTGCAAGGCTGCCACCTCCAGC
GCCAATTCCAGGGAGCGCGCCGCAGCCATCATCGACCAATGGTGCACCGTTGGCTGCGCCACCAATGTATCAAGCAAATCCAGCAGCACCAGGAAGTGGAGGGTATGATA
GTTTCACCACCATGGCTCAACCTTCCGAGTCTAACCATTAG
mRNA sequence
CCTTCTTTGCAAAAAAAGACTAGAAGTCTCTCTCTTCTTCGTTCTCGCAGAGTCGCTGATAGGGTTTTTGGTGGGTCTGCCATTGTTGCTTCTTCTTTGAGCTACCGCTA
TGCCTCGGTATTATTGTGACTATTGTGACACATATCTGACCCATGATTCTCCATCTGTGAGGAAGCAGCATAATGCAGGCTACAAACATAAGGCAAACGTGCGATCATAC
TATCAGCAATTTGAGGAGCAACAAACCCAAAATTTAATTGACCAGAGGATCAAAGAACATCTGGGTCAAGCAGCAGCATTCCAGCAGGTTGGTGCAGCCTACAATCAGCA
TTTACTCGGCCAAAGACCTCGTCTTCCTGTACTACCTACTCCTGTAATGCCGGGAGGTGCCCCGGGATTACTGCCCGGAATTAGGCCTCCAGTTTTGCCAAGACCAGTTC
CTGGTGCTCCAGGATATCTACCTGCTCCTACAATGCCACCCATGATGGCCCCGCCGGGAGCTCCTATGCCCGGCCAAGTGAACCTTCCTGCAAGGCTGCCACCTCCAGCG
CCAATTCCAGGGAGCGCGCCGCAGCCATCATCGACCAATGGTGCACCGTTGGCTGCGCCACCAATGTATCAAGCAAATCCAGCAGCACCAGGAAGTGGAGGGTATGATAG
TTTCACCACCATGGCTCAACCTTCCGAGTCTAACCATTAGAGCTTCTAATTCTTGTGCTGTCTGTGTATGATATCCAAAGCTCTTCTAATGCAAACCGAGACACAACAAT
TCTTAAAATATCTGAAGAAAAATTTGGAGCTTGGAGATTTAATGTAGAAAAGTTTTTTTTAGTGAGTGAGCAAAGGGAATGATTTTTGGTGGTTGATTCCTTGCTACTTG
TTTGGAGATAGAATATATAATGTGATTATATGGTAAAATGGGCAACTCTGAATTCGTTCAATGAGGCTTTTACTTTTACTTTTCTATTTGTAATTTCTAGTGCTCGTGAA
TGGAATGTATTTGATCATGTCA
Protein sequence
MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPV
PGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESNH
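
The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper, and only the first 30 and last 21 nucleotides of the CDS are used to keep the example short:

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# Excerpts copied from the CDS sequence in this record.
cds_start = "ATGCCTCGGTATTATTGTGACTATTGTGAC"
cds_end = "CCTTCCGAGTCTAACCATTAG"
print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Both excerpts translate to the corresponding ends of the listed protein (`MPRYYCDYCD` and `PSESNH`). Passing the full CDS, with its line breaks removed, to the same function would check the whole sequence.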