; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G07670 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G07670
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDNA-directed RNA polymerase II subunit 1-like
Genome locationChr3:6685352..6686842
RNA-Seq ExpressionCSPI03G07670
SyntenyCSPI03G07670
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004133797.1 uncharacterized protein LOC101205942 [Cucumis sativus]6.2e-15799.32Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE
        MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVD SPAAKATRSPPDSIGDKYLERRNGE
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE

Query:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS
        TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS
Subjt:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS

Query:  SAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYKPKLRQ
        SAGYRLGGETLKKKETEDDGDVDGYGHEDKKTG KKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYKPKLRQ
Subjt:  SAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYKPKLRQ

XP_008437851.1 PREDICTED: zyxin-like [Cucumis melo]2.4e-12481.79Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRSPPDSIGDK
        MANLPR GR RQRL +VPPPVPAAAQPAVEP+Y+ +P+AT+ITTPAPASPRRESPRPLSSP KKATSPFAS        R+DRSPAAKAT SPPDS+ DK
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRSPPDSIGDK

Query:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQS-EHGSGKSVHQKQQKPGVMKLKGHNV
        Y ERRNGETTPPL+PAKSR AKT PLSPLALPR+QV TGNGTTAQPRVQPEVETKGIVYNK   EKPSKSNR S E+GS KS HQK+QKP V+KLKGHNV
Subjt:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQS-EHGSGKSVHQKQQKPGVMKLKGHNV

Query:  GAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKK-PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK
        GAVME+NKSS GYRLGGETLKK ETED GDV GYGHEDKKT TKKK PPI+AFMNSNFQSVNNSLLFDSSC HRDPGLHL+FP+A DG GA+VDG KSYK
Subjt:  GAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKK-PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK

Query:  PK
        PK
Subjt:  PK

XP_022147458.1 vegetative cell wall protein gp1-like [Momordica charantia]4.6e-5953.49Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETM--------PYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDK
        MANLPR GRT QR + V PP PAA  P+V P  +T+        P A+  ++P  ASP R  P   +SP  +A    A RV  SP AK+ RSPP S   K
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETM--------PYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDK

Query:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVG
        Y +R + +TTPPL+PAKSR   T PLSPLALPR    T NG      V PEVE K ++YNK   EKP+KS R SEHGSGK    +     V+ L GHNVG
Subjt:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVG

Query:  AVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAAD-GGGAVVDG-DKSYK
        AVME+N+SSA +   GE +KKKE+E +     +G++++KTG KK+ P +AFMNSNFQSVNNS+L+DSSC HRDPGLHL+F + AD GGGA+VDG +K+YK
Subjt:  AVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAAD-GGGAVVDG-DKSYK

Query:  P
        P
Subjt:  P

XP_022974816.1 wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima]1.0e-5550.68Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE
        M+N PR GR RQR S   PPV     PA E K ET+P+  +  T  PA    +   P++SP      P  + V   P AK   SPP S   KY +R + +
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE

Query:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS
        T+P  +P++S      P  PLALP +     +G T QPR+Q EVE K IVYNK   EKP KS+R  E+GSGKS H+KQ+    + L GHNVGAVME++K 
Subjt:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS

Query:  SAGYRLGGETLKKKETE-DDGDVDGYGH---EDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK
        SA +RLGGET++K +TE  DG  DG      ++KK   KKK P++AFMNSNFQSVNNS+L+DSSC HRDPGLHL F +AADG GA VDG K+YK
Subjt:  SAGYRLGGETLKKKETE-DDGDVDGYGH---EDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK

XP_038879417.1 proline-rich receptor-like protein kinase PERK8 [Benincasa hispida]1.3e-9065.58Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYA-TSI-------TTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRS
        MANLPR+GRTRQR S+V PP+ AA QP  EPK E  P+A TSI       TTPAPASP R+SPR ++SP KKATSPFAS        RVD  PAAKATRS
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYA-TSI-------TTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRS

Query:  PPDSIGDKYLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPG-V
        PPDS  +KYLE RNGETTPPL+PAKSR  KT PLSPL LPR+ VI+ + TTA PR QP VETKGIVYNKA  EKP+K++R SE+GSGK  HQKQQ    V
Subjt:  PPDSIGDKYLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPG-V

Query:  MKLKGHNVGAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVV
        + L GHNVGAVME+NKSS GYRLGGET+K KET+  G    +GH++K  G K  PP++AFMN+NFQS+NNS+L+DSSC H DPGLHLS P + DG GA V
Subjt:  MKLKGHNVGAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVV

Query:  DGDKSYKP
         G KSYKP
Subjt:  DGDKSYKP

TrEMBL top hitse value%identityAlignment
A0A0A0L320 Uncharacterized protein3.0e-15799.32Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE
        MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVD SPAAKATRSPPDSIGDKYLERRNGE
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE

Query:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS
        TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS
Subjt:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS

Query:  SAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYKPKLRQ
        SAGYRLGGETLKKKETEDDGDVDGYGHEDKKTG KKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYKPKLRQ
Subjt:  SAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYKPKLRQ

A0A1S3AV38 zyxin-like1.1e-12481.79Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRSPPDSIGDK
        MANLPR GR RQRL +VPPPVPAAAQPAVEP+Y+ +P+AT+ITTPAPASPRRESPRPLSSP KKATSPFAS        R+DRSPAAKAT SPPDS+ DK
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRSPPDSIGDK

Query:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQS-EHGSGKSVHQKQQKPGVMKLKGHNV
        Y ERRNGETTPPL+PAKSR AKT PLSPLALPR+QV TGNGTTAQPRVQPEVETKGIVYNK   EKPSKSNR S E+GS KS HQK+QKP V+KLKGHNV
Subjt:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQS-EHGSGKSVHQKQQKPGVMKLKGHNV

Query:  GAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKK-PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK
        GAVME+NKSS GYRLGGETLKK ETED GDV GYGHEDKKT TKKK PPI+AFMNSNFQSVNNSLLFDSSC HRDPGLHL+FP+A DG GA+VDG KSYK
Subjt:  GAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKK-PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK

Query:  PK
        PK
Subjt:  PK

A0A5A7TZ24 Zyxin-like1.1e-12481.79Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRSPPDSIGDK
        MANLPR GR RQRL +VPPPVPAAAQPAVEP+Y+ +P+AT+ITTPAPASPRRESPRPLSSP KKATSPFAS        R+DRSPAAKAT SPPDS+ DK
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFAS--------RVDRSPAAKATRSPPDSIGDK

Query:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQS-EHGSGKSVHQKQQKPGVMKLKGHNV
        Y ERRNGETTPPL+PAKSR AKT PLSPLALPR+QV TGNGTTAQPRVQPEVETKGIVYNK   EKPSKSNR S E+GS KS HQK+QKP V+KLKGHNV
Subjt:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQS-EHGSGKSVHQKQQKPGVMKLKGHNV

Query:  GAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKK-PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK
        GAVME+NKSS GYRLGGETLKK ETED GDV GYGHEDKKT TKKK PPI+AFMNSNFQSVNNSLLFDSSC HRDPGLHL+FP+A DG GA+VDG KSYK
Subjt:  GAVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKK-PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK

Query:  PK
        PK
Subjt:  PK

A0A6J1D1D2 vegetative cell wall protein gp1-like2.2e-5953.49Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETM--------PYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDK
        MANLPR GRT QR + V PP PAA  P+V P  +T+        P A+  ++P  ASP R  P   +SP  +A    A RV  SP AK+ RSPP S   K
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETM--------PYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDK

Query:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVG
        Y +R + +TTPPL+PAKSR   T PLSPLALPR    T NG      V PEVE K ++YNK   EKP+KS R SEHGSGK    +     V+ L GHNVG
Subjt:  YLERRNGETTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVG

Query:  AVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAAD-GGGAVVDG-DKSYK
        AVME+N+SSA +   GE +KKKE+E +     +G++++KTG KK+ P +AFMNSNFQSVNNS+L+DSSC HRDPGLHL+F + AD GGGA+VDG +K+YK
Subjt:  AVMEVNKSSAGYRLGGETLKKKETEDDGDVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAAD-GGGAVVDG-DKSYK

Query:  P
        P
Subjt:  P

A0A6J1ICG8 wiskott-Aldrich syndrome protein family member 2-like5.1e-5650.68Show/hide
Query:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE
        M+N PR GR RQR S   PPV     PA E K ET+P+  +  T  PA    +   P++SP      P  + V   P AK   SPP S   KY +R + +
Subjt:  MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGE

Query:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS
        T+P  +P++S      P  PLALP +     +G T QPR+Q EVE K IVYNK   EKP KS+R  E+GSGKS H+KQ+    + L GHNVGAVME++K 
Subjt:  TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKS

Query:  SAGYRLGGETLKKKETE-DDGDVDGYGH---EDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK
        SA +RLGGET++K +TE  DG  DG      ++KK   KKK P++AFMNSNFQSVNNS+L+DSSC HRDPGLHL F +AADG GA VDG K+YK
Subjt:  SAGYRLGGETLKKKETE-DDGDVDGYGH---EDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46630.1 unknown protein6.0e-0924.84Show/hide
Query:  RQRLSSVPPPVPAAAQPAVEPKYETMPYAT--SITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGETTPPLTPA
        RQ+   + PP   A  P   P  E  PY +  S     P  P+  +P P    S   + P    V  +   +   SPP             E+    +P+
Subjt:  RQRLSSVPPPVPAAAQPAVEPKYETMPYAT--SITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGETTPPLTPA

Query:  KSRPAKTSP----LSPLALPRS-----------QVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPG-----------
        +S   + +P    LSP +LP S            ++T   T+         +     YN+  +   + S  Q+++  G +  +  ++P            
Subjt:  KSRPAKTSP----LSPLALPRS-----------QVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPG-----------

Query:  VMKLKGHNVGAVMEVNKSSAGYRLGGETL-----------KKKETEDDGDVDGYGHEDKKTGTKKKP-------PISAFMNSNFQSVNNSLLFDSSCQHR
        V+ + G N GAVME+ +S  G + GG              K +  +          E KK  TK  P       P+ AFMNSN Q +NNS++++S+  H 
Subjt:  VMKLKGHNVGAVMEVNKSSAGYRLGGETL-----------KKKETEDDGDVDGYGHEDKKTGTKKKP-------PISAFMNSNFQSVNNSLLFDSSCQHR

Query:  DPGLHLSFPN--AADGGGAVVD
        DPG+HL       +D G  V D
Subjt:  DPGLHLSFPN--AADGGGAVVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAATCTTCCTCGCTTGGGCCGTACACGCCAACGTCTTTCTTCGGTACCGCCGCCAGTTCCCGCCGCAGCACAGCCTGCTGTGGAACCCAAGTATGAAACTATGCC
GTATGCTACGTCCATCACTACGCCGGCCCCGGCTTCTCCCCGGAGAGAATCCCCTCGCCCGCTTTCTTCTCCGTCGAAGAAAGCCACGTCACCGTTTGCATCACGTGTGG
ACAGATCGCCGGCGGCGAAGGCAACACGGTCGCCTCCGGATTCGATTGGAGATAAATATCTTGAGAGGAGGAATGGGGAAACAACCCCACCTCTGACGCCGGCCAAGTCC
CGGCCAGCAAAGACGTCGCCGCTATCTCCTCTTGCTCTGCCACGTAGCCAAGTGATAACCGGGAACGGGACGACGGCTCAACCCAGGGTTCAACCGGAGGTGGAGACCAA
AGGGATTGTATACAACAAGGCCGCTGATGAGAAGCCATCAAAGTCAAACCGGCAGTCGGAGCACGGCTCCGGCAAGTCAGTACACCAAAAGCAGCAGAAGCCGGGGGTTA
TGAAACTCAAAGGACATAACGTTGGCGCCGTCATGGAAGTAAACAAGTCATCCGCTGGCTACCGTTTGGGCGGAGAAACCTTAAAAAAGAAAGAAACAGAAGACGACGGG
GATGTCGATGGATATGGACATGAAGATAAGAAAACGGGGACAAAGAAGAAACCGCCGATAAGCGCATTTATGAACAGTAATTTTCAGAGTGTAAACAATTCTCTTCTGTT
CGACTCGTCGTGCCAACACCGTGATCCCGGCCTGCATCTCTCGTTTCCCAATGCGGCGGATGGCGGTGGAGCCGTTGTTGACGGCGATAAGAGTTACAAGCCCAAGCTGC
GCCAGTGA
mRNA sequenceShow/hide mRNA sequence
CATTAAAGAAATTGTGAGAATTTTATTTGTTTTTAGAGAATCATATGATCTTCTCTCATAATTTTCAATTTCCACATTATAAATAAATTGGCAACACATAAGCCTTTTCA
ATATATCGCTTAATTCTATTTTCAATTTCCATTTTCTTTGCTATGGCAAATCTTCCTCGCTTGGGCCGTACACGCCAACGTCTTTCTTCGGTACCGCCGCCAGTTCCCGC
CGCAGCACAGCCTGCTGTGGAACCCAAGTATGAAACTATGCCGTATGCTACGTCCATCACTACGCCGGCCCCGGCTTCTCCCCGGAGAGAATCCCCTCGCCCGCTTTCTT
CTCCGTCGAAGAAAGCCACGTCACCGTTTGCATCACGTGTGGACAGATCGCCGGCGGCGAAGGCAACACGGTCGCCTCCGGATTCGATTGGAGATAAATATCTTGAGAGG
AGGAATGGGGAAACAACCCCACCTCTGACGCCGGCCAAGTCCCGGCCAGCAAAGACGTCGCCGCTATCTCCTCTTGCTCTGCCACGTAGCCAAGTGATAACCGGGAACGG
GACGACGGCTCAACCCAGGGTTCAACCGGAGGTGGAGACCAAAGGGATTGTATACAACAAGGCCGCTGATGAGAAGCCATCAAAGTCAAACCGGCAGTCGGAGCACGGCT
CCGGCAAGTCAGTACACCAAAAGCAGCAGAAGCCGGGGGTTATGAAACTCAAAGGACATAACGTTGGCGCCGTCATGGAAGTAAACAAGTCATCCGCTGGCTACCGTTTG
GGCGGAGAAACCTTAAAAAAGAAAGAAACAGAAGACGACGGGGATGTCGATGGATATGGACATGAAGATAAGAAAACGGGGACAAAGAAGAAACCGCCGATAAGCGCATT
TATGAACAGTAATTTTCAGAGTGTAAACAATTCTCTTCTGTTCGACTCGTCGTGCCAACACCGTGATCCCGGCCTGCATCTCTCGTTTCCCAATGCGGCGGATGGCGGTG
GAGCCGTTGTTGACGGCGATAAGAGTTACAAGCCCAAGCTGCGCCAGTGAGGATATGGACGGAGAAGTACTACTATTTTATTATATGAAATAAGCCATTTATATGGGTAT
TTTATGTAACCATTAATAAACTATATATTATAAGAAATTTCTCTTAAAAAGTATAGGCTAGTTTATAGAAAATAAAAAAAAGGCTTTTTTTTAAGAAATAAAACAAAATA
AAATATTATGGTAAAAAACTCGAG
Protein sequenceShow/hide protein sequence
MANLPRLGRTRQRLSSVPPPVPAAAQPAVEPKYETMPYATSITTPAPASPRRESPRPLSSPSKKATSPFASRVDRSPAAKATRSPPDSIGDKYLERRNGETTPPLTPAKS
RPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADEKPSKSNRQSEHGSGKSVHQKQQKPGVMKLKGHNVGAVMEVNKSSAGYRLGGETLKKKETEDDG
DVDGYGHEDKKTGTKKKPPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVVDGDKSYKPKLRQ