; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy1G010300 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy1G010300
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionserine/arginine repetitive matrix protein 1-like
Genome locationGy14Chr1:6437272..6439023
RNA-Seq ExpressionCsGy1G010300
SyntenyCsGy1G010300
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011648413.2 probable serine/threonine-protein kinase DDB_G0280111 [Cucumis sativus]3.33e-25197.85Show/hide
Query:  MDESLTEGFKFRGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARNSNPNSNSAPSSPAPPPTPIPTTRKTKSQP
        MDESLTEGFKFRGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARNSNPNSNSAPSSPAPPPTPIPTTRKTKSQP
Subjt:  MDESLTEGFKFRGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARNSNPNSNSAPSSPAPPPTPIPTTRKTKSQP

Query:  PTPSPALPTTPSPT----RKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDD
        PTPSPALP TPSPT    RKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPN PASSDPSNDIGHRLLQGLSFEGKDLDD
Subjt:  PTPSPALPTTPSPT----RKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDD

Query:  ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDY
        ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDY
Subjt:  ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDY

Query:  YSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI
        YS ICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGI+AAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI
Subjt:  YSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI

XP_011652531.1 protein ENL [Cucumis sativus]2.69e-15873.1Show/hide
Query:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARN-------SNPNSNSAPSSPAPPPTPIPTTRKTKSQPPTPS
        RGI  P        ++   SP   +KP D E K SA K+P+KDPNFPSSSKPKPQAAA N       +NPNS +  S+P PPPTPI +T K KSQPPTPS
Subjt:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARN-------SNPNSNSAPSSPAPPPTPIPTTRKTKSQPPTPS

Query:  ----PALPTTPSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDDILKG
            P LP   SP R++K QP  PSS+SK N VRRI NDNS KA PK SPTSGSD  +K VKTTA P+ P  SD S+DIG RLLQ LS EGKDLDDILKG
Subjt:  ----PALPTTPSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDDILKG

Query:  NSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDYYSTI
        N+IDDLMGSNN+KEESS RNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIG YSS+  +G      IAFWSELTERDKV+QFFIGLNDYYS I
Subjt:  NSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDYYSTI

Query:  CSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI
        CSQILVNQPFPTVEEAYSEIIREEKRRELFVALG VAAQVIQSSYQ GSSNNGDNKNLGIDQEID SI
Subjt:  CSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI

XP_038895148.1 proline-rich receptor-like protein kinase PERK2 isoform X1 [Benincasa hispida]1.45e-7648.41Show/hide
Query:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSS-------KPKPQAAARNSNPN-----SNSAPSSPAPPPTPIPTTRKTKSQ
        RGI  P        ++   SP  TKK T    K + NKHP    N   S         P P  AA N +PN     S S PSSP PPPTP PTTR     
Subjt:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSS-------KPKPQAAARNSNPN-----SNSAPSSPAPPPTPIPTTRKTKSQ

Query:  PPTPSPALPTTPSPTRKTKCQPTMPSSSSKNNVVRR--IYNDN--------SPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSF
                            QPT PSSS+  NVVR+  +  DN        SPK   K SP S     +K V T+  P+ PASS  ++D+  RLLQ LSF
Subjt:  PPTPSPALPTTPSPTRKTKCQPTMPSSSSKNNVVRR--IYNDN--------SPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSF

Query:  EGKDLDDILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQF
        +GKD+ DIL+G SI D+MGSN KKEE+S +++  L +LQIY++IASHRQ NL VE YF+KL  LW+++  Y +D AQ  SS G I   SELTER KV+QF
Subjt:  EGKDLDDILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQF

Query:  FIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRS
         +GLND Y+TIC QILV +PFPTVEEAYSEII EEKRREL  AL  VAA+VIQS++   + N+  N N GIDQE+D +
Subjt:  FIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRS

XP_038895149.1 proline-rich receptor-like protein kinase PERK2 isoform X2 [Benincasa hispida]5.44e-7347.88Show/hide
Query:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSS-------KPKPQAAARNSNPN-----SNSAPSSPAPPPTPIPTTRKTKSQ
        RGI  P        ++   SP  TKK T    K + NKHP    N   S         P P  AA N +PN     S S PSSP PPPTP PTTR     
Subjt:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSS-------KPKPQAAARNSNPN-----SNSAPSSPAPPPTPIPTTRKTKSQ

Query:  PPTPSPALPTTPSPTRKTKCQPTMPSSSSKNNVVRR--IYNDN--------SPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSF
                            QPT PSSS+  NVVR+  +  DN        SPK   K SP S     +K V T+  P+ PASS  ++D+  RLLQ LSF
Subjt:  PPTPSPALPTTPSPTRKTKCQPTMPSSSSKNNVVRR--IYNDN--------SPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSF

Query:  EGKDLDDILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQF
        +  D+ DIL+G SI D+MGSN KKEE+S +++  L +LQIY++IASHRQ NL VE YF+KL  LW+++  Y +D AQ  SS G I   SELTER KV+QF
Subjt:  EGKDLDDILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQF

Query:  FIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRS
         +GLND Y+TIC QILV +PFPTVEEAYSEII EEKRREL  AL  VAA+VIQS++   + N+  N N GIDQE+D +
Subjt:  FIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRS

XP_038895286.1 hybrid signal transduction histidine kinase L-like isoform X2 [Benincasa hispida]7.10e-4150.26Show/hide
Query:  LQGLSFEGKDLDD-ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTE
        LQ LS +GKDL   +L  NSI + MGS+ K+  SS  N S +   QIY++IA HRQ N S+  YF KL+ LW+++ T+ +D  Q  S  G     SE  E
Subjt:  LQGLSFEGKDLDD-ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTE

Query:  RDKVMQFFIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSY------QYGSSNNGDNKNLGIDQEIDRS
        R+KVMQF +GLND YS IC+QIL++ PFPT+E+AYS +IREEK REL V L  VA +VIQ+++         SSNNGDN N G+ Q +D S
Subjt:  RDKVMQFFIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSY------QYGSSNNGDNKNLGIDQEIDRS

TrEMBL top hitse value%identityAlignment
A0A0A0LRE6 Uncharacterized protein1.30e-15873.1Show/hide
Query:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARN-------SNPNSNSAPSSPAPPPTPIPTTRKTKSQPPTPS
        RGI  P        ++   SP   +KP D E K SA K+P+KDPNFPSSSKPKPQAAA N       +NPNS +  S+P PPPTPI +T K KSQPPTPS
Subjt:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARN-------SNPNSNSAPSSPAPPPTPIPTTRKTKSQPPTPS

Query:  ----PALPTTPSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDDILKG
            P LP   SP R++K QP  PSS+SK N VRRI NDNS KA PK SPTSGSD  +K VKTTA P+ P  SD S+DIG RLLQ LS EGKDLDDILKG
Subjt:  ----PALPTTPSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDDILKG

Query:  NSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDYYSTI
        N+IDDLMGSNN+KEESS RNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIG YSS+  +G      IAFWSELTERDKV+QFFIGLNDYYS I
Subjt:  NSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDYYSTI

Query:  CSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI
        CSQILVNQPFPTVEEAYSEIIREEKRRELFVALG VAAQVIQSSYQ GSSNNGDNKNLGIDQEID SI
Subjt:  CSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI

A0A0A0LU31 Uncharacterized protein1.61e-25197.85Show/hide
Query:  MDESLTEGFKFRGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARNSNPNSNSAPSSPAPPPTPIPTTRKTKSQP
        MDESLTEGFKFRGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARNSNPNSNSAPSSPAPPPTPIPTTRKTKSQP
Subjt:  MDESLTEGFKFRGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARNSNPNSNSAPSSPAPPPTPIPTTRKTKSQP

Query:  PTPSPALPTTPSPT----RKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDD
        PTPSPALP TPSPT    RKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPN PASSDPSNDIGHRLLQGLSFEGKDLDD
Subjt:  PTPSPALPTTPSPT----RKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDD

Query:  ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDY
        ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDY
Subjt:  ILKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDY

Query:  YSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI
        YS ICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGI+AAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI
Subjt:  YSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI

A0A5D3BME5 Early nodulin-20-like4.80e-4160.21Show/hide
Query:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSS-SKPKPQAAARNSNPNSNSA-----PSSPAP--PPTPIPTTRKTKSQPPTP
        RGI  P        ++   SP   +K TD E K SANKHP+K P  PSS  K KPQAAA N NPN N       PSSPAP  PPTPI +T K KSQP TP
Subjt:  RGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSS-SKPKPQAAARNSNPNSNSA-----PSSPAP--PPTPIPTTRKTKSQPPTP

Query:  S--PALPTTPSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGK
        S  P LP T    RKTK QP  PSSSSK NVVRRI ++NS KASPK SPTS SD  +K VK TA PN PA SDPSNDIG RLLQ LSFEG+
Subjt:  S--PALPTTPSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGK

A0A6J1C5Z8 uncharacterized protein LOC1110085887.58e-3648.65Show/hide
Query:  LQGLSFEGKDLDDI-LKGNSIDDLMGSNNKKEE---SSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSE
        LQ LS +GKDL  I L  NSI + +GS+  +E    ++PR      I QIY+ IASHRQ N SV  YF KLK LW+++ TYS D  Q  S  G +   S 
Subjt:  LQGLSFEGKDLDDI-LKGNSIDDLMGSNNKKEE---SSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSE

Query:  LTERDKVMQFFIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEI
          ER+KVMQF +GLN+ YSTIC QIL+ QPFPT+E+AYS IIREEKR EL  +L +VAA+V+++ +   +  + +  + GI +E+
Subjt:  LTERDKVMQFFIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEI

A0A6J1GTG4 serine/arginine repetitive matrix protein 1-like8.26e-3749.74Show/hide
Query:  SDPSNDIGHRLLQGLSFEGKDLDDI-LKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSN
        SDP  D+ H+L Q LS + KDL +I L  N + + + S  K+EE S +  +S  + QIY++IASH QGN S+  Y  KLK LW+++  Y         S 
Subjt:  SDPSNDIGHRLLQGLSFEGKDLDDI-LKGNSIDDLMGSNNKKEESSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSN

Query:  GTIAFWSELTERDKVMQFFIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSY--QYGSSNNGDNKNL
        G+    SE  ER+KVMQF IGLND YSTIC+QIL  +PFPTVE+A   I+REEKRREL ++L IVAA+VIQ+++  Q G S NGDN+ +
Subjt:  GTIAFWSELTERDKVMQFFIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGIVAAQVIQSSY--QYGSSNNGDNKNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).5.9e-0628.71Show/hide
Query:  LAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYS--SDFAQGYSSNGTIAFWSELTERDKVMQFFIG--LNDYYSTICSQILVNQPFPTVEEAYSE
        L I Q+ +++A+ RQG  SVE YF KL K+W ++  Y+   +   G  +        E  E+++  +F +G  LN  +  + ++I+  +P P++ EA++ 
Subjt:  LAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYS--SDFAQGYSSNGTIAFWSELTERDKVMQFFIG--LNDYYSTICSQILVNQPFPTVEEAYSE

Query:  I
        +
Subjt:  I


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGAAAGTTTAACGGAAGGGTTCAAATTTAGAGGAATCTTTGATCCCCATTTTAAATCTCCAGTTAATCGCCGCAAGATTAATTCTTCTCCAAACGGAACCAAGAA
GCCTACTGATGCAGAGGTTAAAAATTCTGCAAACAAACATCCAGAAAAAGATCCCAATTTTCCCAGCTCTTCAAAGCCAAAGCCACAAGCTGCAGCAAGGAATTCTAATC
CCAACAGCAACTCCGCGCCATCTTCTCCAGCTCCACCTCCGACTCCTATTCCGACTACTCGAAAGACAAAAAGTCAACCCCCCACTCCTTCTCCAGCTCTACCAACGACT
CCGAGTCCGACTCGAAAAACAAAATGTCAACCCACCATGCCTTCTTCTTCTTCAAAGAACAACGTTGTAAGACGTATTTATAATGATAATTCTCCAAAAGCTTCTCCCAA
GATTTCTCCCACCTCTGGGTCAGATTGTCATGAGAAAGCTGTCAAAACTACTGCTTATCCTAATCCACCTGCATCTTCTGATCCTTCTAATGATATTGGTCATCGGTTGT
TACAAGGCCTTTCTTTTGAAGGTAAAGACCTTGACGACATCCTCAAAGGAAACTCAATAGATGATTTAATGGGCTCAAATAATAAAAAGGAAGAATCTTCACCTCGAAAC
GTTTCTAGTTTGGCTATATTACAAATTTACCAGAAAATTGCATCTCATCGACAAGGAAACTTATCCGTTGAACGTTACTTCAAAAAGCTCAAGAAATTATGGAATGATAT
TGGAACCTATAGCAGTGATTTTGCTCAAGGTTATTCTAGCAATGGTACAATTGCATTTTGGAGCGAGCTTACAGAAAGAGACAAAGTTATGCAATTTTTTATTGGACTAA
ATGATTATTATTCCACAATTTGCTCCCAAATCCTAGTTAACCAGCCATTCCCAACAGTGGAGGAAGCTTATTCTGAAATAATTCGAGAAGAAAAACGTAGAGAATTGTTT
GTTGCATTAGGAATTGTGGCGGCACAAGTGATTCAAAGTAGTTACCAGTATGGTTCATCCAACAATGGTGATAATAAAAATCTTGGAATTGATCAAGAAATTGACAGAAG
TATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGAAAGTTTAACGGAAGGGTTCAAATTTAGAGGAATCTTTGATCCCCATTTTAAATCTCCAGTTAATCGCCGCAAGATTAATTCTTCTCCAAACGGAACCAAGAA
GCCTACTGATGCAGAGGTTAAAAATTCTGCAAACAAACATCCAGAAAAAGATCCCAATTTTCCCAGCTCTTCAAAGCCAAAGCCACAAGCTGCAGCAAGGAATTCTAATC
CCAACAGCAACTCCGCGCCATCTTCTCCAGCTCCACCTCCGACTCCTATTCCGACTACTCGAAAGACAAAAAGTCAACCCCCCACTCCTTCTCCAGCTCTACCAACGACT
CCGAGTCCGACTCGAAAAACAAAATGTCAACCCACCATGCCTTCTTCTTCTTCAAAGAACAACGTTGTAAGACGTATTTATAATGATAATTCTCCAAAAGCTTCTCCCAA
GATTTCTCCCACCTCTGGGTCAGATTGTCATGAGAAAGCTGTCAAAACTACTGCTTATCCTAATCCACCTGCATCTTCTGATCCTTCTAATGATATTGGTCATCGGTTGT
TACAAGGCCTTTCTTTTGAAGGTAAAGACCTTGACGACATCCTCAAAGGAAACTCAATAGATGATTTAATGGGCTCAAATAATAAAAAGGAAGAATCTTCACCTCGAAAC
GTTTCTAGTTTGGCTATATTACAAATTTACCAGAAAATTGCATCTCATCGACAAGGAAACTTATCCGTTGAACGTTACTTCAAAAAGCTCAAGAAATTATGGAATGATAT
TGGAACCTATAGCAGTGATTTTGCTCAAGGTTATTCTAGCAATGGTACAATTGCATTTTGGAGCGAGCTTACAGAAAGAGACAAAGTTATGCAATTTTTTATTGGACTAA
ATGATTATTATTCCACAATTTGCTCCCAAATCCTAGTTAACCAGCCATTCCCAACAGTGGAGGAAGCTTATTCTGAAATAATTCGAGAAGAAAAACGTAGAGAATTGTTT
GTTGCATTAGGAATTGTGGCGGCACAAGTGATTCAAAGTAGTTACCAGTATGGTTCATCCAACAATGGTGATAATAAAAATCTTGGAATTGATCAAGAAATTGACAGAAG
TATTTAACGTTGGATATCCTAATTAAGCAAAGTTGAAGATTTTCACCTGAAAGAAAATTCAAGGATTAATTTGAAGGACGTACATAGCTAAGACAAACTTGTCCAGACTG
TGTTGAT
Protein sequenceShow/hide protein sequence
MDESLTEGFKFRGIFDPHFKSPVNRRKINSSPNGTKKPTDAEVKNSANKHPEKDPNFPSSSKPKPQAAARNSNPNSNSAPSSPAPPPTPIPTTRKTKSQPPTPSPALPTT
PSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEKAVKTTAYPNPPASSDPSNDIGHRLLQGLSFEGKDLDDILKGNSIDDLMGSNNKKEESSPRN
VSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIAFWSELTERDKVMQFFIGLNDYYSTICSQILVNQPFPTVEEAYSEIIREEKRRELF
VALGIVAAQVIQSSYQYGSSNNGDNKNLGIDQEIDRSI