CuGenDBv2

CSPI04G21130 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G21130
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr4:19558813..19561571
RNA-Seq Expression: CSPI04G21130
Synteny: CSPI04G21130
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
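
The genome location field can be split into chromosome, start, and end to get the span of the gene on the chromosome. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re


def parse_location(loc):
    """Split 'Chr:start..end' into (chrom, start, end, span).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("Chr4:19558813..19561571")
print(chrom, start, end, span)  # Chr4 19558813 19561571 2759
```

Under that assumption the locus spans 2759 bp on Chr4.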


Homology
GenBank top hits (e value, % identity, alignment)
KAA0042337.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 4.1e-78; % identity: 60.85)
Query:  EANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQH------------
        +ANL++ ELVAMI+KVNVI   EGWWLD GAS  VCHDLSL RKYNE+ DK+I+LGDHHTTKV G+G  EVELKFTS KTLVL+E  H            
Subjt:  EANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQH------------

Query:  -----------------------------------------------------HIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNSGSLNSQ
                                                              IGYAENSKAYRF+DLENKVIIESNDVDFFEDRF FKSRN       
Subjt:  -----------------------------------------------------HIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNSGSLNSQ

Query:  SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRT
         SGGSSSSSLPS++IQTQDKEVDPE RRSKRARTIK+FG+DF+MYNVEDPKDLT ALS VDANLWQ+AINDEMD LESNRT
Subjt:  SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRT

KAA0048414.1 uncharacterized protein E6C27_scaffold264G001400 [Cucumis melo var. makuwa] (e value: 2.4e-70; % identity: 70.44)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV---NTLRHKTKDFSLESLIT
        KV  AC+ EK K+ E  PTEEQ+K+L  W ETDFICKNLILNGL +EL++YYSTM+TTKEVWDALQKKYDT+E  SKKYV   NTLRHKTK+FSLESLIT
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV---NTLRHKTKDFSLESLIT

Query:  CLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQE
         L IEE   K     +ANL+E ELVAMI++VN+IG  EGWWLD GAS  VC DLSL RKYNE+ DK+I+LGDHHTTKVVG+G  EVELKFTS KTLVL+E
Subjt:  CLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQE

Query:  EQH
          H
Subjt:  EQH

KAA0050409.1 hypothetical protein E6C27_scaffold1166G00260 [Cucumis melo var. makuwa] (e value: 7.0e-86; % identity: 46.98)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------
        KV  AC+ EKPK+ E  PT+EQ+K+LT W ETDFICKNLILNGL DEL++YYSTM+T K+VW+ALQKKYDT+E GSKKY                     
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------

Query:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------
                                             NTLRHKTK+FSLE+L T LRIEEEA+K D+KEE                              
Subjt:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------

Query:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY
                 ANL+E ELVAMI++VNVIG  EGWWLD GAS  VCHDLS+ RKYNE+ DK+I+LGDHH TKVVG+   EVELKFTS KTLV+++  H    
Subjt:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY

Query:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL
         +N        L ++ ++  N   F +    + F+    N       S+ G          IQ+QDKEVDP+ RRSKRART+K+FGEDF+ YNVED KDL
Subjt:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL

Query:  TGALSSVDANLWQKAINDEMDYLESNRTWH
        T ALSSVDANLWQ+AINDE+   +   TW+
Subjt:  TGALSSVDANLWQKAINDEMDYLESNRTWH

KAA0065374.1 uncharacterized protein E6C27_scaffold17G00360 [Cucumis melo var. makuwa] (e value: 2.7e-74; % identity: 47.03)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------
        KV  AC+ EK K+ E  P EEQ+K+L  W ETDFICKNLILNGL DEL++YYSTM+T KEVWDALQKKYDTKE  SKKY                     
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------

Query:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGE--FEGWWLDN
                                             NTLRHKTK+FSLESLIT L+IEEEARK D+KEE N + ++    + K ++  E   +   +  
Subjt:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGE--FEGWWLDN

Query:  GASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNS
           P V       +KY           DHHTTK+ G+G  EVELKFTS KTLVL+E  H     +N  +    +         +D+        F  +  
Subjt:  GASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNS

Query:  GS-----LNSQ-SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIK----EFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRTWHL
         +     LN + +   SS+  L S  IQTQDKEVD E RRSKRART+K    +FGEDF+MYNVEDPKDLT ALSSVDANLWQ+AINDEMD LESNRTWHL
Subjt:  GS-----LNSQ-SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIK----EFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRTWHL

Query:  VDLP
        VDLP
Subjt:  VDLP

TYJ98000.1 hypothetical protein E5676_scaffold487G00230 [Cucumis melo var. makuwa] (e value: 9.1e-86; % identity: 46.74)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------
        KV  AC+ EKPK+ E  PT+EQ+K+LT W ETDFICKNLILNGL DEL++YYSTM+T K+VW+ALQKKYDT+E GSKKY                     
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------

Query:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------
                                             NTLRHKTK+FSLE+LIT L+IEEEA+K D+K+E                              
Subjt:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------

Query:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY
                 ANL+E ELVAMI++VNVIG  EGWWLD  AS  VCHDLS+ RKYNE+ DK+I+LGDHH TKVVG+   EVEL+FTS KTLVL++  H    
Subjt:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY

Query:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL
         +N        L ++ ++  N   F +    + F+    N       S+ G          IQ+QDKEVDP+ RRSKRART+K+FGEDF+ YNVEDPKDL
Subjt:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL

Query:  TGALSSVDANLWQKAINDEMDYLESNRTWH
        T ALSSVDANLWQ+AINDE+   +   TW+
Subjt:  TGALSSVDANLWQKAINDEMDYLESNRTWH

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UA92 Uncharacterized protein (e value: 3.4e-86; % identity: 46.98)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------
        KV  AC+ EKPK+ E  PT+EQ+K+LT W ETDFICKNLILNGL DEL++YYSTM+T K+VW+ALQKKYDT+E GSKKY                     
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------

Query:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------
                                             NTLRHKTK+FSLE+L T LRIEEEA+K D+KEE                              
Subjt:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------

Query:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY
                 ANL+E ELVAMI++VNVIG  EGWWLD GAS  VCHDLS+ RKYNE+ DK+I+LGDHH TKVVG+   EVELKFTS KTLV+++  H    
Subjt:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY

Query:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL
         +N        L ++ ++  N   F +    + F+    N       S+ G          IQ+QDKEVDP+ RRSKRART+K+FGEDF+ YNVED KDL
Subjt:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL

Query:  TGALSSVDANLWQKAINDEMDYLESNRTWH
        T ALSSVDANLWQ+AINDE+   +   TW+
Subjt:  TGALSSVDANLWQKAINDEMDYLESNRTWH

A0A5D3BDS3 Uncharacterized protein (e value: 4.4e-86; % identity: 46.74)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------
        KV  AC+ EKPK+ E  PT+EQ+K+LT W ETDFICKNLILNGL DEL++YYSTM+T K+VW+ALQKKYDT+E GSKKY                     
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------

Query:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------
                                             NTLRHKTK+FSLE+LIT L+IEEEA+K D+K+E                              
Subjt:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEE------------------------------

Query:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY
                 ANL+E ELVAMI++VNVIG  EGWWLD  AS  VCHDLS+ RKYNE+ DK+I+LGDHH TKVVG+   EVEL+FTS KTLVL++  H    
Subjt:  ---------ANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGY

Query:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL
         +N        L ++ ++  N   F +    + F+    N       S+ G          IQ+QDKEVDP+ RRSKRART+K+FGEDF+ YNVEDPKDL
Subjt:  AENSKAYRFFDLENKVIIESNDVDFFE----DRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDL

Query:  TGALSSVDANLWQKAINDEMDYLESNRTWH
        T ALSSVDANLWQ+AINDE+   +   TW+
Subjt:  TGALSSVDANLWQKAINDEMDYLESNRTWH

A0A5D3BEH3 Uncharacterized protein (e value: 1.2e-70; % identity: 70.44)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV---NTLRHKTKDFSLESLIT
        KV  AC+ EK K+ E  PTEEQ+K+L  W ETDFICKNLILNGL +EL++YYSTM+TTKEVWDALQKKYDT+E  SKKYV   NTLRHKTK+FSLESLIT
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV---NTLRHKTKDFSLESLIT

Query:  CLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQE
         L IEE   K     +ANL+E ELVAMI++VN+IG  EGWWLD GAS  VC DLSL RKYNE+ DK+I+LGDHHTTKVVG+G  EVELKFTS KTLVL+E
Subjt:  CLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQE

Query:  EQH
          H
Subjt:  EQH

A0A5D3CVC3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.0e-78; % identity: 60.85)
Query:  EANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQH------------
        +ANL++ ELVAMI+KVNVI   EGWWLD GAS  VCHDLSL RKYNE+ DK+I+LGDHHTTKV G+G  EVELKFTS KTLVL+E  H            
Subjt:  EANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQH------------

Query:  -----------------------------------------------------HIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNSGSLNSQ
                                                              IGYAENSKAYRF+DLENKVIIESNDVDFFEDRF FKSRN       
Subjt:  -----------------------------------------------------HIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNSGSLNSQ

Query:  SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRT
         SGGSSSSSLPS++IQTQDKEVDPE RRSKRARTIK+FG+DF+MYNVEDPKDLT ALS VDANLWQ+AINDEMD LESNRT
Subjt:  SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRT

A0A5D3DC59 Reverse transcriptase Ty1/copia-type domain-containing protein (e value: 1.3e-74; % identity: 47.03)
Query:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------
        KV  AC+ EK K+ E  P EEQ+K+L  W ETDFICKNLILNGL DEL++YYSTM+T KEVWDALQKKYDTKE  SKKY                     
Subjt:  KVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYV--------------------

Query:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGE--FEGWWLDN
                                             NTLRHKTK+FSLESLIT L+IEEEARK D+KEE N + ++    + K ++  E   +   +  
Subjt:  -------------------------------------NTLRHKTKDFSLESLITCLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGE--FEGWWLDN

Query:  GASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNS
           P V       +KY           DHHTTK+ G+G  EVELKFTS KTLVL+E  H     +N  +    +         +D+        F  +  
Subjt:  GASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQEEQHHIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNS

Query:  GS-----LNSQ-SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIK----EFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRTWHL
         +     LN + +   SS+  L S  IQTQDKEVD E RRSKRART+K    +FGEDF+MYNVEDPKDLT ALSSVDANLWQ+AINDEMD LESNRTWHL
Subjt:  GS-----LNSQ-SSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIK----EFGEDFKMYNVEDPKDLTGALSSVDANLWQKAINDEMDYLESNRTWHL

Query:  VDLP
        VDLP
Subjt:  VDLP

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAACGATCAAACTATTGGCACAAAGAGAGCAGAGGTGTGAAACGTCTTAAAGAGAGCGCGCGATCCAAGGTTGTTGCCGCATGTAGCAATGAGAAACCAAAAATCCT
AGAGACAAAGCCAACTGAGGAGCAAATCAAGGATCTTACCGCATGGATAGAAACTGATTTCATATGTAAGAATTTAATTCTTAATGGTCTTATTGATGAATTGTTTGAAT
ATTATAGTACCATGTCTACCACGAAAGAAGTATGGGACGCGTTACAAAAGAAATATGATACTAAGGAAGTGGGATCCAAGAAGTATGTGAACACTCTAAGGCACAAAACC
AAGGATTTCTCGCTAGAAAGTCTTATCACATGTTTAAGGATAGAGGAGGAGGCGAGAAAGCCTGATCAAAAGGAGGAGGCGAACTTGGTAGAAAAAGAATTAGTAGCTAT
GATCACAAAAGTTAATGTGATTGGGGAGTTTGAAGGTTGGTGGCTAGACAATGGTGCATCTCCCCCTGTGTGTCATGACCTTAGCTTGCTTAGAAAATATAATGAGATCA
ATGATAAAAGTATCATTCTGGGAGATCATCACACAACCAAAGTGGTCGGCGTTGGAGAAGTAGAAGTAGAATTGAAATTCACATCTAGCAAGACGCTTGTGCTGCAGGAA
GAGCAGCACCATATTGGATACGCTGAAAATAGTAAAGCCTATAGGTTCTTTGACTTAGAGAACAAAGTAATCATAGAATCAAATGACGTAGATTTTTTTGAGGATAGATT
TTCTTTTAAATCTAGAAATAGTGGGAGCTTAAATAGTCAATCTAGTGGAGGCTCAAGTTCAAGTAGTCTACCTTCAGTTAAGATCCAAACCCAAGATAAGGAAGTTGATC
CTGAATGTAGAAGAAGCAAGAGAGCTAGAACCATAAAAGAGTTTGGAGAAGACTTCAAAATGTATAATGTAGAAGATCCAAAAGATCTAACAGGAGCCTTATCCTCAGTA
GATGCCAACTTATGGCAAAAAGCTATTAATGATGAAATGGACTATCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTTGGCTGTAA
mRNA sequence
ATGAAACGATCAAACTATTGGCACAAAGAGAGCAGAGGTGTGAAACGTCTTAAAGAGAGCGCGCGATCCAAGGTTGTTGCCGCATGTAGCAATGAGAAACCAAAAATCCT
AGAGACAAAGCCAACTGAGGAGCAAATCAAGGATCTTACCGCATGGATAGAAACTGATTTCATATGTAAGAATTTAATTCTTAATGGTCTTATTGATGAATTGTTTGAAT
ATTATAGTACCATGTCTACCACGAAAGAAGTATGGGACGCGTTACAAAAGAAATATGATACTAAGGAAGTGGGATCCAAGAAGTATGTGAACACTCTAAGGCACAAAACC
AAGGATTTCTCGCTAGAAAGTCTTATCACATGTTTAAGGATAGAGGAGGAGGCGAGAAAGCCTGATCAAAAGGAGGAGGCGAACTTGGTAGAAAAAGAATTAGTAGCTAT
GATCACAAAAGTTAATGTGATTGGGGAGTTTGAAGGTTGGTGGCTAGACAATGGTGCATCTCCCCCTGTGTGTCATGACCTTAGCTTGCTTAGAAAATATAATGAGATCA
ATGATAAAAGTATCATTCTGGGAGATCATCACACAACCAAAGTGGTCGGCGTTGGAGAAGTAGAAGTAGAATTGAAATTCACATCTAGCAAGACGCTTGTGCTGCAGGAA
GAGCAGCACCATATTGGATACGCTGAAAATAGTAAAGCCTATAGGTTCTTTGACTTAGAGAACAAAGTAATCATAGAATCAAATGACGTAGATTTTTTTGAGGATAGATT
TTCTTTTAAATCTAGAAATAGTGGGAGCTTAAATAGTCAATCTAGTGGAGGCTCAAGTTCAAGTAGTCTACCTTCAGTTAAGATCCAAACCCAAGATAAGGAAGTTGATC
CTGAATGTAGAAGAAGCAAGAGAGCTAGAACCATAAAAGAGTTTGGAGAAGACTTCAAAATGTATAATGTAGAAGATCCAAAAGATCTAACAGGAGCCTTATCCTCAGTA
GATGCCAACTTATGGCAAAAAGCTATTAATGATGAAATGGACTATCTTGAATCCAATAGAACTTGGCACCTAGTTGACTTACCCCCTTGGCTGTAA
Protein sequence
MKRSNYWHKESRGVKRLKESARSKVVAACSNEKPKILETKPTEEQIKDLTAWIETDFICKNLILNGLIDELFEYYSTMSTTKEVWDALQKKYDTKEVGSKKYVNTLRHKT
KDFSLESLITCLRIEEEARKPDQKEEANLVEKELVAMITKVNVIGEFEGWWLDNGASPPVCHDLSLLRKYNEINDKSIILGDHHTTKVVGVGEVEVELKFTSSKTLVLQE
EQHHIGYAENSKAYRFFDLENKVIIESNDVDFFEDRFSFKSRNSGSLNSQSSGGSSSSSLPSVKIQTQDKEVDPECRRSKRARTIKEFGEDFKMYNVEDPKDLTGALSSV
DANLWQKAINDEMDYLESNRTWHLVDLPPWL
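
The protein sequence above should follow from the CDS by codon-by-codon translation. The sketch below shows one way to check this, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are illustrative names, not part of CuGenDB, and only the first and last codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order
# with the third base varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS (whitespace and line breaks allowed) up to the first stop."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGAAACGATCAAACTATTGGCACAAAGAG"))  # MKRSNYWHKE
print(translate("CCCCCTTGGCTGTAA"))                 # PPWL
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.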