; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G14710 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G14710
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr5:15295117..15297419
RNA-Seq ExpressionCSPI05G14710
SyntenyCSPI05G14710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062382.1 myosin-2 heavy chain [Cucumis melo var. makuwa]2.8e-6695.14Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELG+IFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAI+RSFSDINELIKDMLD+KGKYTTVETELREMHDRYS+LS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRASKKLLNANNR SWS RGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

TYK26596.1 myosin-2 heavy chain [Cucumis melo var. makuwa]9.8e-6794.56Show/hide
Query:  VDARVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYS
        V  RVNEELG+IFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAI+RSFSDINELIKDMLD+KGKYTTVETELREMHDRYS
Subjt:  VDARVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYS

Query:  QLSLQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        +LSLQFAEVEGERQKLMMTVKNVRASKKLLNANNR SWS RGEHSPS
Subjt:  QLSLQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

XP_004140370.1 myosin heavy chain, skeletal muscle isoform X1 [Cucumis sativus]4.7e-6998.61Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLD+KGKYTTVETELREMHDRYSQLS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

XP_008460500.1 PREDICTED: myosin-2 heavy chain [Cucumis melo]2.8e-6695.14Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELG+IFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAI+RSFSDINELIKDMLD+KGKYTTVETELREMHDRYS+LS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRASKKLLNANNR SWS RGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

XP_011655223.1 myosin heavy chain, skeletal muscle isoform X2 [Cucumis sativus]4.7e-6998.61Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLD+KGKYTTVETELREMHDRYSQLS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

TrEMBL top hitse value%identityAlignment
A0A0A0KQA1 Uncharacterized protein2.3e-6998.61Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLD+KGKYTTVETELREMHDRYSQLS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

A0A1S3CD41 myosin-2 heavy chain1.4e-6695.14Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELG+IFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAI+RSFSDINELIKDMLD+KGKYTTVETELREMHDRYS+LS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRASKKLLNANNR SWS RGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

A0A5A7V2E5 Myosin-2 heavy chain1.4e-6695.14Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELG+IFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAI+RSFSDINELIKDMLD+KGKYTTVETELREMHDRYS+LS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRASKKLLNANNR SWS RGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

A0A5D3DSB6 Myosin-2 heavy chain4.7e-6794.56Show/hide
Query:  VDARVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYS
        V  RVNEELG+IFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAI+RSFSDINELIKDMLD+KGKYTTVETELREMHDRYS
Subjt:  VDARVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYS

Query:  QLSLQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        +LSLQFAEVEGERQKLMMTVKNVRASKKLLNANNR SWS RGEHSPS
Subjt:  QLSLQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

A0A6J1FEV0 centrosomal protein of 290 kDa-like isoform X12.6e-6593.06Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +VNEELGSIFPLFKEFSS GN+LERVLALEIELAEAL++KKKPS HFQSSFLKQHSDEEAI+RSFSDINELIKDMLD+KGKYTTVETELREMHDRYSQLS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS
        LQFAEVEGERQKLMMTVKNVRAS+KLLNANNR SWSSRGEHSPS
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-1522.42Show/hide
Query:  NKVCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMF----------------------------------------YKHTKNDKAWSFVKCKI
        + VCKL +++  LKQ+ + W E FE+A+   EF  +  D  ++                                        ++ T  ++   F+  +I
Subjt:  NKVCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMF----------------------------------------YKHTKNDKAWSFVKCKI

Query:  -----GILVNQRRYILDLLEETGLLGCQVAETPID---------------------------------PDIAFAVSMVSQFMHALGLAHFDAVYRILRHL
              I ++Q  Y+  +L +  +  C    TP+                                  PD+  AV+++S++        +  + R+LR+L
Subjt:  -----GILVNQRRYILDLLEETGLLGCQVAETPID---------------------------------PDIAFAVSMVSQFMHALGLAHFDAVYRILRHL

Query:  K----------------------------GSTTTRRATSGY----YSFVGGILLHGEVKNSVVASSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMC
        K                            GS   R++T+GY    + F   I  + + +NSV ASS EAE+ AL   + E +W++ LL  +       + 
Subjt:  K----------------------------GSTTTRRATSGY----YSFVGGILLHGEVKNSVVASSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMC

Query:  SYGDNKAAISIAHNPVLHDRTKHIEVDKYFIKEKVDARV
         Y DN+  ISIA+NP  H R KHI++  +F +E+V   V
Subjt:  SYGDNKAAISIAHNPVLHDRTKHIEVDKYFIKEKVDARV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.0e-1533.74Show/hide
Query:  PDIAFAVSMVSQFMHALGLAHFDAVYRILRHLKGST-------------------------TTRRATSGY-YSFVGG-ILLHGEVKNSVVASSVEAEFRA
        PDIA AV +VS+F+   G  H++AV  ILR+L+G+T                           R++++GY ++F GG I    +++  V  S+ EAE+ A
Subjt:  PDIAFAVSMVSQFMHALGLAHFDAVYRILRHLKGST-------------------------TTRRATSGY-YSFVGG-ILLHGEVKNSVVASSVEAEFRA

Query:  LAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNKAAISIAHNPVLHDRTKHIEVDKYFIKEKVD
              E IW++R L++L   Q   +  Y D+++AI ++ N + H RTKHI+V  ++I+E VD
Subjt:  LAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNKAAISIAHNPVLHDRTKHIEVDKYFIKEKVD

P92519 Uncharacterized mitochondrial protein AtMg008107.4e-0927.17Show/hide
Query:  GILVNQRRYILDLLEETGLLGCQVAETPID--------------------------------PDIAFAVSMVSQFMHALGLAHFDAVYRILRHLKGS---
        G+ ++Q +Y   +L   G+L C+   TP+                                 PDI++AV++V Q MH   LA FD + R+LR++KG+   
Subjt:  GILVNQRRYILDLLEETGLLGCQVAETPID--------------------------------PDIAFAVSMVSQFMHALGLAHFDAVYRILRHLKGS---

Query:  -----------------------TTTRRATSGYYSFVGGILLHGEVKN--SVVASSVEAEFRALAHGICEGIW
                               T+TRR+T+G+ +F+G  ++    K   +V  SS E E+RALA    E  W
Subjt:  -----------------------TTTRRATSGYYSFVGGILLHGEVKN--SVVASSVEAEFRALAHGICEGIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-1926.59Show/hide
Query:  NKVCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMFYKHTKNDKAWSFV-------------------------------------------K
        N VCKL+++L  LKQ+P+AW       +    F  + +D ++F         +  V                                           +
Subjt:  NKVCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMFYKHTKNDKAWSFV-------------------------------------------K

Query:  CKIGILVNQRRYILDLLEETGLLGCQVAETPI---------------------------------DPDIAFAVSMVSQFMHALGLAHFDAVYRILRHL--
           G+ ++QRRYILDLL  T ++  +   TP+                                  PDI++AV+ +SQFMH     H  A+ RILR+L  
Subjt:  CKIGILVNQRRYILDLLEETGLLGCQVAETPI---------------------------------DPDIAFAVSMVSQFMHALGLAHFDAVYRILRHL--

Query:  ---------KGSTTTRRA---------------TSGYYSFVG--GILLHGEVKNSVVASSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNK
                 KG+T +  A               T+GY  ++G   I    + +  VV SS EAE+R++A+   E  WI  LL +L    TR    Y DN 
Subjt:  ---------KGSTTTRRA---------------TSGYYSFVG--GILLHGEVKNSVVASSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNK

Query:  AAISIAHNPVLHDRTKHIEVDKYFIKEKVDA
         A  +  NPV H R KHI +D +FI+ +V +
Subjt:  AAISIAHNPVLHDRTKHIEVDKYFIKEKVDA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-1625.23Show/hide
Query:  VCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMF---------------------------YKHTKNDKAWSF----------------VKCK
        VC+L++++  LKQ+P+AW       +    F  + +D ++F                            KHT +  +  F                 +  
Subjt:  VCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMF---------------------------YKHTKNDKAWSF----------------VKCK

Query:  IGILVNQRRYILDLLEETGLLGCQVAETPI---------------------------------DPDIAFAVSMVSQFMHALGLAHFDAVYRILRHL----
         G+ ++QRRY LDLL  T +L  +   TP+                                  PD+++AV+ +SQ+MH     H++A+ R+LR+L    
Subjt:  IGILVNQRRYILDLLEETGLLGCQVAETPI---------------------------------DPDIAFAVSMVSQFMHALGLAHFDAVYRILRHL----

Query:  -------KGSTTTRRA---------------TSGYYSFVG--GILLHGEVKNSVVASSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNKAA
               KG+T +  A               T+GY  ++G   I    + +  VV SS EAE+R++A+   E  WI  LL +L    +     Y DN  A
Subjt:  -------KGSTTTRRA---------------TSGYYSFVG--GILLHGEVKNSVVASSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNKAA

Query:  ISIAHNPVLHDRTKHIEVDKYFIKEKVDA
          +  NPV H R KHI +D +FI+ +V +
Subjt:  ISIAHNPVLHDRTKHIEVDKYFIKEKVDA

Arabidopsis top hitse value%identityAlignment
AT1G22060.1 LOCATED IN: vacuole2.9e-4873.05Show/hide
Query:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS
        +  EEL SIFPL +E  S GNALERVLALEIELAEALR KKK + HFQSSFLKQH+D+EAI++SF DIN LI++MLD KG+Y+++ETELREMHDRYSQLS
Subjt:  RVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLS

Query:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEH
        L+FAEVEGERQKLMMT+KNVRASKK +   NR S ++ GEH
Subjt:  LQFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEH

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-3030.49Show/hide
Query:  NKVCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMFYKHT------------------KNDKAWSFVKCKI----------------------
        N VC LK+S+  LKQ+ + W   F   +  + F Q+ +DHT F K T                   ND A   +K ++                      
Subjt:  NKVCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMFYKHT------------------KNDKAWSFVKCKI----------------------

Query:  ---GILVNQRRYILDLLEETGLLGCQVAETPIDP---------------------------------DIAFAVSMVSQFMHALGLAHFDAVYRILRHLKG
           GI + QR+Y LDLL+ETGLLGC+ +  P+DP                                 DI+FAV+ +SQF  A  LAH  AV +IL ++KG
Subjt:  ---GILVNQRRYILDLLEETGLLGCQVAETPIDP---------------------------------DIAFAVSMVSQFMHALGLAHFDAVYRILRHLKG

Query:  ST--------------------------TTRRATSGYYSFVGGILLHGEVKNSVVA--SSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNK
        +                            TRR+T+GY  F+G  L+  + K   V   SS EAE+RAL+    E +W+ +   +L+   ++    + DN 
Subjt:  ST--------------------------TTRRATSGYYSFVGGILLHGEVKNSVVA--SSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNK

Query:  AAISIAHNPVLHDRTKHIEVDKYFIKEK
        AAI IA N V H+RTKHIE D + ++E+
Subjt:  AAISIAHNPVLHDRTKHIEVDKYFIKEK

AT5G41140.1 Myosin heavy chain-related protein7.6e-0932.41Show/hide
Query:  KVDARVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSK--KKPSMHFQSSFLKQHSDE-----EAIYRSFSDI---------NELIKDMLDIKG
        K + R NE+   I  L  +     NALE    + IE  + L+++  +  +   + S   Q +DE     EAI   ++++          +L+ ++  ++ 
Subjt:  KVDARVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSK--KKPSMHFQSSFLKQHSDE-----EAIYRSFSDI---------NELIKDMLDIKG

Query:  KYTTVETELREMHDRYSQLSLQFAEVEGERQKLMMTVKNVRASKK
        +   +ETEL+EM +RYS++SL+FAEVEGERQ+L+MTV+ ++ +KK
Subjt:  KYTTVETELREMHDRYSQLSLQFAEVEGERQKLMMTVKNVRASKK

AT5G52280.1 Myosin heavy chain-related protein5.8e-0947.46Show/hide
Query:  DINELIKDMLDIKGKYTTVETELREMHDRYSQLSLQFAEVEGERQKLMMTVKNVRASKK
        ++++L  ++   K K +++E EL+EM +RYS++SL+FAEVEGERQ+L+M V+N++  KK
Subjt:  DINELIKDMLDIKGKYTTVETELREMHDRYSQLSLQFAEVEGERQKLMMTVKNVRASKK

ATMG00810.1 DNA/RNA polymerases superfamily protein5.3e-1027.17Show/hide
Query:  GILVNQRRYILDLLEETGLLGCQVAETPID--------------------------------PDIAFAVSMVSQFMHALGLAHFDAVYRILRHLKGS---
        G+ ++Q +Y   +L   G+L C+   TP+                                 PDI++AV++V Q MH   LA FD + R+LR++KG+   
Subjt:  GILVNQRRYILDLLEETGLLGCQVAETPID--------------------------------PDIAFAVSMVSQFMHALGLAHFDAVYRILRHLKGS---

Query:  -----------------------TTTRRATSGYYSFVGGILLHGEVKN--SVVASSVEAEFRALAHGICEGIW
                               T+TRR+T+G+ +F+G  ++    K   +V  SS E E+RALA    E  W
Subjt:  -----------------------TTTRRATSGYYSFVGGILLHGEVKN--SVVASSVEAEFRALAHGICEGIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTAACAAAGTGTGCAAGTTAAAGAGATCATTAGATCGTCTCAAACAATCTCCTAAAGCCTGGTCTGAATGCTTTGAAAAAGCAGTCACAAACTATGAATTCAGTCA
AAATCGAGCCGATCATACGATGTTCTATAAACATACAAAAAATGACAAGGCATGGAGTTTTGTCAAGTGCAAAATTGGAATTCTTGTCAACCAAAGGAGGTATATTCTTG
ATCTGTTAGAAGAGACAGGTTTACTTGGTTGCCAGGTAGCCGAAACTCCGATTGATCCTGACATCGCTTTTGCAGTTAGTATGGTAAGTCAGTTCATGCATGCTCTTGGG
CTAGCTCACTTTGATGCAGTTTATAGAATCCTAAGACATTTGAAAGGTAGCACGACTACTAGAAGAGCCACGTCTGGTTACTACTCTTTTGTTGGAGGAATTTTGTTACA
TGGCGAAGTAAAAAACAGTGTGGTTGCAAGTAGTGTAGAAGCAGAATTTAGAGCTTTAGCCCATGGTATTTGTGAAGGTATATGGATAAGAAGACTTTTGGAAAAATTGA
GATTCACTCAGACGAGGCTCATGTGCAGTTACGGTGACAACAAGGCAGCAATTTCCATTGCCCACAATCCAGTCCTTCATGACAGGACGAAACATATTGAAGTTGATAAA
TACTTTATAAAGGAAAAGGTCGATGCAAGAGTGAATGAAGAACTAGGAAGCATATTCCCCTTGTTCAAGGAATTTTCGAGCAGTGGCAATGCTTTAGAAAGGGTACTAGC
TCTAGAGATCGAGCTTGCTGAAGCTTTGCGGTCAAAAAAGAAACCAAGTATGCATTTTCAGAGTTCTTTCTTGAAGCAACACAGTGATGAAGAAGCGATATATCGAAGCT
TTAGCGACATCAATGAGCTAATAAAAGACATGTTAGATATAAAGGGAAAGTACACAACTGTAGAGACTGAACTGAGAGAGATGCATGATCGTTACTCCCAGTTAAGCCTC
CAGTTTGCTGAGGTTGAAGGGGAGAGACAGAAACTCATGATGACTGTCAAGAATGTCCGAGCATCCAAGAAGCTTCTCAACGCCAATAATCGACTCTCATGGTCATCCCG
GGGGGAGCATTCTCCTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTAACAAAGTGTGCAAGTTAAAGAGATCATTAGATCGTCTCAAACAATCTCCTAAAGCCTGGTCTGAATGCTTTGAAAAAGCAGTCACAAACTATGAATTCAGTCA
AAATCGAGCCGATCATACGATGTTCTATAAACATACAAAAAATGACAAGGCATGGAGTTTTGTCAAGTGCAAAATTGGAATTCTTGTCAACCAAAGGAGGTATATTCTTG
ATCTGTTAGAAGAGACAGGTTTACTTGGTTGCCAGGTAGCCGAAACTCCGATTGATCCTGACATCGCTTTTGCAGTTAGTATGGTAAGTCAGTTCATGCATGCTCTTGGG
CTAGCTCACTTTGATGCAGTTTATAGAATCCTAAGACATTTGAAAGGTAGCACGACTACTAGAAGAGCCACGTCTGGTTACTACTCTTTTGTTGGAGGAATTTTGTTACA
TGGCGAAGTAAAAAACAGTGTGGTTGCAAGTAGTGTAGAAGCAGAATTTAGAGCTTTAGCCCATGGTATTTGTGAAGGTATATGGATAAGAAGACTTTTGGAAAAATTGA
GATTCACTCAGACGAGGCTCATGTGCAGTTACGGTGACAACAAGGCAGCAATTTCCATTGCCCACAATCCAGTCCTTCATGACAGGACGAAACATATTGAAGTTGATAAA
TACTTTATAAAGGAAAAGGTCGATGCAAGAGTGAATGAAGAACTAGGAAGCATATTCCCCTTGTTCAAGGAATTTTCGAGCAGTGGCAATGCTTTAGAAAGGGTACTAGC
TCTAGAGATCGAGCTTGCTGAAGCTTTGCGGTCAAAAAAGAAACCAAGTATGCATTTTCAGAGTTCTTTCTTGAAGCAACACAGTGATGAAGAAGCGATATATCGAAGCT
TTAGCGACATCAATGAGCTAATAAAAGACATGTTAGATATAAAGGGAAAGTACACAACTGTAGAGACTGAACTGAGAGAGATGCATGATCGTTACTCCCAGTTAAGCCTC
CAGTTTGCTGAGGTTGAAGGGGAGAGACAGAAACTCATGATGACTGTCAAGAATGTCCGAGCATCCAAGAAGCTTCTCAACGCCAATAATCGACTCTCATGGTCATCCCG
GGGGGAGCATTCTCCTTCATAACTTCTTGGCTTCCTAAGATAAGTCTCTCTTGTCTCTTTCACTAGTTGTTGAAATCGCATTCAGGTCAAATGTGACACCAAAGGCTTCA
TCTTTGGCTTTCTGCATCACAACACCAAGAAGAATATCGACAGGCGATACTGTCTCCAGGGATCGAGCTGCTACGAGTTTAGAGTCAGCTGCACGATAAAGATGCAACAG
TTTTGTGTAAATAAGAAAGCTTCTGCACCTATTCATTCACCAGTTGGTAGAGTATTATCAAGTATAGAATCTTCTTTTCTACTTCAATGTAATTTTTTCTGTACAGTAAA
TGTAATATCTTCATGGGTAGATGACTTAACTCTCCAAATATTATGAAGAGAAAGCAATGAAACCATGTCATTATACATCTTTGCCCTTCCTCTCTGC
Protein sequenceShow/hide protein sequence
MINKVCKLKRSLDRLKQSPKAWSECFEKAVTNYEFSQNRADHTMFYKHTKNDKAWSFVKCKIGILVNQRRYILDLLEETGLLGCQVAETPIDPDIAFAVSMVSQFMHALG
LAHFDAVYRILRHLKGSTTTRRATSGYYSFVGGILLHGEVKNSVVASSVEAEFRALAHGICEGIWIRRLLEKLRFTQTRLMCSYGDNKAAISIAHNPVLHDRTKHIEVDK
YFIKEKVDARVNEELGSIFPLFKEFSSSGNALERVLALEIELAEALRSKKKPSMHFQSSFLKQHSDEEAIYRSFSDINELIKDMLDIKGKYTTVETELREMHDRYSQLSL
QFAEVEGERQKLMMTVKNVRASKKLLNANNRLSWSSRGEHSPS