; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G20960 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G20960
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr4:19383552..19384852
RNA-Seq ExpressionCSPI04G20960
SyntenyCSPI04G20960
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032617.1 gag protease polyprotein [Cucumis melo var. makuwa]1.3e-7051.27Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DP A VT ADLAAMEQR+ DL+ +   QQ+P   T  AL   PA       A    P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A MWLSS+ETIF YMKC  DQKVQC + +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVE------------------------------------------EPATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQ +M VE                                           PAT ADALR+ ++LSL ER+  +    +GST+GQKRKAEQ P  +  R
Subjt:  LEQDNMIVE------------------------------------------EPATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQE
        +  S G FR  QQ+
Subjt:  DSSSSGTFRHHQQE

KAA0046094.1 gag protease polyprotein [Cucumis melo var. makuwa]2.4e-7247.46Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DPT  VT ADLAAMEQR+ DL+ +   QQ+P   T      TPA       A    P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A MWLSS+ETIF YMKC  DQKVQC + +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQ +M VE+                                          PAT A+ALR+ ++LSL ER+  +    +GST+GQKRKAEQ P  +  R
Subjt:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQEVIKTEG-------QEGHLANHFPQGVVATNRQGGEKPAHK
        +  S G FR  QQ+  + E        QEGH A+  P  +    +  G    H+
Subjt:  DSSSSGTFRHHQQEVIKTEG-------QEGHLANHFPQGVVATNRQGGEKPAHK

KAA0046185.1 pol protein [Cucumis melo var. makuwa]4.0e-7251.56Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPA-ARVQAPIVTQNLPDMLSAEAKH
        RRGGR  RGR   R QP+ Q   +  DP A VT ADLAAMEQR+ DL+ +   QQQP      AL  TPA  VQ PA A    P+  Q +PD LSAE+KH
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPA-ARVQAPIVTQNLPDMLSAEAKH

Query:  LKDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFL
        L+DFRKYNP TFDGSL DP++A +WLSS+ETIF YMKC  DQKVQCV+ +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL
Subjt:  LKDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFL

Query:  KLEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITF
         LEQ +M VE+                                          PAT ADALR+ ++LSL ER+  +    +GST+GQKRKAEQ P  +  
Subjt:  KLEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITF

Query:  RDSSSSGTFRHHQQEVIKTE
        R+  S G FR  QQ+  + E
Subjt:  RDSSSSGTFRHHQQEVIKTE

KAA0062141.1 pol protein [Cucumis melo var. makuwa]1.3e-7051.27Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DP A VT ADLAAMEQR+ DL+ +   QQQP      A    PA       A  Q P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A +WLSS+ETIF YMKC  DQKVQC I +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQD+M VE+                                          PAT ADALR+ ++LSL ER+  +    +GST+G+KRKAEQ P     R
Subjt:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQE
        +  S G FR  QQ+
Subjt:  DSSSSGTFRHHQQE

KAA0066035.1 gag protease polyprotein [Cucumis melo var. makuwa]4.4e-7150.96Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DPTA VT  DLAAMEQR+ DL+ +   QQQP           PA       A    P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A +WLSS+ETIF YMKC  DQKVQC + +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQD+M VE+                                          PAT ADALR+ ++LSL ER+  + A  +GST+GQKRKAEQ P  +  R
Subjt:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQE
        +  S G FR  QQ+
Subjt:  DSSSSGTFRHHQQE

TrEMBL top hitse value%identityAlignment
A0A5A7STC8 Gag protease polyprotein6.3e-7151.27Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DP A VT ADLAAMEQR+ DL+ +   QQ+P   T  AL   PA       A    P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A MWLSS+ETIF YMKC  DQKVQC + +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVE------------------------------------------EPATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQ +M VE                                           PAT ADALR+ ++LSL ER+  +    +GST+GQKRKAEQ P  +  R
Subjt:  LEQDNMIVE------------------------------------------EPATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQE
        +  S G FR  QQ+
Subjt:  DSSSSGTFRHHQQE

A0A5A7TSQ8 Reverse transcriptase1.1e-7247.46Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DPT  VT ADLAAMEQR+ DL+ +   QQ+P   T      TPA       A    P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A MWLSS+ETIF YMKC  DQKVQC + +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQ +M VE+                                          PAT A+ALR+ ++LSL ER+  +    +GST+GQKRKAEQ P  +  R
Subjt:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQEVIKTEG-------QEGHLANHFPQGVVATNRQGGEKPAHK
        +  S G FR  QQ+  + E        QEGH A+  P  +    +  G    H+
Subjt:  DSSSSGTFRHHQQEVIKTEG-------QEGHLANHFPQGVVATNRQGGEKPAHK

A0A5A7TXM6 Reverse transcriptase1.9e-7251.56Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPA-ARVQAPIVTQNLPDMLSAEAKH
        RRGGR  RGR   R QP+ Q   +  DP A VT ADLAAMEQR+ DL+ +   QQQP      AL  TPA  VQ PA A    P+  Q +PD LSAE+KH
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPA-ARVQAPIVTQNLPDMLSAEAKH

Query:  LKDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFL
        L+DFRKYNP TFDGSL DP++A +WLSS+ETIF YMKC  DQKVQCV+ +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL
Subjt:  LKDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFL

Query:  KLEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITF
         LEQ +M VE+                                          PAT ADALR+ ++LSL ER+  +    +GST+GQKRKAEQ P  +  
Subjt:  KLEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITF

Query:  RDSSSSGTFRHHQQEVIKTE
        R+  S G FR  QQ+  + E
Subjt:  RDSSSSGTFRHHQQEVIKTE

A0A5A7V8X5 Pol protein6.3e-7151.27Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DP A VT ADLAAMEQR+ DL+ +   QQQP      A    PA       A  Q P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A +WLSS+ETIF YMKC  DQKVQC I +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQD+M VE+                                          PAT ADALR+ ++LSL ER+  +    +GST+G+KRKAEQ P     R
Subjt:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQE
        +  S G FR  QQ+
Subjt:  DSSSSGTFRHHQQE

A0A5A7VJX7 Gag protease polyprotein2.1e-7150.96Show/hide
Query:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL
        RRGGR  RGR   R QP+ Q   Q  DPTA VT  DLAAMEQR+ DL+ +   QQQP           PA       A    P+  Q +PD LSAEAKHL
Subjt:  RRGGR--RGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIVTQNLPDMLSAEAKHL

Query:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK
        +DFRKYNP TFDGSL DP++A +WLSS+ETIF YMKC  DQKVQC + +LTDRG  WW++ ERMLGGD+SQITW+QFKE+F AKFFSA++RDAK QEFL 
Subjt:  KDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKCQEFLK

Query:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR
        LEQD+M VE+                                          PAT ADALR+ ++LSL ER+  + A  +GST+GQKRKAEQ P  +  R
Subjt:  LEQDNMIVEE------------------------------------------PATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFR

Query:  DSSSSGTFRHHQQE
        +  S G FR  QQ+
Subjt:  DSSSSGTFRHHQQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTAATATGGTCATTCGTCACTTGCCAGAACTTTTGGTGGTGGAATTTAGGAAACATGTCACTATGCAGAAGTGTGTTCGTAGAGGTGGTAGGAGAGGCAGAGAAGT
TATACGTAACCAGCCTAAAGGACAACATGCTATACAAGTTGTCGACCCTACTGCATATGTTACGCAAGCAGACCTTGCTGCCATGGAGCAAAGGTACATAGACTTGTTGT
TCGAAGCATTGGCACAACAACAGCCTGTCCAGCAGACCCAGATAGCTCTTGTTCAAACACCAGCTACAGATGTTCAGATCCCAGCTGCAAGAGTTCAGGCCCCAATTGTA
ACCCAGAACCTACCTGATATGCTTTCAGCAGAAGCCAAACACCTGAAGGATTTCAGAAAGTATAACCCTCGAACTTTTGATGGATCCTTGGCAGACCCCAGCAAGGCACA
TATGTGGTTGAGTTCGGTGGAGACTATCTTTTGCTACATGAAGTGCTCCAACGACCAGAAAGTTCAGTGCGTCATTTGCTTACTGACAGATAGGGGCAGAGGCTGGTGGA
AGTCCGCAGAGAGGATGTTGGGTGGAGATATGAGTCAAATCACCTGGGAGCAATTTAAGGAGAATTTTTCTGCCAAATTCTTCTCCGCCACAGTGAGAGATGCCAAGTGC
CAGGAGTTTCTAAAGCTGGAGCAAGACAATATGATCGTTGAGGAACCAGCTACAAAGGCTGATGCACTACGCATGACAATGAATTTGAGTTTGCATGAGAGGTCGGAGTT
GGCCAATGCTACAGAGAAGGGATCAACTACAGGACAAAAGAGGAAGGCTGAGCAGCATCCTGCTAACATTACCTTTAGAGATTCAAGTTCAAGTGGCACCTTCCGTCATC
ACCAACAGGAGGTCATTAAAACAGAAGGCCAAGAAGGGCACTTAGCTAACCATTTCCCTCAAGGTGTTGTTGCCACTAATCGGCAGGGGGGCGAGAAGCCTGCACATAAA
TGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAATATGGTCATTCGTCACTTGCCAGAACTTTTGGTGGTGGAATTTAGGAAACATGTCACTATGCAGAAGTGTGTTCGTAGAGGTGGTAGGAGAGGCAGAGAAGT
TATACGTAACCAGCCTAAAGGACAACATGCTATACAAGTTGTCGACCCTACTGCATATGTTACGCAAGCAGACCTTGCTGCCATGGAGCAAAGGTACATAGACTTGTTGT
TCGAAGCATTGGCACAACAACAGCCTGTCCAGCAGACCCAGATAGCTCTTGTTCAAACACCAGCTACAGATGTTCAGATCCCAGCTGCAAGAGTTCAGGCCCCAATTGTA
ACCCAGAACCTACCTGATATGCTTTCAGCAGAAGCCAAACACCTGAAGGATTTCAGAAAGTATAACCCTCGAACTTTTGATGGATCCTTGGCAGACCCCAGCAAGGCACA
TATGTGGTTGAGTTCGGTGGAGACTATCTTTTGCTACATGAAGTGCTCCAACGACCAGAAAGTTCAGTGCGTCATTTGCTTACTGACAGATAGGGGCAGAGGCTGGTGGA
AGTCCGCAGAGAGGATGTTGGGTGGAGATATGAGTCAAATCACCTGGGAGCAATTTAAGGAGAATTTTTCTGCCAAATTCTTCTCCGCCACAGTGAGAGATGCCAAGTGC
CAGGAGTTTCTAAAGCTGGAGCAAGACAATATGATCGTTGAGGAACCAGCTACAAAGGCTGATGCACTACGCATGACAATGAATTTGAGTTTGCATGAGAGGTCGGAGTT
GGCCAATGCTACAGAGAAGGGATCAACTACAGGACAAAAGAGGAAGGCTGAGCAGCATCCTGCTAACATTACCTTTAGAGATTCAAGTTCAAGTGGCACCTTCCGTCATC
ACCAACAGGAGGTCATTAAAACAGAAGGCCAAGAAGGGCACTTAGCTAACCATTTCCCTCAAGGTGTTGTTGCCACTAATCGGCAGGGGGGCGAGAAGCCTGCACATAAA
TGA
Protein sequenceShow/hide protein sequence
MVNMVIRHLPELLVVEFRKHVTMQKCVRRGGRRGREVIRNQPKGQHAIQVVDPTAYVTQADLAAMEQRYIDLLFEALAQQQPVQQTQIALVQTPATDVQIPAARVQAPIV
TQNLPDMLSAEAKHLKDFRKYNPRTFDGSLADPSKAHMWLSSVETIFCYMKCSNDQKVQCVICLLTDRGRGWWKSAERMLGGDMSQITWEQFKENFSAKFFSATVRDAKC
QEFLKLEQDNMIVEEPATKADALRMTMNLSLHERSELANATEKGSTTGQKRKAEQHPANITFRDSSSSGTFRHHQQEVIKTEGQEGHLANHFPQGVVATNRQGGEKPAHK