; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0019491 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0019491
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPol protein
Genome locationCMiso1.1chr01:17347409..17348091
RNA-Seq ExpressionCmc01g0019491
SyntenyCmc01g0019491
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO45752.1 pol protein [Cucumis melo subsp. melo]6.5e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYG+ CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

KAA0035938.1 pol protein [Cucumis melo var. makuwa]6.5e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWD+HLHLMEFAYNN +QATIGMAPFEALYGK CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

KAA0043401.1 pol protein [Cucumis melo var. makuwa]6.5e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYG+ CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

KAA0056300.1 pol protein [Cucumis melo var. makuwa]6.5e-10078.48Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFVS-------------------------
        MKREVAEFVS+CLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHFV                          
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFVS-------------------------

Query:  -----DRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
             DRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DML ACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYGK CRSPVC
Subjt:  -----DRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

KAA0066456.1 pol protein [Cucumis melo var. makuwa]6.5e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYG+ CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

TrEMBL top hitse value%identityAlignment
A0A5A7SZD6 Pol protein3.1e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWD+HLHLMEFAYNN +QATIGMAPFEALYGK CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

A0A5A7TPA9 Pol protein3.1e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYG+ CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

A0A5A7U7V9 Reverse transcriptase3.1e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYG+ CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

A0A5A7UMD7 Pol protein3.1e-10078.48Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFVS-------------------------
        MKREVAEFVS+CLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHFV                          
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFVS-------------------------

Query:  -----DRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
             DRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DML ACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYGK CRSPVC
Subjt:  -----DRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

A0A5A7VJE2 Reverse transcriptase3.1e-10078.9Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------
        MKREVAEFVSKCLVCQQVK PRQ+PAGLLQPLS+PEWKWENV MDFITGLPRTLRGFTVI VVVD+LTKSAHF                           
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHF---------------------------

Query:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
           VSDRDARFTSKFWKGLQTAM T LDFSTAFHPQTD QTERLNQVL+DMLRACA +FPGSWDSHLHLMEFAYNN +QATIGMAPFEALYG+ CRSPVC
Subjt:  ---VSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR
        WGEVGEQRLMGPELVQSTNEAIQKIRS MHTAQS ++
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRR

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.6e-2431.94Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSA-----------------------------
        +++++ E+V  C  CQ  K+   +P G LQP+   E  WE++ MDFIT LP +  G+  + VVVD+ +K A                             
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSA-----------------------------

Query:  -HFVSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALY
           ++D D  FTS+ WK         + FS  + PQTD QTER NQ ++ +LR   S  P +W  H+ L++ +YNN   +   M PFE ++
Subjt:  -HFVSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALY

P0CT41 Transposon Tf2-12 polyprotein1.6e-2431.94Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSA-----------------------------
        +++++ E+V  C  CQ  K+   +P G LQP+   E  WE++ MDFIT LP +  G+  + VVVD+ +K A                             
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSA-----------------------------

Query:  -HFVSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALY
           ++D D  FTS+ WK         + FS  + PQTD QTER NQ ++ +LR   S  P +W  H+ L++ +YNN   +   M PFE ++
Subjt:  -HFVSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.2e-2631.3Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFV--------------------------
        ++  + +++  C+ CQ +K+ R R  GLLQPL + E +W ++ MDF+TGLP T     +I VVVD+ +K AHF+                          
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFV--------------------------

Query:  ----SDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
            SDRD R T+  ++ L   +      S+A HPQTD Q+ER  Q L  +LRA  S    +W  +L  +EF YN+    T+G +PFE   G    +P  
Subjt:  ----SDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WG--EVGEQRLMGPELVQ-------STNEAIQKIRSCMHTAQSSRR
            EV  +     EL +        T E ++  +  M T  + RR
Subjt:  WG--EVGEQRLMGPELVQ-------STNEAIQKIRSCMHTAQSSRR

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.6e-2731.71Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFV--------------------------
        ++  + +++  C+ CQ +K+ R R  GLLQPL + E +W ++ MDF+TGLP T     +I VVVD+ +K AHF+                          
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFV--------------------------

Query:  ----SDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC
            SDRD R T+  ++ L   +      S+A HPQTD Q+ER  Q L  +LRA AS    +W  +L  +EF YN+    T+G +PFE   G    +P  
Subjt:  ----SDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVC

Query:  WG--EVGEQRLMGPELVQ-------STNEAIQKIRSCMHTAQSSRR
            EV  +     EL +        T E ++  +  M T  + RR
Subjt:  WG--EVGEQRLMGPELVQ-------STNEAIQKIRSCMHTAQSSRR

Q9UR07 Transposon Tf2-11 polyprotein1.6e-2431.94Show/hide
Query:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSA-----------------------------
        +++++ E+V  C  CQ  K+   +P G LQP+   E  WE++ MDFIT LP +  G+  + VVVD+ +K A                             
Subjt:  MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSA-----------------------------

Query:  -HFVSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALY
           ++D D  FTS+ WK         + FS  + PQTD QTER NQ ++ +LR   S  P +W  H+ L++ +YNN   +   M PFE ++
Subjt:  -HFVSDRDARFTSKFWKGLQTAMATGLDFSTAFHPQTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGAGAGGTGGCAGAATTTGTTAGTAAATGTTTGGTGTGTCAGCAGGTTAAAACACCAAGGCAGAGACCAGCGGGTTTATTACAACCCTTGAGCGTACCA
GAATGGAAGTGGGAAAATGTGTACATGGATTTCATTACAGGACTGCCAAGAACTCTGAGGGGTTTTACAGTGATTAGGGTTGTGGTTGACAAGCTTACCAAATCA
GCGCACTTCGTTTCGGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGCCACAGGGTTAGACTTTAGTACAGCTTTCCATCCA
CAGACTGACGATCAGACTGAGCGTCTGAACCAAGTTTTAAAGGATATGTTGCGAGCGTGTGCATCAAAATTTCCAGGTAGCTGGGACTCTCACTTGCATTTGATG
GAATTTGCTTATAATAACAGATTTCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGAAAACGTTGTAGATCCCCTGTTTGCTGGGGTGAGGTGGGT
GAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAAATTAGATCATGTATGCATACCGCACAGAGTAGCAGAAGAGTTATGCGG
ATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGAGAGGTGGCAGAATTTGTTAGTAAATGTTTGGTGTGTCAGCAGGTTAAAACACCAAGGCAGAGACCAGCGGGTTTATTACAACCCTTGAGCGTACCA
GAATGGAAGTGGGAAAATGTGTACATGGATTTCATTACAGGACTGCCAAGAACTCTGAGGGGTTTTACAGTGATTAGGGTTGTGGTTGACAAGCTTACCAAATCA
GCGCACTTCGTTTCGGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGCCACAGGGTTAGACTTTAGTACAGCTTTCCATCCA
CAGACTGACGATCAGACTGAGCGTCTGAACCAAGTTTTAAAGGATATGTTGCGAGCGTGTGCATCAAAATTTCCAGGTAGCTGGGACTCTCACTTGCATTTGATG
GAATTTGCTTATAATAACAGATTTCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGAAAACGTTGTAGATCCCCTGTTTGCTGGGGTGAGGTGGGT
GAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAAATTAGATCATGTATGCATACCGCACAGAGTAGCAGAAGAGTTATGCGG
ATGTGA
Protein sequenceShow/hide protein sequence
MKREVAEFVSKCLVCQQVKTPRQRPAGLLQPLSVPEWKWENVYMDFITGLPRTLRGFTVIRVVVDKLTKSAHFVSDRDARFTSKFWKGLQTAMATGLDFSTAFHP
QTDDQTERLNQVLKDMLRACASKFPGSWDSHLHLMEFAYNNRFQATIGMAPFEALYGKRCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSCMHTAQSSRRVMR
M