; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G11170 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G11170
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationClcChr11:16605810..16606754
RNA-Seq ExpressionClc11G11170
SyntenyClc11G11170
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.9e-9258.97Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ A+GAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKGTENQ+ADHLSRLE+       + I + FPDE ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + IL  CH +PYGGHF   RTAAK+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]9.7e-9158.28Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ AIGAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKG ENQ+ADHLSRLE+       + I + FPDE ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + I   CH +PYGGHF R RTAAK+LQSG+FWP LFKD  ++   CDRCQRTGNIS R+EMPL ++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.1e-9258.62Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ A+GAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKGTENQ+ADHLSRLE+       + I + FPDE ++    S+ PWYADIVNYL C   P + + QQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + IL  CH +PYGGHF   RTAAK+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.9e-9258.97Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ AIGAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKGTENQ+ADHLSRLE+       + I + FPDE ++    S+ PWYADIVNYL C   P + + QQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + IL  CH +PYGGHF   RTAAK+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

XP_012829396.1 PREDICTED: uncharacterized protein LOC105950575 [Erythranthe guttata]2.0e-8855.02Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        NW  PFE+MCDAS++A+GAVLGQR++KI   IYY+S+ L+ +Q+NY+TTEKEMLA+V+                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGPH
        LQEFDLEI+D+KG+EN VADHLSRL  +EV     +I+E FPDE ++   +  PWYAD+ N+L     P++ +  QKKK  H+S+FY WDEP L+R GP 
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGPH

Query:  HILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         ++RRCV E E   IL  CH +P GGH G  RTAAKVLQSG+FWPTLF+D+  +   CDRCQRTGN+SN+++MPLN+M EVELFDVWG+
Subjt:  HILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase1.9e-9258.97Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ A+GAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKGTENQ+ADHLSRLE+       + I + FPDE ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + IL  CH +PYGGHF   RTAAK+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

A0A2G9HBV9 DNA-directed DNA polymerase3.7e-8857.24Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W LPFELMCDAS+ A+GAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DA P LI WV L
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKGTENQ+ADHLSRLE+       + I + F DE ++    S  PWYADIVNYL C   P + +AQQKKK+  +++ Y WD+ +L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + IL  CH +PYGGHF   RTAAK+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

A0A2G9HWF8 Reverse transcriptase4.7e-9158.28Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ AIGAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKG ENQ+ADHLSRLE+       + I + FPDE ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + I   CH +PYGGHF R RTAAK+LQSG+FWP LFKD  ++   CDRCQRTGNIS R+EMPL ++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

A0A2G9HYA0 Reverse transcriptase2.5e-9258.62Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ A+GAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKGTENQ+ADHLSRLE+       + I + FPDE ++    S+ PWYADIVNYL C   P + + QQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + IL  CH +PYGGHF   RTAAK+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

A0A2G9HYD8 Reverse transcriptase1.9e-9258.97Show/hide
Query:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL
        +W  PFELMCDAS+ AIGAVLGQRK+KI   IYYASK LN +Q NYTTTEKE+LA+VF                              DAKPRLIRWVLL
Subjt:  NWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLL

Query:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP
        LQEFDLEI+DRKGTENQ+ADHLSRLE+       + I + FPDE ++    S+ PWYADIVNYL C   P + + QQKKK   +++ Y WD+P+L++ GP
Subjt:  LQEFDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGP

Query:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
         +ILRRCV E E + IL  CH +PYGGHF   RTAAK+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWG+
Subjt:  HHILRRCVSEYETHSILRSCHEAPYGGHFGRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.6e-1138.28Show/hide
Query:  FELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLLLQEFD
        F L  DAS+ A+GAVL Q      HP+ Y S+ LN  + NY+T EKE+LAIV+                              D   +L RW + L EFD
Subjt:  FELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFT-----------------------------DAKPRLIRWVLLLQEFD

Query:  LEIKDRKGTENQVADHLSRLENKEVQVS
         +IK  KG EN VAD LSR++ +E  +S
Subjt:  LEIKDRKGTENQVADHLSRLENKEVQVS

P20825 Retrovirus-related Pol polyprotein from transposon 2977.4e-0937.1Show/hide
Query:  FELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVF-----------------TDAKP------------RLIRWVLLLQEFD
        F L  DASN A+GAVL Q      HPI + S+ LN  + NY+  EKE+LAIV+                 +D +P            +L RW + L E+ 
Subjt:  FELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVF-----------------TDAKP------------RLIRWVLLLQEFD

Query:  LEIKDRKGTENQVADHLSRLENKE
         +I   KG EN VAD LSR++ +E
Subjt:  LEIKDRKGTENQVADHLSRLENKE

P92516 Uncharacterized mitochondrial protein AtMg007506.9e-1560.38Show/hide
Query:  VLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
        VLQ+G++WPT FKDA  +  +CD CQR GN + RNEMP + +LEVE+FDVWG+
Subjt:  VLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.2e-1236.88Show/hide
Query:  PFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAI------------------VFTDAKP------------RLIRWVLLLQE
        PF L  DASN AIGAVL Q  +    PI Y S+ LN ++ENY T EKEMLAI                  V+TD +P            +L RW   ++E
Subjt:  PFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAI------------------VFTDAKP------------RLIRWVLLLQE

Query:  FDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDE
        ++ E+  + G  N VAD LSR+  +  Q+S +D++    D+
Subjt:  FDLEIKDRKGTENQVADHLSRLENKEVQVSWSDIEERFPDE

Q9UR07 Transposon Tf2-11 polyprotein2.8e-0834.15Show/hide
Query:  LMCDASNHAIGAVLGQR-KEKIMHPIYYASKPLNASQENYTTTEKEMLAI-------------------VFTDAK--------------PRLIRWVLLLQ
        L  DAS+ A+GAVL Q+  +   +P+ Y S  ++ +Q NY+ ++KEMLAI                   + TD +               RL RW L LQ
Subjt:  LMCDASNHAIGAVLGQR-KEKIMHPIYYASKPLNASQENYTTTEKEMLAI-------------------VFTDAK--------------PRLIRWVLLLQ

Query:  EFDLEIKDRKGTENQVADHLSRL
        +F+ EI  R G+ N +AD LSR+
Subjt:  EFDLEIKDRKGTENQVADHLSRL

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein4.9e-1660.38Show/hide
Query:  VLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV
        VLQ+G++WPT FKDA  +  +CD CQR GN + RNEMP + +LEVE+FDVWG+
Subjt:  VLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTAAATGCGTTCGAGTCTCTAAGGCAAGCTCTAATTTCAGCACCAATTTTAGTTGCACCAACTGGTTTCTCCCATTTGAATTAATGTGCGATGCAAGCAACCATGC
AATAGGAGCAGTATTGGGGCAAAGAAAAGAGAAAATAATGCACCCCATCTATTATGCTAGTAAACCATTGAATGCATCTCAGGAGAACTACACTACTACTGAGAAGGAAA
TGTTAGCCATAGTCTTTACGGACGCAAAGCCTAGACTCATCCGCTGGGTCCTGTTGTTACAAGAATTTGACTTGGAGATTAAAGACAGAAAGGGAACCGAGAATCAGGTT
GCGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAAGTAAGCTGGAGTGATATAGAGGAACGATTCCCAGACGAGCATGTCATGAACGCAGAGAGTCAGGAACCATG
GTATGCAGACATAGTAAATTACCTGGTCTGCAACCAATGGCCTGAAGAATTCAATGCTCAACAAAAGAAAAAGCTCCAACATGAAAGTAAGTTCTACTGCTGGGATGAGC
CATATCTATACAGACTTGGCCCTCACCATATCCTGCGTCGATGTGTTTCAGAATATGAAACGCATAGCATTTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTT
GGAAGGCAGAGAACAGCTGCAAAGGTGTTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACGCAAGGGCATATGCGGTGGCTTGTGATCGTTGTCAGAGAACAGG
CAACATTTCCAACCGAAATGAGATGCCTCTGAATTCAATGCTGGAAGTTGAGTTGTTTGACGTATGGGGTGTAACACCCTGTTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTAAATGCGTTCGAGTCTCTAAGGCAAGCTCTAATTTCAGCACCAATTTTAGTTGCACCAACTGGTTTCTCCCATTTGAATTAATGTGCGATGCAAGCAACCATGC
AATAGGAGCAGTATTGGGGCAAAGAAAAGAGAAAATAATGCACCCCATCTATTATGCTAGTAAACCATTGAATGCATCTCAGGAGAACTACACTACTACTGAGAAGGAAA
TGTTAGCCATAGTCTTTACGGACGCAAAGCCTAGACTCATCCGCTGGGTCCTGTTGTTACAAGAATTTGACTTGGAGATTAAAGACAGAAAGGGAACCGAGAATCAGGTT
GCGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAAGTAAGCTGGAGTGATATAGAGGAACGATTCCCAGACGAGCATGTCATGAACGCAGAGAGTCAGGAACCATG
GTATGCAGACATAGTAAATTACCTGGTCTGCAACCAATGGCCTGAAGAATTCAATGCTCAACAAAAGAAAAAGCTCCAACATGAAAGTAAGTTCTACTGCTGGGATGAGC
CATATCTATACAGACTTGGCCCTCACCATATCCTGCGTCGATGTGTTTCAGAATATGAAACGCATAGCATTTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTT
GGAAGGCAGAGAACAGCTGCAAAGGTGTTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACGCAAGGGCATATGCGGTGGCTTGTGATCGTTGTCAGAGAACAGG
CAACATTTCCAACCGAAATGAGATGCCTCTGAATTCAATGCTGGAAGTTGAGTTGTTTGACGTATGGGGTGTAACACCCTGTTCCTGA
Protein sequenceShow/hide protein sequence
MPKCVRVSKASSNFSTNFSCTNWFLPFELMCDASNHAIGAVLGQRKEKIMHPIYYASKPLNASQENYTTTEKEMLAIVFTDAKPRLIRWVLLLQEFDLEIKDRKGTENQV
ADHLSRLENKEVQVSWSDIEERFPDEHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLQHESKFYCWDEPYLYRLGPHHILRRCVSEYETHSILRSCHEAPYGGHF
GRQRTAAKVLQSGYFWPTLFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGVTPCS