; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G40100 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G40100
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:34350756..34351379
RNA-Seq ExpressionCSPI03G40100
SyntenyCSPI03G40100
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034981.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa]1.2e-9074.4Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        ML+YKDRL++SKTS L+ TI HT HD V GGHSGFLRT KRL GELYWEG+K D+K+YCE+C++ QKN+SLAL+P  LLLPLE+PN VWSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKSNGFE  FVVVDRFS YGHFL +KHP+T K+VA+LF KEIV+LHGYPKSIVSDRD +FLS+FWKE+F+++ T+L+RSTAYH Q+DGQTEVV+RG+ETY
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

KAA0051400.1 putative retroelement pol polyprotein [Cucumis melo var. makuwa]6.0e-9074.4Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K+RL+LSKTS LI TI HTYHDSV GGHSGFL+T KR+ GELYW+GMK D++KYCEEC++ QKN+S AL+PTGLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS+G+EVI VVVDR S Y HFLTLKHP+T KTVAE+FVKEIV+LHG+PKSIVSDRD +FLS+FW EMFRL+ T+LNRS++YH QTDGQTEVV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

KAA0066118.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.6e-8872.95Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K RL+LSK S LI TI HTYHDSV GGHSGFLRT KR+ GELYW+GMK D++KYC+EC++ QKN+S AL+P GLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS G+EVI VVVDR S Y HFLTLKHP+T KTVAE+FVKE+V+LHG+PKSIVSDRD +FLS+FW EMFRL+ T+LNRS++YH QTDGQTEVV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

TYK16931.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-8872.95Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K RL+LSKTS LI TI HTYHDSV GGHSGFLRT KR+ GELYW+GMK D++KYC+EC++ QKN+S AL+P GLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS+G+EVI VVVDR S Y HFLTLKHP+T KTVAE+FVKE+V+LHG+PKSIVSDRD +FLS+FW EMF+L+ T+LNRS++YH QTDGQTEVV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

TYK23171.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.5e-8873.91Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K RL+LSKTS LI TI HTYHDSV GGHSGFLRT KR+ GELYW GMKSDIKKYCEEC++ Q+N++ AL+PTGLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS+G+EVI VVVDR S Y HFLTLKHP+T KTV E+FVKE+V+LH YPKSIVSDRD +FLS+FW EMFRL+ T+LNRS++YH QTDGQT+VV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

TrEMBL top hitse value%identityAlignment
A0A5A7U8A5 Putative retroelement pol polyprotein2.9e-9074.4Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K+RL+LSKTS LI TI HTYHDSV GGHSGFL+T KR+ GELYW+GMK D++KYCEEC++ QKN+S AL+PTGLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS+G+EVI VVVDR S Y HFLTLKHP+T KTVAE+FVKEIV+LHG+PKSIVSDRD +FLS+FW EMFRL+ T+LNRS++YH QTDGQTEVV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

A0A5D3BRN6 Transposon Tf2-9 polyprotein5.8e-9174.4Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        ML+YKDRL++SKTS L+ TI HT HD V GGHSGFLRT KRL GELYWEG+K D+K+YCE+C++ QKN+SLAL+P  LLLPLE+PN VWSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKSNGFE  FVVVDRFS YGHFL +KHP+T K+VA+LF KEIV+LHGYPKSIVSDRD +FLS+FWKE+F+++ T+L+RSTAYH Q+DGQTEVV+RG+ETY
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

A0A5D3CTA1 Ty3/gypsy retrotransposon protein2.7e-8872.95Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K RL+LSK S LI TI HTYHDSV GGHSGFLRT KR+ GELYW+GMK D++KYC+EC++ QKN+S AL+P GLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS G+EVI VVVDR S Y HFLTLKHP+T KTVAE+FVKE+V+LHG+PKSIVSDRD +FLS+FW EMFRL+ T+LNRS++YH QTDGQTEVV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

A0A5D3CY75 Ty3/gypsy retrotransposon protein9.3e-8972.95Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K RL+LSKTS LI TI HTYHDSV GGHSGFLRT KR+ GELYW+GMK D++KYC+EC++ QKN+S AL+P GLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS+G+EVI VVVDR S Y HFLTLKHP+T KTVAE+FVKE+V+LHG+PKSIVSDRD +FLS+FW EMF+L+ T+LNRS++YH QTDGQTEVV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

A0A5D3DHM2 Ty3/gypsy retrotransposon protein7.1e-8973.91Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        +L++K RL+LSKTS LI TI HTYHDSV GGHSGFLRT KR+ GELYW GMKSDIKKYCEEC++ Q+N++ AL+PTGLLLPLEIP+ +WSD+SMD IEGL
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        PKS+G+EVI VVVDR S Y HFLTLKHP+T KTV E+FVKE+V+LH YPKSIVSDRD +FLS+FW EMFRL+ T+LNRS++YH QTDGQT+VV++ VE Y
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFCGE
        LRCFCGE
Subjt:  LRCFCGE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.6e-2932.68Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        ++  KD+++L   + L  TI   YH+     H G       +     W+G++  I++Y + C   Q N+S    P G L P+      W  +SMD I  L
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        P+S+G+  +FVVVDRFS     +      T +  A +F + ++   G PK I++D D++F S  WK+     +  +  S  Y  QTDGQTE  ++ VE  
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFC
        LRC C
Subjt:  LRCFC

P0CT35 Transposon Tf2-2 polyprotein3.6e-2932.68Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        ++  KD+++L   + L  TI   YH+     H G       +     W+G++  I++Y + C   Q N+S    P G L P+      W  +SMD I  L
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        P+S+G+  +FVVVDRFS     +      T +  A +F + ++   G PK I++D D++F S  WK+     +  +  S  Y  QTDGQTE  ++ VE  
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFC
        LRC C
Subjt:  LRCFC

P0CT36 Transposon Tf2-3 polyprotein3.6e-2932.68Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        ++  KD+++L   + L  TI   YH+     H G       +     W+G++  I++Y + C   Q N+S    P G L P+      W  +SMD I  L
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        P+S+G+  +FVVVDRFS     +      T +  A +F + ++   G PK I++D D++F S  WK+     +  +  S  Y  QTDGQTE  ++ VE  
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFC
        LRC C
Subjt:  LRCFC

P0CT41 Transposon Tf2-12 polyprotein3.6e-2932.68Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        ++  KD+++L   + L  TI   YH+     H G       +     W+G++  I++Y + C   Q N+S    P G L P+      W  +SMD I  L
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        P+S+G+  +FVVVDRFS     +      T +  A +F + ++   G PK I++D D++F S  WK+     +  +  S  Y  QTDGQTE  ++ VE  
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFC
        LRC C
Subjt:  LRCFC

Q9UR07 Transposon Tf2-11 polyprotein3.6e-2932.68Show/hide
Query:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL
        ++  KD+++L   + L  TI   YH+     H G       +     W+G++  I++Y + C   Q N+S    P G L P+      W  +SMD I  L
Subjt:  MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGL

Query:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY
        P+S+G+  +FVVVDRFS     +      T +  A +F + ++   G PK I++D D++F S  WK+     +  +  S  Y  QTDGQTE  ++ VE  
Subjt:  PKSNGFEVIFVVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETY

Query:  LRCFC
        LRC C
Subjt:  LRCFC

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGATACAAGGATAGGTTAATATTGTCCAAAACTTCAGCATTAATTACGACAATATCACATACCTATCATGATTCAGTGTTAGGAGGGCACTCTGGTTTTTTACG
AACTTGTAAGAGGCTAACTGGGGAGCTGTACTGGGAAGGAATGAAATCTGATATTAAGAAATATTGTGAGGAGTGTGTGGTTTTTCAGAAGAATAGGTCATTGGCACTGA
CACCAACAGGGTTATTACTACCATTAGAAATTCCAAATGGTGTTTGGAGTGATGTATCAATGGACATTATTGAGGGATTGCCTAAGTCAAATGGATTTGAAGTCATATTT
GTGGTGGTGGATAGGTTCAGTACATATGGACACTTCCTAACTCTTAAGCATCCTTTTACGGTGAAGACAGTGGCAGAGTTGTTCGTGAAGGAAATTGTTCAACTACATGG
GTATCCAAAGTCAATAGTATCTGACAGGGACAATGTATTCTTGAGTAACTTTTGGAAGGAGATGTTCAGGTTGTCTAGTACCAGATTGAACCGAAGCACAGCTTACCACC
TACAAACAGATGGCCAAACTGAGGTAGTCCACAGGGGCGTAGAGACTTACTTAAGATGCTTCTGTGGAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGATACAAGGATAGGTTAATATTGTCCAAAACTTCAGCATTAATTACGACAATATCACATACCTATCATGATTCAGTGTTAGGAGGGCACTCTGGTTTTTTACG
AACTTGTAAGAGGCTAACTGGGGAGCTGTACTGGGAAGGAATGAAATCTGATATTAAGAAATATTGTGAGGAGTGTGTGGTTTTTCAGAAGAATAGGTCATTGGCACTGA
CACCAACAGGGTTATTACTACCATTAGAAATTCCAAATGGTGTTTGGAGTGATGTATCAATGGACATTATTGAGGGATTGCCTAAGTCAAATGGATTTGAAGTCATATTT
GTGGTGGTGGATAGGTTCAGTACATATGGACACTTCCTAACTCTTAAGCATCCTTTTACGGTGAAGACAGTGGCAGAGTTGTTCGTGAAGGAAATTGTTCAACTACATGG
GTATCCAAAGTCAATAGTATCTGACAGGGACAATGTATTCTTGAGTAACTTTTGGAAGGAGATGTTCAGGTTGTCTAGTACCAGATTGAACCGAAGCACAGCTTACCACC
TACAAACAGATGGCCAAACTGAGGTAGTCCACAGGGGCGTAGAGACTTACTTAAGATGCTTCTGTGGAGAATGA
Protein sequenceShow/hide protein sequence
MLRYKDRLILSKTSALITTISHTYHDSVLGGHSGFLRTCKRLTGELYWEGMKSDIKKYCEECVVFQKNRSLALTPTGLLLPLEIPNGVWSDVSMDIIEGLPKSNGFEVIF
VVVDRFSTYGHFLTLKHPFTVKTVAELFVKEIVQLHGYPKSIVSDRDNVFLSNFWKEMFRLSSTRLNRSTAYHLQTDGQTEVVHRGVETYLRCFCGE