CuGenDBv2

Cmc09g0249351 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0249351
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr09:15111733..15112482
RNA-Seq Expression: Cmc09g0249351
Synteny: Cmc09g0249351
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
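The genome location string can be split into a sequence ID and a coordinate pair. The following is a minimal sketch; the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CMiso1.1chr09:15111733..15112482")
print(seqid, span_length(start, end))
```

Under that assumption the span is 750 bp, the same length as the CDS listed for this gene.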


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.6e-121, % identity: 88.76)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MTLKVGTGDVI+A AVGDAKLFFGN+FMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGV+IC+AKLENNLY+LRPNEAK VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRISP   NNTYLWHLRLGHINLDRIGRLVKNGLLN+LKD SLPPCE CLE KMTKRPFT KGYRAK PLELIHSDLCGPMNVKA+GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY RYG LYLMEHKSEA +KFKE K E+ENLLSKKIKILRSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.5e-119, % identity: 87.55)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MTLKVGTGDVI+A AVGDAKLFFGN+FMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGV+IC+AKLENNLY+LRPNEAK VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRISP   NNTYLWHLRLGHINLDRIGRLVK+GLLN+LKD SLPPCE CLE KMTKRPFT KGYRAK PLELIHSDLCGPMNVKA+G FEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY RYG LYLMEHKSEA +KFKE K E+ENLLSKKIKI RSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

KAA0046415.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.9e-90, % identity: 67.47)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MT++VGTG VI+A AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI KNGV IC+AKLENNLY+LR   +K +LN EMF+T
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        A TQNK+ +ISPK+  N +LWHLRLGHINL+RI RLVKNGLL+EL+++SLP CE CLE KMTKRPFT KG+RAK PLEL+HSDLCGPMNVKA+GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        +F  DY RYG +YLM+HKSEA +KFKE KAE+EN LSK IK  RSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 9.9e-116, % identity: 85.14)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        M LKVGTGDVI+A AVGDAKLFFGN+FMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFISKNGV+IC+ KLE+NLY+L+PNE K VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRIS    NNTYLWHLRLGHINLDRIGRLVKNGLLN+L+D SLPPCE CLE KMTKRPFT KGYRAK PLELIHSDLCGPMNVKA GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY  YG LYL+EHKSEA +KFKE K E+ENLLSKKIKILRSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.1e-112, % identity: 84.74)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MTL VGTGDVI+A AVGD KLFFG +FMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFISKNG     AKLE+NLY+LRPNEAK VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRISP   NNTYLWHLRL HINLDRIGRLVKNGLLN+LKD SLPPCE CLE KMTKRPFT K YRAK PLELIHSDLCGPMNVKA+GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY RYG LYLMEHK EA +KFKE K E+ENLLSKKIKILRSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein (e value: 1.2e-119, % identity: 87.55)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MTLKVGTGDVI+A AVGDAKLFFGN+FMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGV+IC+AKLENNLY+LRPNEAK VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRISP   NNTYLWHLRLGHINLDRIGRLVK+GLLN+LKD SLPPCE CLE KMTKRPFT KGYRAK PLELIHSDLCGPMNVKA+G FEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY RYG LYLMEHKSEA +KFKE K E+ENLLSKKIKI RSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

A0A5A7TYF5 Gag/pol protein (e value: 9.1e-91, % identity: 67.47)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MT++VGTG VI+A AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++N+ FI KNGV IC+AKLENNLY+LR   +K +LN EMF+T
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        A TQNK+ +ISPK+  N +LWHLRLGHINL+RI RLVKNGLL+EL+++SLP CE CLE KMTKRPFT KG+RAK PLEL+HSDLCGPMNVKA+GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        +F  DY RYG +YLM+HKSEA +KFKE KAE+EN LSK IK  RSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

A0A5A7TZD0 Gag/pol protein (e value: 2.2e-121, % identity: 88.76)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MTLKVGTGDVI+A AVGDAKLFFGN+FMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGV+IC+AKLENNLY+LRPNEAK VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRISP   NNTYLWHLRLGHINLDRIGRLVKNGLLN+LKD SLPPCE CLE KMTKRPFT KGYRAK PLELIHSDLCGPMNVKA+GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY RYG LYLMEHKSEA +KFKE K E+ENLLSKKIKILRSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

A0A5A7VJG3 Gag/pol protein (e value: 2.5e-112, % identity: 84.74)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        MTL VGTGDVI+A AVGD KLFFG +FMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFISKNG     AKLE+NLY+LRPNEAK VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRISP   NNTYLWHLRL HINLDRIGRLVKNGLLN+LKD SLPPCE CLE KMTKRPFT K YRAK PLELIHSDLCGPMNVKA+GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY RYG LYLMEHK EA +KFKE K E+ENLLSKKIKILRSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

A0A5D3BNE1 Gag/pol protein (e value: 4.8e-116, % identity: 85.14)
Query:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT
        M LKVGTGDVI+A AVGDAKLFFGN+FMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFISKNGV+IC+ KLE+NLY+L+PNE K VLNHEMFRT
Subjt:  MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRT

Query:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI
        ANTQNK+QRIS    NNTYLWHLRLGHINLDRIGRLVKNGLLN+L+D SLPPCE CLE KMTKRPFT KGYRAK PLELIHSDLCGPMNVKA GGFEYFI
Subjt:  ANTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFI

Query:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
        SFI DY  YG LYL+EHKSEA +KFKE K E+ENLLSKKIKILRSD+GG
Subjt:  SFIGDYLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 3.2e-16, % identity: 30.26)
Query:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICA-AKLENNLYILRPNEAKTVLNHEMFRTANTQNKKQRISPKKKNNTYLWHLRLGHI
        LE++    +   NL+SV  L E   SI F  +   ISKNG+ +   + + NN+          V+N + +           I+ K KNN  LWH R GHI
Subjt:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICA-AKLENNLYILRPNEAKTVLNHEMFRTANTQNKKQRISPKKKNNTYLWHLRLGHI

Query:  N------LDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPF--TRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFISFIGDYLRYGNLYLMEHKSE
        +      + R        LLN L + S   CE CL  K  + PF   +     K PL ++HSD+CGP+         YF+ F+  +  Y   YL+++KS+
Subjt:  N------LDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPF--TRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFISFIGDYLRYGNLYLMEHKSE

Query:  APKKFKENKAEIENLLSKKIKILRSDQG
            F++  A+ E   + K+  L  D G
Subjt:  APKKFKENKAEIENLLSKKIKILRSDQG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.8e-20, % identity: 27.46)
Query:  GTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRTANTQN
        G GD+     VG          + L+++  VP ++ NL+S   L    Y   F+  +  ++K  + I                AK V    ++RT     
Subjt:  GTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRTANTQN

Query:  KKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFISFIGD
        + +  + + + +  LWH R+GH++   +  L K  L++  K  ++ PC++CL  K  +  F     R    L+L++SD+CGPM +++ GG +YF++FI D
Subjt:  KKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFISFIGD

Query:  YLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG
          R   +Y+++ K +  + F++  A +E    +K+K LRSD GG
Subjt:  YLRYGNLYLMEHKSEAPKKFKENKAEIENLLSKKIKILRSDQGG

P93293 Uncharacterized mitochondrial protein AtMg00300 (e value: 3.5e-07, % identity: 33.71)
Query:  NTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNV
        + +  +  ++   K+ T LWH RL H++   +  LVK G L+  K  SL  CE C+  K  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNV

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 2.5e-08, % identity: 33.71)
Query:  NTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNV
        + +  +  ++   K+ T LWH RL H++   +  LVK G L+  K  SL  CE C+  K  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NTQNKKQRISPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNV
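
The unlabeled middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how identity counts can be read off that line, the hypothetical helper `midline_stats` below tallies one block. It assumes the midline string is passed at full width, with no leading or trailing spaces trimmed.

```python
def midline_stats(midline: str):
    """Return (identities, positives, aligned_length) for one BLAST midline."""
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, len(midline)


# Final block of the alignment against KAA0025945.1, copied from above.
midline = "SFI DY RYG LYLMEHKSEA +KFKE K E+ENLLSKKIKILRSD+GG"
identities, positives, length = midline_stats(midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

The % identity values reported for each hit cover the whole alignment, so a figure computed from a single block will generally differ from them.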


Sequences
CDS sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTGCAGCTCACGCAGTAGGAGATGCTAAGTTGTTTTTTGGAAATAGATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTATATATTTGTG
CGGCTAAGCTCGAAAACAACTTGTATATATTAAGACCTAATGAAGCCAAAACAGTTTTAAATCATGAGATGTTTAGAACTGCTAACACTCAAAATAAAAAGCAAAGAATT
TCTCCAAAAAAAAAAAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACGAGTTAAAAGA
TCATTCATTACCTCCATGTGAATTTTGTCTTGAAGAAAAAATGACAAAGAGACCTTTTACTAGAAAAGGTTATAGAGCCAAAGCGCCTTTAGAACTTATACATTCAGACC
TCTGTGGTCCAATGAATGTAAAAGCTAAAGGGGGTTTTGAATACTTCATCTCTTTTATAGGTGATTATTTGAGGTATGGTAATTTATACTTAATGGAGCATAAGTCTGAA
GCTCCTAAAAAGTTTAAGGAGAATAAGGCTGAAATTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCAAGGTGGATAG
mRNA sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTGCAGCTCACGCAGTAGGAGATGCTAAGTTGTTTTTTGGAAATAGATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTATATATTTGTG
CGGCTAAGCTCGAAAACAACTTGTATATATTAAGACCTAATGAAGCCAAAACAGTTTTAAATCATGAGATGTTTAGAACTGCTAACACTCAAAATAAAAAGCAAAGAATT
TCTCCAAAAAAAAAAAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACGAGTTAAAAGA
TCATTCATTACCTCCATGTGAATTTTGTCTTGAAGAAAAAATGACAAAGAGACCTTTTACTAGAAAAGGTTATAGAGCCAAAGCGCCTTTAGAACTTATACATTCAGACC
TCTGTGGTCCAATGAATGTAAAAGCTAAAGGGGGTTTTGAATACTTCATCTCTTTTATAGGTGATTATTTGAGGTATGGTAATTTATACTTAATGGAGCATAAGTCTGAA
GCTCCTAAAAAGTTTAAGGAGAATAAGGCTGAAATTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCAAGGTGGATAG
Protein sequence
MTLKVGTGDVIAAHAVGDAKLFFGNRFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVYICAAKLENNLYILRPNEAKTVLNHEMFRTANTQNKKQRI
SPKKKNNTYLWHLRLGHINLDRIGRLVKNGLLNELKDHSLPPCEFCLEEKMTKRPFTRKGYRAKAPLELIHSDLCGPMNVKAKGGFEYFISFIGDYLRYGNLYLMEHKSE
APKKFKENKAEIENLLSKKIKILRSDQGG
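
The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper, and the two fragments are the first 30 and last 15 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACACTCAAGGTTGGAACGGGAGATGTC"))  # start of the CDS
print(translate("GATCAAGGTGGATAG"))                 # end of the CDS, including the stop codon
```

The two fragments translate to `MTLKVGTGDV` and `DQGG`, matching the start and end of the protein sequence listed above.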