; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0169491 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0169491
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr06:24034973..24035692
RNA-Seq ExpressionCmc06g0169491
SyntenyCmc06g0169491
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-11394.93Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLEN+LYVLRPNEAKAVLNHEM RTANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKN
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPL+LIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKS+ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKILRSDRGGEYMDLRF
Subjt:  IKILRSDRGGEYMDLRF

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-11193.55Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLEN+LYVLRPNEAKAVLNHEM RTANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVK+
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPL+LIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKS+ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKI RSDRGGEYMDL F
Subjt:  IKILRSDRGGEYMDLRF

KAA0037509.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-9093.33Show/hide
Query:  AKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEP
        AKLEN+LYVLRPNEAKAVLNHEM RT NTQNKRQRISPNNNTYLWHLRL HINLDRI RLVKNGLLN+LE+DSLPPCESCLEGKMTKRPFTGKGYRAKEP
Subjt:  AKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEP

Query:  LKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRF
        L+LIHSDLCG MNVKAR DFEYFISFIDDY RYGYLYLMEHKSKALEKFK+YKAEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  LKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRF

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]4.6e-11092.17Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLVSVSCLIEHMYSI+FSMNEAFISKNGVHICS KLE++LYVL+PNE KAVLNHEM RTANTQNKRQRIS NNNTYLWHLRL HINLDRIGRLVKN
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPL+LIHSDLCGPMNVKA G FEYFISFIDDYS YGYLYL+EHKS+ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKILRSDRGGEYMDLRF
Subjt:  IKILRSDRGGEYMDLRF

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-10791.71Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLV VSCLIEHMYSINFSMNEAFISKNG     AKLE++LYVLRPNEAKAVLNHEM RTANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKN
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKL+DDSLPPCESCLEGKMTKRPFTGK YRAKEPL+LIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHK +ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKILRSDRGGEYMDLRF
Subjt:  IKILRSDRGGEYMDLRF

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein9.0e-11293.55Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLEN+LYVLRPNEAKAVLNHEM RTANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVK+
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPL+LIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKS+ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKI RSDRGGEYMDL F
Subjt:  IKILRSDRGGEYMDLRF

A0A5A7T820 Gag/pol protein8.7e-9193.33Show/hide
Query:  AKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEP
        AKLEN+LYVLRPNEAKAVLNHEM RT NTQNKRQRISPNNNTYLWHLRL HINLDRI RLVKNGLLN+LE+DSLPPCESCLEGKMTKRPFTGKGYRAKEP
Subjt:  AKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEP

Query:  LKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRF
        L+LIHSDLCG MNVKAR DFEYFISFIDDY RYGYLYLMEHKSKALEKFK+YKAEVENLLSKKIKILRSDRGGEYMDLRF
Subjt:  LKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRF

A0A5A7TZD0 Gag/pol protein1.3e-11394.93Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKLEN+LYVLRPNEAKAVLNHEM RTANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKN
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPL+LIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKS+ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKILRSDRGGEYMDLRF
Subjt:  IKILRSDRGGEYMDLRF

A0A5A7VJG3 Gag/pol protein2.3e-10791.71Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLV VSCLIEHMYSINFSMNEAFISKNG     AKLE++LYVLRPNEAKAVLNHEM RTANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKN
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKL+DDSLPPCESCLEGKMTKRPFTGK YRAKEPL+LIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHK +ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKILRSDRGGEYMDLRF
Subjt:  IKILRSDRGGEYMDLRF

A0A5D3BNE1 Gag/pol protein2.2e-11092.17Show/hide
Query:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN
        K KRNLVSVSCLIEHMYSI+FSMNEAFISKNGVHICS KLE++LYVL+PNE KAVLNHEM RTANTQNKRQRIS NNNTYLWHLRL HINLDRIGRLVKN
Subjt:  KTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKN

Query:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK
        GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPL+LIHSDLCGPMNVKA G FEYFISFIDDYS YGYLYL+EHKS+ALEKFKEYK EVENLLSKK
Subjt:  GLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKK

Query:  IKILRSDRGGEYMDLRF
        IKILRSDRGGEYMDLRF
Subjt:  IKILRSDRGGEYMDLRF

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.3e-1832.88Show/hide
Query:  NLVSVSCLIEHMYSINFSMNEAFISKNGVHIC--SAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHIN------LDRIGR
        NL+SV  L E   SI F  +   ISKNG+ +   S  L N            V+N + + + N ++K       NN  LWH R  HI+      + R   
Subjt:  NLVSVSCLIEHMYSINFSMNEAFISKNGVHIC--SAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHIN------LDRIGR

Query:  LVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVE
             LLN LE  S   CE CL GK  + PF     +   K PL ++HSD+CGP+      D  YF+ F+D ++ Y   YL+++KS     F+++ A+ E
Subjt:  LVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRA--KEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVE

Query:  NLLSKKIKILRSDRGGEYM
           + K+  L  D G EY+
Subjt:  NLLSKKIKILRSDRGGEYM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-2236.5Show/hide
Query:  LWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKS
        LWH R+ H++   +  L K  L++  +  ++ PC+ CL GK  +  F     R    L L++SD+CGPM +++ G  +YF++FIDD SR  ++Y+++ K 
Subjt:  LWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKS

Query:  KALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRF
        +  + F+++ A VE    +K+K LRSD GGEY    F
Subjt:  KALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLRF

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.3e-1026.91Show/hide
Query:  NLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTY-LWHLRLSHINLDRIGRLVKNGLL
        +L+S+S L     +  F+ N      +G  +       D Y L  ++   + +H    T N  NK +  S N   Y L H  L H N   I + +K   +
Subjt:  NLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTY-LWHLRLSHINLDRIGRLVKNGLL

Query:  NKLEDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLM--EHKSKALEKFKEY
          L++  +         C  CL GK TK     KG R K     EP + +H+D+ GP++   +    YFISF D+ +R+ ++Y +    +   L  F   
Subjt:  NKLEDDSLP-------PCESCLEGKMTKRPFTGKGYRAK-----EPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLM--EHKSKALEKFKEY

Query:  KAEVENLLSKKIKILRSDRGGEY
         A ++N  + ++ +++ DRG EY
Subjt:  KAEVENLLSKKIKILRSDRGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-1227.57Show/hide
Query:  GVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGK
        GV +   K +++LY     E     +  +S  A+  +K    S       WH RL H     +  ++ N  L+ L        C  CL  K  K PF+  
Subjt:  GVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGK

Query:  GYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDL
           +  PL+ I+SD+     + +  ++ Y++ F+D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L
Subjt:  GYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-1329.03Show/hide
Query:  GVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGK
        GV +   K +++LY     E     +  +S  A+  +K    S       WH RL H +L  +  ++ N  L  L     L  C  C   K  K PF+  
Subjt:  GVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWHLRLSHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLEGKMTKRPFTGK

Query:  GYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLR
           + +PL+ I+SD+     + +  ++ Y++ F+D ++RY +LY ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR
Subjt:  GYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKEYKAEVENLLSKKIKILRSDRGGEYMDLR

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.1e-0838.67Show/hide
Query:  NNTYLWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNV
        + T LWH RL+H++   +  LVK G L+  +  SL  CE C+ GK  +  F+   +  K PL  +HSDL G  +V
Subjt:  NNTYLWHLRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTATCGGAAATAAATTCATGTTTTTGGAAAACTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATT
GAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTTCGGCTAAGCTTGAAAATGACTTGTATGTATTAAGA
CCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTCTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCAT
TTAAGATTAAGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTT
GAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAAAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCT
AGAGGGGATTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTAAAGCTCTTGAAAAGTTCAAGGAG
TATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTATCGGAAATAAATTCATGTTTTTGGAAAACTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATT
GAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTTCGGCTAAGCTTGAAAATGACTTGTATGTATTAAGA
CCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTCTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCAT
TTAAGATTAAGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTT
GAAGGAAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAAAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCT
AGAGGGGATTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTAAAGCTCTTGAAAAGTTCAAGGAG
TATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTCTAG
Protein sequenceShow/hide protein sequence
MSFQLVQWEMLSCLSEINSCFWKTKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENDLYVLRPNEAKAVLNHEMSRTANTQNKRQRISPNNNTYLWH
LRLSHINLDRIGRLVKNGLLNKLEDDSLPPCESCLEGKMTKRPFTGKGYRAKEPLKLIHSDLCGPMNVKARGDFEYFISFIDDYSRYGYLYLMEHKSKALEKFKE
YKAEVENLLSKKIKILRSDRGGEYMDLRF