; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G06410 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G06410
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr07:11635747..11642045
RNA-Seq ExpressionClc07G06410
SyntenyClc07G06410
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR025724 - GAG-pre-integrase domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.7e-6942.89Show/hide
Query:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------
        MFGQPS S+RH+AIK++Y   MKEGTSV E V                IDE +QVSFIL+SL KSF+PF TNA                           
Subjt:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------

Query:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------
                     F RGSSS+ K GPS   K  MKKKGK K P N    KK  DKGKCF+CN+DG WKR+C KYL++KKAEK  QGKYDLLV        
Subjt:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------

Query:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDM-------------------------------------------------
                                 +KL++G+ITL VG+G++++A AVGD+                                                 
Subjt:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDM-------------------------------------------------

Query:  --------------------------------------KQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLS
                                              KQK+S   YLWHLRLGHINLNRI RLVKSG+LNQLEDNSLPPCESCLEGK+TKRSFT KGL 
Subjt:  --------------------------------------KQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLS

Query:  AKKPLELVHSDLWGLMNVKARGGYEYVIN
        AK PLELVHSDL G MNVKARGGYEY I+
Subjt:  AKKPLELVHSDLWGLMNVKARGGYEYVIN

KAA0040307.1 gag/pol protein [Cucumis melo var. makuwa]4.9e-6450.31Show/hide
Query:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------
        SLKG    MFGQP   +RH+AIKY+Y   MKEGTSV E V                IDE +QVSFILESL KSF+PF TNA                   
Subjt:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------

Query:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV
                             F+RGSSS++K GP    +K ++KKGK KTP   KG KK ++KGKC++C E+G W ++C KYL++KKAEKE QGKYD   
Subjt:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV

Query:  --RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVH
          ++L +G+ITL VG+G+I++A AVG++K   + RY  L ++         IGRLVKSGLL +LEDNSLPPC+S LEGK+TKRSFT KGL AK PLEL+H
Subjt:  --RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVH

Query:  SDLWGLMNVKARGGYEYVIN
        S+L   MNVKARGGYEY IN
Subjt:  SDLWGLMNVKARGGYEYVIN

KAA0040701.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-6249.53Show/hide
Query:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTS----------------VNEAVIDEKSQVSFILESLLKSFLPFLTNA-------------------
        SLKG    MFGQP  S+RH+ IKY+Y   MKEGTS                VN   IDE +QVSFILESL KSF+PF TNA                   
Subjt:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTS----------------VNEAVIDEKSQVSFILESLLKSFLPFLTNA-------------------

Query:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV
                             F+RG SS++K GPS   +K ++KKGK+KTP      KK  +KGKC++C E+G W R+C KYL++KKAEKE QGKYDLL 
Subjt:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV

Query:  ----RKLQDGKITLNVGSGDIIAANAVGDMKQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELV
            ++L +G+ITLNVG+ ++++A AVGD+K                     IGRLVKSGLLN+LE NSLP  +S LEGK+TKRSFT KGL AK PLELV
Subjt:  ----RKLQDGKITLNVGSGDIIAANAVGDMKQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELV

Query:  HSDLWGLMNVKARGGYEYVIN
        HSDL G MNVKARGGYEY IN
Subjt:  HSDLWGLMNVKARGGYEYVIN

KAA0059877.1 gag/pol protein [Cucumis melo var. makuwa]9.9e-6548.4Show/hide
Query:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------
        MFGQ   S+RH+AIKY+Y   MKEGTSV E V                IDE +QVSFILESL KSF+PF TNA                           
Subjt:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------

Query:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------
                     F+RGSSS++K GPS   +K ++KKGK KTP   KG KK  +KGKC++C E+G W R+CLKYL++KKAEKE QGKYDLLV        
Subjt:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------

Query:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLE
                                 ++L +GKITL VG+G++++A AVGD+K   + RY  L +          IG LVKSGLL+QLEDNSLPPC+S LE
Subjt:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLE

Query:  GKITKRSFTEKGLSAKKPLELVHSDLWGLMNVKARGGYEYVIN
        GK+TKRSFT KGL AK PLELVHSDL G MNVKARGGYEY I+
Subjt:  GKITKRSFTEKGLSAKKPLELVHSDLWGLMNVKARGGYEYVIN

KAA0063246.1 gag/pol protein [Cucumis melo var. makuwa]4.9e-6449.24Show/hide
Query:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------
        SLKG    MFGQP  S+RH+AIKY+Y   MKE TSV E V                IDE +QVSFILES  KSF+PF TNA                   
Subjt:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------

Query:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV
                             F+RGSSS++K  PS   +K ++KKGK KTP   KG KK  +KGKC++C E+G W R+C KYL++KKAEKE QG  + + 
Subjt:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV

Query:  ---------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAK
                 ++L +G+ITL VG+G++++A AVGD+K   + RY  L ++         IGRLVKSGLLNQLEDNSLPPC+SCLEGK+TKRSFT KGL AK
Subjt:  ---------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAK

Query:  KPLELVHSDLWGLMNVKARGGYEYVIN
         PLELVHSD +G MNVKARGGY+Y I+
Subjt:  KPLELVHSDLWGLMNVKARGGYEYVIN

TrEMBL top hitse value%identityAlignment
A0A5A7TFI0 Gag/pol protein2.4e-6450.31Show/hide
Query:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------
        SLKG    MFGQP   +RH+AIKY+Y   MKEGTSV E V                IDE +QVSFILESL KSF+PF TNA                   
Subjt:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------

Query:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV
                             F+RGSSS++K GP    +K ++KKGK KTP   KG KK ++KGKC++C E+G W ++C KYL++KKAEKE QGKYD   
Subjt:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV

Query:  --RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVH
          ++L +G+ITL VG+G+I++A AVG++K   + RY  L ++         IGRLVKSGLL +LEDNSLPPC+S LEGK+TKRSFT KGL AK PLEL+H
Subjt:  --RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVH

Query:  SDLWGLMNVKARGGYEYVIN
        S+L   MNVKARGGYEY IN
Subjt:  SDLWGLMNVKARGGYEYVIN

A0A5A7TGB4 Gag/pol protein1.7e-6249.53Show/hide
Query:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTS----------------VNEAVIDEKSQVSFILESLLKSFLPFLTNA-------------------
        SLKG    MFGQP  S+RH+ IKY+Y   MKEGTS                VN   IDE +QVSFILESL KSF+PF TNA                   
Subjt:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTS----------------VNEAVIDEKSQVSFILESLLKSFLPFLTNA-------------------

Query:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV
                             F+RG SS++K GPS   +K ++KKGK+KTP      KK  +KGKC++C E+G W R+C KYL++KKAEKE QGKYDLL 
Subjt:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV

Query:  ----RKLQDGKITLNVGSGDIIAANAVGDMKQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELV
            ++L +G+ITLNVG+ ++++A AVGD+K                     IGRLVKSGLLN+LE NSLP  +S LEGK+TKRSFT KGL AK PLELV
Subjt:  ----RKLQDGKITLNVGSGDIIAANAVGDMKQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELV

Query:  HSDLWGLMNVKARGGYEYVIN
        HSDL G MNVKARGGYEY IN
Subjt:  HSDLWGLMNVKARGGYEYVIN

A0A5A7V9X9 Gag/pol protein2.4e-6449.24Show/hide
Query:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------
        SLKG    MFGQP  S+RH+AIKY+Y   MKE TSV E V                IDE +QVSFILES  KSF+PF TNA                   
Subjt:  SLKGSGLVMFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA-------------------

Query:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV
                             F+RGSSS++K  PS   +K ++KKGK KTP   KG KK  +KGKC++C E+G W R+C KYL++KKAEKE QG  + + 
Subjt:  ---------------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV

Query:  ---------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAK
                 ++L +G+ITL VG+G++++A AVGD+K   + RY  L ++         IGRLVKSGLLNQLEDNSLPPC+SCLEGK+TKRSFT KGL AK
Subjt:  ---------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAK

Query:  KPLELVHSDLWGLMNVKARGGYEYVIN
         PLELVHSD +G MNVKARGGY+Y I+
Subjt:  KPLELVHSDLWGLMNVKARGGYEYVIN

A0A5D3DMH3 Gag/pol protein4.8e-6548.4Show/hide
Query:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------
        MFGQ   S+RH+AIKY+Y   MKEGTSV E V                IDE +QVSFILESL KSF+PF TNA                           
Subjt:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------

Query:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------
                     F+RGSSS++K GPS   +K ++KKGK KTP   KG KK  +KGKC++C E+G W R+CLKYL++KKAEKE QGKYDLLV        
Subjt:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------

Query:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLE
                                 ++L +GKITL VG+G++++A AVGD+K   + RY  L +          IG LVKSGLL+QLEDNSLPPC+S LE
Subjt:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDMKQKIS-RYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLE

Query:  GKITKRSFTEKGLSAKKPLELVHSDLWGLMNVKARGGYEYVIN
        GK+TKRSFT KGL AK PLELVHSDL G MNVKARGGYEY I+
Subjt:  GKITKRSFTEKGLSAKKPLELVHSDLWGLMNVKARGGYEYVIN

E2GK51 Gag/pol protein (Fragment)8.5e-7042.89Show/hide
Query:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------
        MFGQPS S+RH+AIK++Y   MKEGTSV E V                IDE +QVSFIL+SL KSF+PF TNA                           
Subjt:  MFGQPSSSIRHDAIKYVYNSHMKEGTSVNEAV----------------IDEKSQVSFILESLLKSFLPFLTNA---------------------------

Query:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------
                     F RGSSS+ K GPS   K  MKKKGK K P N    KK  DKGKCF+CN+DG WKR+C KYL++KKAEK  QGKYDLLV        
Subjt:  -------------FQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKENQGKYDLLV--------

Query:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDM-------------------------------------------------
                                 +KL++G+ITL VG+G++++A AVGD+                                                 
Subjt:  -------------------------RKLQDGKITLNVGSGDIIAANAVGDM-------------------------------------------------

Query:  --------------------------------------KQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLS
                                              KQK+S   YLWHLRLGHINLNRI RLVKSG+LNQLEDNSLPPCESCLEGK+TKRSFT KGL 
Subjt:  --------------------------------------KQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLS

Query:  AKKPLELVHSDLWGLMNVKARGGYEYVIN
        AK PLELVHSDL G MNVKARGGYEY I+
Subjt:  AKKPLELVHSDLWGLMNVKARGGYEYVIN

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-0733.33Show/hide
Query:  LWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVHSDLWGLMNVKARGGYEYVI
        LWH R+GH++   +  L K  L++  +  ++ PC+ CL GK  + SF          L+LV+SD+ G M +++ GG +Y +
Subjt:  LWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVHSDLWGLMNVKARGGYEYVI

P93293 Uncharacterized mitochondrial protein AtMg003004.7e-0942.47Show/hide
Query:  TYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVHSDLWGLMNV
        T LWH RL H++   +  LVK G L+  + +SL  CE C+ GK  + +F+    + K PL+ VHSDLWG  +V
Subjt:  TYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVHSDLWGLMNV

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein3.3e-1042.47Show/hide
Query:  TYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVHSDLWGLMNV
        T LWH RL H++   +  LVK G L+  + +SL  CE C+ GK  + +F+    + K PL+ VHSDLWG  +V
Subjt:  TYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVHSDLWGLMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGAAGAAGGGTCTGTAAATTTGTTAGAGTCACTAGTGTTGATCTCTGTAATTTGGAGAAATCGAAGGAAGAACCAAAATGTTCAAGCAGCGGGGACCTACTTTTT
CACTGAGCATCAATGCTCCTTCAACAGCATCATGACATTAAAGGGTTCAGTTGGACTGGTGGCAATTTTGTTCACCAATTTGTACTTGGTTCAGCCTTTATCCAGCCGAT
TCAGCCCTATTTTCAACATTAGTTTTCTGGCCAGTTTGAAGGGTTCGGGGTTGGTGATGTTTGGACAACCGTCTTCCTCGATCAGGCACGATGCTATCAAATACGTTTAC
AATTCCCACATGAAAGAAGGAACCTCTGTAAACGAGGCTGTCATAGACGAGAAGAGTCAAGTTAGTTTTATTCTAGAATCTCTTCTAAAAAGTTTTCTACCATTCCTCAC
TAATGCATTCCAAAGAGGATCATCCTCAAGAACTAAGTTTGGACCCTCTTCCTTAAAGAAGAAGAATATGAAGAAGAAGGGTAAAAAGAAAACTCCTATAAATCTCAAAG
GTAAGAAAAAAGTTGTAGATAAAGGAAAATGTTTCTACTGCAACGAGGATGGGCGTTGGAAGAGAGACTGCCTGAAATATCTTTCTAAGAAAAAAGCTGAAAAAGAAAAC
CAAGGTAAATATGATTTACTAGTTAGAAAGCTTCAAGATGGCAAGATAACTCTCAATGTCGGATCAGGGGATATCATCGCAGCCAATGCAGTGGGAGATATGAAGCAAAA
AATTTCTCGTTATACCTATCTTTGGCATTTAAGACTTGGCCACATTAATCTCAATAGGATTGGGAGGTTGGTAAAAAGTGGACTCTTAAACCAGTTAGAAGACAACTCTT
TACCTCCATGTGAGTCTTGTCTTGAGGGTAAAATTACGAAAAGATCTTTTACTGAAAAAGGTCTTAGCGCCAAAAAACCCTTAGAACTCGTGCACTCAGACCTTTGGGGT
CTTATGAATGTCAAAGCACGAGGAGGGTATGAATATGTCATTAATAATCCTAATCTTCTTCCTTTTAACCCTGAGATTGATAGGACTTACCAGAGGAATCTAAGAGGACA
AACAAATTCAACCGGTGAGGTGGCGAAAGAGGCATCACCAAAGGCAATTCAACGTTCCACCAAGGAGCCTTACAAGCACCTTCGATCTTTCCTAGAGATATGCGGGACGG
ATTGGCTAGAGACTATACTGCCGGAGAGCATAATTACATGGGATGCACTAGTTCAAGCTTTCTTGAACAACTACTTCCCATCGGCGAAGTCACAAAGATTGAGGACAGAG
ATTGACACATTTTGTCAACAAGAAGGTGAGCATTTTTATGAGGTTTTGGAGAGGTACAAGGATCTTTTGAGGAGATGCCCACAACATGGCTACCCGGATTGCTCTCTAAA
GGCTCAATTGGCTTCTCTTACTAATGCTTTGTCTAAATTGACTCAAGGAGGCCAAGCCCAAGCAAGTCCACCATCCATAGCTTCCCTTCCGGCCACGGCAAGTCAACAGG
AGCCAAGTGAGTTGGAGATGGCCAACTATGCGGATAGAGAACAATTAGGAATTGAAGCTAGTCTGGAATTCGTGCAATCTGGACAAGTTGTGATGTTCGAAGAATATCGA
GCTGTAGAGATCAAATCTGGAGTTGAGCTTTTGAGATTAGTAAATAGCCTGGAGCAAGTTGAAGGAATTTTAAGTTATCTGTCGAACAAAAAGAAGTTAGACCTTGCAAT
TTGGAGATTGGGTAAGACAAGTCTCGCTAGTAAACATATAACGACATTGCGGGTGTGGAACGAGACAGGTCTCGCTAGTCTGACGAAGTCTTGCTTTCTCGCTAGTCTCA
CTAATCTTGTTGGTCTCGCTAGTTGCTTGGTCTCATTAAAGGTCCCAATGCCGAAGGACACAGAGAACACATATGGGCAGCCTGAATATCGCATATGGGACCTGGTATTG
AAGGACTTGAGAAGGTACACAGGCAACCCAGTTTATGATTCCTATGTGAATATAGATATTCAAGGCTGTTGTGTTGAGAATACTCCACACAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGAAGAAGGGTCTGTAAATTTGTTAGAGTCACTAGTGTTGATCTCTGTAATTTGGAGAAATCGAAGGAAGAACCAAAATGTTCAAGCAGCGGGGACCTACTTTTT
CACTGAGCATCAATGCTCCTTCAACAGCATCATGACATTAAAGGGTTCAGTTGGACTGGTGGCAATTTTGTTCACCAATTTGTACTTGGTTCAGCCTTTATCCAGCCGAT
TCAGCCCTATTTTCAACATTAGTTTTCTGGCCAGTTTGAAGGGTTCGGGGTTGGTGATGTTTGGACAACCGTCTTCCTCGATCAGGCACGATGCTATCAAATACGTTTAC
AATTCCCACATGAAAGAAGGAACCTCTGTAAACGAGGCTGTCATAGACGAGAAGAGTCAAGTTAGTTTTATTCTAGAATCTCTTCTAAAAAGTTTTCTACCATTCCTCAC
TAATGCATTCCAAAGAGGATCATCCTCAAGAACTAAGTTTGGACCCTCTTCCTTAAAGAAGAAGAATATGAAGAAGAAGGGTAAAAAGAAAACTCCTATAAATCTCAAAG
GTAAGAAAAAAGTTGTAGATAAAGGAAAATGTTTCTACTGCAACGAGGATGGGCGTTGGAAGAGAGACTGCCTGAAATATCTTTCTAAGAAAAAAGCTGAAAAAGAAAAC
CAAGGTAAATATGATTTACTAGTTAGAAAGCTTCAAGATGGCAAGATAACTCTCAATGTCGGATCAGGGGATATCATCGCAGCCAATGCAGTGGGAGATATGAAGCAAAA
AATTTCTCGTTATACCTATCTTTGGCATTTAAGACTTGGCCACATTAATCTCAATAGGATTGGGAGGTTGGTAAAAAGTGGACTCTTAAACCAGTTAGAAGACAACTCTT
TACCTCCATGTGAGTCTTGTCTTGAGGGTAAAATTACGAAAAGATCTTTTACTGAAAAAGGTCTTAGCGCCAAAAAACCCTTAGAACTCGTGCACTCAGACCTTTGGGGT
CTTATGAATGTCAAAGCACGAGGAGGGTATGAATATGTCATTAATAATCCTAATCTTCTTCCTTTTAACCCTGAGATTGATAGGACTTACCAGAGGAATCTAAGAGGACA
AACAAATTCAACCGGTGAGGTGGCGAAAGAGGCATCACCAAAGGCAATTCAACGTTCCACCAAGGAGCCTTACAAGCACCTTCGATCTTTCCTAGAGATATGCGGGACGG
ATTGGCTAGAGACTATACTGCCGGAGAGCATAATTACATGGGATGCACTAGTTCAAGCTTTCTTGAACAACTACTTCCCATCGGCGAAGTCACAAAGATTGAGGACAGAG
ATTGACACATTTTGTCAACAAGAAGGTGAGCATTTTTATGAGGTTTTGGAGAGGTACAAGGATCTTTTGAGGAGATGCCCACAACATGGCTACCCGGATTGCTCTCTAAA
GGCTCAATTGGCTTCTCTTACTAATGCTTTGTCTAAATTGACTCAAGGAGGCCAAGCCCAAGCAAGTCCACCATCCATAGCTTCCCTTCCGGCCACGGCAAGTCAACAGG
AGCCAAGTGAGTTGGAGATGGCCAACTATGCGGATAGAGAACAATTAGGAATTGAAGCTAGTCTGGAATTCGTGCAATCTGGACAAGTTGTGATGTTCGAAGAATATCGA
GCTGTAGAGATCAAATCTGGAGTTGAGCTTTTGAGATTAGTAAATAGCCTGGAGCAAGTTGAAGGAATTTTAAGTTATCTGTCGAACAAAAAGAAGTTAGACCTTGCAAT
TTGGAGATTGGGTAAGACAAGTCTCGCTAGTAAACATATAACGACATTGCGGGTGTGGAACGAGACAGGTCTCGCTAGTCTGACGAAGTCTTGCTTTCTCGCTAGTCTCA
CTAATCTTGTTGGTCTCGCTAGTTGCTTGGTCTCATTAAAGGTCCCAATGCCGAAGGACACAGAGAACACATATGGGCAGCCTGAATATCGCATATGGGACCTGGTATTG
AAGGACTTGAGAAGGTACACAGGCAACCCAGTTTATGATTCCTATGTGAATATAGATATTCAAGGCTGTTGTGTTGAGAATACTCCACACAACTAA
Protein sequenceShow/hide protein sequence
MCEEGSVNLLESLVLISVIWRNRRKNQNVQAAGTYFFTEHQCSFNSIMTLKGSVGLVAILFTNLYLVQPLSSRFSPIFNISFLASLKGSGLVMFGQPSSSIRHDAIKYVY
NSHMKEGTSVNEAVIDEKSQVSFILESLLKSFLPFLTNAFQRGSSSRTKFGPSSLKKKNMKKKGKKKTPINLKGKKKVVDKGKCFYCNEDGRWKRDCLKYLSKKKAEKEN
QGKYDLLVRKLQDGKITLNVGSGDIIAANAVGDMKQKISRYTYLWHLRLGHINLNRIGRLVKSGLLNQLEDNSLPPCESCLEGKITKRSFTEKGLSAKKPLELVHSDLWG
LMNVKARGGYEYVINNPNLLPFNPEIDRTYQRNLRGQTNSTGEVAKEASPKAIQRSTKEPYKHLRSFLEICGTDWLETILPESIITWDALVQAFLNNYFPSAKSQRLRTE
IDTFCQQEGEHFYEVLERYKDLLRRCPQHGYPDCSLKAQLASLTNALSKLTQGGQAQASPPSIASLPATASQQEPSELEMANYADREQLGIEASLEFVQSGQVVMFEEYR
AVEIKSGVELLRLVNSLEQVEGILSYLSNKKKLDLAIWRLGKTSLASKHITTLRVWNETGLASLTKSCFLASLTNLVGLASCLVSLKVPMPKDTENTYGQPEYRIWDLVL
KDLRRYTGNPVYDSYVNIDIQGCCVENTPHN