CuGenDBv2

Cmc06g0172011 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0172011
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr06:27802509..27803360
RNA-Seq Expression: Cmc06g0172011
Synteny: Cmc06g0172011
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
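
The genome location above can be turned into a feature length with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(loc):
    """Split 'seqid:start..end' into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates (an assumption about this record).
    """
    m = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    start, end = int(m["start"]), int(m["end"])
    return m["seqid"], start, end, end - start + 1

seqid, start, end, length = parse_location("CMiso1.1chr06:27802509..27803360")
print(seqid, length)       # CMiso1.1chr06 852
print(length % 3 == 0)     # True: 852 bp is a whole number of codons (284)
```

Under that assumption the span is 852 bp, which is a whole number of codons (284).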


Homology
GenBank top hits    e value    %identity
KAA0035996.1 gag-pol polyprotein [Cucumis melo var. makuwa]    1.3e-108    68.09
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        +IHAK LPL+ W EA+N  CHIH RITTRSG+ VTLYELWKGRK NV+YFH+F  TCYILADREYHRKWD KSEQG+FLG SQNSR Y+VFNNR+  VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TIN+VVND E T  +  DE+DET  + +  ++ PA+V KAD +   +N+ SK  SKE + + +  +PS+HV KN+PS SIIGDPS GI TRKK+K+DYSK
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY
        MI DLCYTS IEP++++ ALKDEYWINAMQEEL+QF+RNNVWTLVPK  G NIIGTKW+ KNKTDE+ CVT+NKA LVAQGY
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY

KAA0039208.1 gag-pol polyprotein [Cucumis melo var. makuwa]    2.9e-103    69.5
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIHAK+L L+ W +A+NT CHIHNRI TRSGTT  LYELWKGRK NV+YFHVF STCYILADREYHRKWDVKS+QGIFLG S NSR YR+FN +SG VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
         INVVVNDFEST  Q  DEDDET+N+PVD+S  P EV KADA  DG                                   GDPS GIIT+KKE+VDY K
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY
        MIA+LCYTSTIEPST+D+A KDE WINAMQEELL+FRRNNVWTLVPK E ANIIGTKWI KNKTDEA CVTKNKA LVAQGY
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY

KAA0064117.1 gag-pol polyprotein [Cucumis melo var. makuwa]    1.5e-99    78.01
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIH K+LPL+   EA+NT CHIHNRITTRSG TVTLYELW GRK NV+YFHVFG+TCYILADREYHRKWDVK EQGIFL YS NS  YRVFN +SGIVME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TINVVVNDFEST  Q  DEDDET+N+PVD+S    EV KADA  DG+ INS+ ISKEVIADN E VPSTHV+KN+PS SIIGDP   IIT+KKEKVDY K
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNV
        MIADLCYT  I+PSTIDVALKDEYWIN MQEELLQF RNNV
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNV

TYK11800.1 gag-pol polyprotein [Cucumis melo var. makuwa]    7.1e-118    83.14
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIHAKHLPLN W EAVNT CHIH+RITTRSGT VTLYELW GRK NV+YFHVFGSTCYILA+REYHRKWDVKS+QGIFLGYSQNSR YRVFN+ SG VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TIN +VNDFES   QTYDEDDETLN+ +DSST   +VLKAD QADGT+INS   SKEVIADNSELV S H+RKN+P  SIIGDPS  I TRKKEKVDYSK
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILK
        MIADLCYTSTIEPST+ VALKDEYWINAMQEELLQFRRNNVWTLVPK EGANIIGTKWI K
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILK

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]    5.1e-100    64.24
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIHAK+LPLN W EAVNT CHIHNR+TTRSG TVTLYEL KGRK NV+YFH+FGSTCYILADREYHRKWD KS QGIFLGYSQNSR YRVFN +SG VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TINVVVNDFES   Q   EDDET   P  +ST   E+ K ++Q      +S  I+ EVI + + LVPS HV+KN+PS SII DPS GI TR+KE ++ + 
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIA-------------------DLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQG
         I+                   DLCY S IEP++++ +LKDEYWI  MQEE LQF+RNNVWTLVPK +GANIIGTKWI KNKTDE+  + +NKARLVAQG
Subjt:  MIA-------------------DLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQG

Query:  YT
        YT
Subjt:  YT

TrEMBL top hits    e value    %identity
A0A5A7T197 Gag-pol polyprotein    6.5e-109    68.09
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        +IHAK LPL+ W EA+N  CHIH RITTRSG+ VTLYELWKGRK NV+YFH+F  TCYILADREYHRKWD KSEQG+FLG SQNSR Y+VFNNR+  VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TIN+VVND E T  +  DE+DET  + +  ++ PA+V KAD +   +N+ SK  SKE + + +  +PS+HV KN+PS SIIGDPS GI TRKK+K+DYSK
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY
        MI DLCYTS IEP++++ ALKDEYWINAMQEEL+QF+RNNVWTLVPK  G NIIGTKW+ KNKTDE+ CVT+NKA LVAQGY
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY

A0A5A7VAS8 Gag-pol polyprotein    7.2e-100    78.01
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIH K+LPL+   EA+NT CHIHNRITTRSG TVTLYELW GRK NV+YFHVFG+TCYILADREYHRKWDVK EQGIFL YS NS  YRVFN +SGIVME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TINVVVNDFEST  Q  DEDDET+N+PVD+S    EV KADA  DG+ INS+ ISKEVIADN E VPSTHV+KN+PS SIIGDP   IIT+KKEKVDY K
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNV
        MIADLCYT  I+PSTIDVALKDEYWIN MQEELLQF RNNV
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNV

A0A5D3CK56 Gag-pol polyprotein    3.4e-118    83.14
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIHAKHLPLN W EAVNT CHIH+RITTRSGT VTLYELW GRK NV+YFHVFGSTCYILA+REYHRKWDVKS+QGIFLGYSQNSR YRVFN+ SG VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TIN +VNDFES   QTYDEDDETLN+ +DSST   +VLKAD QADGT+INS   SKEVIADNSELV S H+RKN+P  SIIGDPS  I TRKKEKVDYSK
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILK
        MIADLCYTSTIEPST+ VALKDEYWINAMQEELLQFRRNNVWTLVPK EGANIIGTKWI K
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILK

A0A5D3D4K0 Gag-pol polyprotein    1.4e-103    69.5
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIHAK+L L+ W +A+NT CHIHNRI TRSGTT  LYELWKGRK NV+YFHVF STCYILADREYHRKWDVKS+QGIFLG S NSR YR+FN +SG VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
         INVVVNDFEST  Q  DEDDET+N+PVD+S  P EV KADA  DG                                   GDPS GIIT+KKE+VDY K
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY
        MIA+LCYTSTIEPST+D+A KDE WINAMQEELL+FRRNNVWTLVPK E ANIIGTKWI KNKTDEA CVTKNKA LVAQGY
Subjt:  MIADLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY

A0A5D3DCZ8 Gag-pol polyprotein    2.5e-100    64.24
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        MIHAK+LPLN W EAVNT CHIHNR+TTRSG TVTLYEL KGRK NV+YFH+FGSTCYILADREYHRKWD KS QGIFLGYSQNSR YRVFN +SG VME
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK
        TINVVVNDFES   Q   EDDET   P  +ST   E+ K ++Q      +S  I+ EVI + + LVPS HV+KN+PS SII DPS GI TR+KE ++ + 
Subjt:  TINVVVNDFESTTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSK

Query:  MIA-------------------DLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQG
         I+                   DLCY S IEP++++ +LKDEYWI  MQEE LQF+RNNVWTLVPK +GANIIGTKWI KNKTDE+  + +NKARLVAQG
Subjt:  MIA-------------------DLCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQG

Query:  YT
        YT
Subjt:  YT

SwissProt top hits    e value    %identity
P04146 Copia protein    1.1e-04    37.29
Query:  WINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGYT
        W  A+  EL   + NN WT+  + E  NI+ ++W+   K +E     + KARLVA+G+T
Subjt:  WINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGYT
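
The %identity reported for a hit can be checked against its alignment. The sketch below counts identical positions from the midline of the P04146 alignment (letters mark identities, `+` marks conservative substitutions, spaces mark mismatches) and divides by the alignment length. It reads the midline because the Subjt rows in this record show the same text as the Query rows. `percent_identity` is a hypothetical helper name; in BLAST output, gap columns also count toward the alignment length, and this particular alignment has none.

```python
def percent_identity(midline, alignment_length):
    """Percent identity = identical columns / total aligned columns * 100.

    In a BLAST-style midline, a letter marks an identical residue pair.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / alignment_length

# Query row and midline copied from the P04146 (Copia protein) alignment.
query   = "WINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGYT"
midline = "W  A+  EL   + NN WT+  + E  NI+ ++W+   K +E     + KARLVA+G+T"

print(f"{percent_identity(midline, len(query)):.2f}")  # 37.29
```

This gives 22 identical positions over 59 aligned columns, matching the 37.29 listed for this hit.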

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    2.3e-10    22.9
Query:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME
        M+    LP + W EAV T C++ NR  +          +W  ++ +  +  VFG   +    +E   K D KS   IF+GY      YR+++     V+ 
Subjt:  MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVME

Query:  TINVVVNDFESTTIQTYDED------------DETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGI
        + +VV  + E  T     E               T N P  + +   EV +   Q        + + + V     E+   T   + +  +     P   +
Subjt:  TINVVVNDFESTTIQTYDED------------DETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGI

Query:  ITRKKEKVDYSKMIADLCYTSTIEPSTIDVAL---KDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY
         +R+    +Y  +  D       EP ++   L   +    + AMQEE+   ++N  + LV   +G   +  KW+ K K D    + + KARLV +G+
Subjt:  ITRKKEKVDYSKMIADLCYTSTIEPSTIDVAL---KDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY

P92520 Uncharacterized mitochondrial protein AtMg00820    9.2e-12    41.24
Query:  IITRKKEKVDYSKMIADLCYTSTI--EPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY
        ++TR K  ++       L  T+TI  EP ++  ALKD  W  AMQEEL    RN  W LVP     NI+G KW+ K K      + + KARLVA+G+
Subjt:  IITRKKEKVDYSKMIADLCYTSTI--EPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY

Arabidopsis top hits    e value    %identity
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    1.6e-06    32.91
Query:  LCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGYT
        +C     EPST + A +   W  AM +E+      + W +         IG KW+ K K +    + + KARLVA+GYT
Subjt:  LCYTSTIEPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGYT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    6.6e-13    41.24
Query:  IITRKKEKVDYSKMIADLCYTSTI--EPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY
        ++TR K  ++       L  T+TI  EP ++  ALKD  W  AMQEEL    RN  W LVP     NI+G KW+ K K      + + KARLVA+G+
Subjt:  IITRKKEKVDYSKMIADLCYTSTI--EPSTIDVALKDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGY


Sequences
CDS sequence
ATGATACATGCTAAACATTTACCTTTAAATGTTTGGACAGAAGCTGTTAACACAACGTGCCACATTCATAATAGAATTACTACTAGATCTGGAACAACTGTCACATTATA
TGAACTTTGGAAGGGAAGAAAGTCAAATGTACAGTATTTTCATGTATTTGGAAGTACTTGTTATATTCTAGCTGATAGAGAATATCATCGAAAGTGGGATGTGAAATCAG
AACAAGGAATATTTCTTGGATATTCTCAAAATAGCCGAGTTTATAGAGTCTTTAATAATAGATCTGGTATAGTTATGGAAACGATCAATGTTGTGGTAAATGATTTTGAA
TCAACTACCATACAAACTTATGATGAGGATGATGAGACTTTAAATATGCCTGTGGATTCTTCTACGCTTCCTGCGGAAGTACTTAAAGCTGATGCTCAGGCAGATGGTAC
TAATATAAACTCAAAAATGATATCTAAGGAAGTTATAGCTGATAACTCTGAACTTGTTCCATCTACACATGTGAGAAAGAATTATCCATCAATTTCTATAATAGGTGATC
CTTCAACCGGAATTATCACCAGAAAGAAAGAGAAAGTAGATTACTCAAAGATGATTGCTGATTTATGTTATACATCTACCATTGAACCCTCGACTATTGATGTTGCTCTT
AAAGATGAATATTGGATAAATGCAATGCAAGAAGAACTACTTCAATTCAGGCGTAACAATGTCTGGACATTAGTTCCCAAGCTAGAAGGAGCAAATATTATAGGTACAAA
ATGGATCTTAAAAAATAAGACTGATGAAGCACGGTGTGTGACAAAGAATAAAGCTCGTTTAGTTGCTCAGGGTTATACTTAA
mRNA sequence
ATGATACATGCTAAACATTTACCTTTAAATGTTTGGACAGAAGCTGTTAACACAACGTGCCACATTCATAATAGAATTACTACTAGATCTGGAACAACTGTCACATTATA
TGAACTTTGGAAGGGAAGAAAGTCAAATGTACAGTATTTTCATGTATTTGGAAGTACTTGTTATATTCTAGCTGATAGAGAATATCATCGAAAGTGGGATGTGAAATCAG
AACAAGGAATATTTCTTGGATATTCTCAAAATAGCCGAGTTTATAGAGTCTTTAATAATAGATCTGGTATAGTTATGGAAACGATCAATGTTGTGGTAAATGATTTTGAA
TCAACTACCATACAAACTTATGATGAGGATGATGAGACTTTAAATATGCCTGTGGATTCTTCTACGCTTCCTGCGGAAGTACTTAAAGCTGATGCTCAGGCAGATGGTAC
TAATATAAACTCAAAAATGATATCTAAGGAAGTTATAGCTGATAACTCTGAACTTGTTCCATCTACACATGTGAGAAAGAATTATCCATCAATTTCTATAATAGGTGATC
CTTCAACCGGAATTATCACCAGAAAGAAAGAGAAAGTAGATTACTCAAAGATGATTGCTGATTTATGTTATACATCTACCATTGAACCCTCGACTATTGATGTTGCTCTT
AAAGATGAATATTGGATAAATGCAATGCAAGAAGAACTACTTCAATTCAGGCGTAACAATGTCTGGACATTAGTTCCCAAGCTAGAAGGAGCAAATATTATAGGTACAAA
ATGGATCTTAAAAAATAAGACTGATGAAGCACGGTGTGTGACAAAGAATAAAGCTCGTTTAGTTGCTCAGGGTTATACTTAA
Protein sequence
MIHAKHLPLNVWTEAVNTTCHIHNRITTRSGTTVTLYELWKGRKSNVQYFHVFGSTCYILADREYHRKWDVKSEQGIFLGYSQNSRVYRVFNNRSGIVMETINVVVNDFE
STTIQTYDEDDETLNMPVDSSTLPAEVLKADAQADGTNINSKMISKEVIADNSELVPSTHVRKNYPSISIIGDPSTGIITRKKEKVDYSKMIADLCYTSTIEPSTIDVAL
KDEYWINAMQEELLQFRRNNVWTLVPKLEGANIIGTKWILKNKTDEARCVTKNKARLVAQGYT
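
The relationship between the CDS and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 18 and last 12 nucleotides of the CDS listed above are used, to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGATACATGCTAAACAT"  # first 18 nt of the CDS
cds_end = "GGTTATACTTAA"          # last 12 nt of the CDS, in frame

print(translate(cds_start))  # MIHAKH
print(translate(cds_end))    # GYT
```

Both fragments agree with the ends of the protein sequence above, which begins with MIHAKH and ends with GYT.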