; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0016076 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0016076
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGag/pol protein
Genome locationchr05:15005683..15008120
RNA-Seq ExpressionPI0016076
SyntenyPI0016076
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048500.1 protein MNN4-like [Cucumis melo var. makuwa]1.0e-4039.1Show/hide
Query:  APCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKV-------AGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSLIKVEKGLFPFNGQLPD
        A  E K    EK  +  ++E E+ELE +SPL+DR   ++P+K        A     + KT +   E + S  + +     +     +EKG+FPF GQLP 
Subjt:  APCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKV-------AGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSLIKVEKGLFPFNGQLPD

Query:  FLFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDL-SNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTP
        FL +PI+A  WK FF+G T +R  V+  FY   +N E     + GK V+F  + +N LY L    VE P       P+    + ALE +AW G  W++TP
Subjt:  FLFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDL-SNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTP

Query:  TGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWM
          KYQL+PH L T ASVWL FIKK + P RHD+TI+L+  MLLYCI+ +  +N+ + I   I  W+
Subjt:  TGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWM

KAA0049609.1 transposase [Cucumis melo var. makuwa]6.3e-3047.85Show/hide
Query:  ISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAML
        +  + V F+ E IN LYDL ND+   PGQ  +    +  A++ +++I W  A    TPT + QL+PHQLT EA+VWLFFIKKKIFP  HDSTI  + A++
Subjt:  ISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAML

Query:  LYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN
        LYCI AKK  NLG  +  + L WMR PK A PFP+T++ LCLK +      I  I + G  CN
Subjt:  LYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN

KAA0054837.1 hypothetical protein E6C27_scaffold406G00150 [Cucumis melo var. makuwa]8.0e-3333.33Show/hide
Query:  GAPCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKVAGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSL-----IKVEKGLFPFNGQLPDF
        G P  +     +  +    E  EE+   +SPL++    R+PRK      GQ    R   E+     ++      + S        VEKG F F  QL  F
Subjt:  GAPCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKVAGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSL-----IKVEKGLFPFNGQLPDF

Query:  LFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG
        L  PI+A GW+ F +G   +R GVV+ FY  K++ E+    +  +                             P+    +EALE +AW    W+VT   
Subjt:  LFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG

Query:  KYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGR
        KY+L+ H LTTEASVWL FIKKK+ P RHD+TI+ +  MLLYCI+ +  V++ + I   I  W++ P+GA PFP  IE LCL+    L  + Q   +   
Subjt:  KYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGR

Query:  LCN
        +CN
Subjt:  LCN

KAA0062900.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-3848.9Show/hide
Query:  KVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG-KYQLYPHQLTTEASVWL
        K+R+ VV KFY  K N  +  + I  +   F+ E IN LY+  ND E  GQ  V   TK +A+EAL+V+AW G   EV P   +YQLYPH LTT+A+VW+
Subjt:  KVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIPGR
        FF K KIFP  +DSTI++   ++LYCI+ KK +NL + I  +IL WM  PK AMPFPS +E LCLK +P L      +IP R
Subjt:  FFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIPGR

TYK15967.1 hypothetical protein E5676_scaffold94G00870 [Cucumis melo var. makuwa]3.7e-3043.23Show/hide
Query:  KGHTKVRLGVVEKFYAAKL--NAEEFSVQISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTT
        +G  +  +G  E     K+    E +++ +  + V F+ E IN LYDL ND+   PGQ  +    +  A++ +++I W  A    TPT + QL+PHQLT 
Subjt:  KGHTKVRLGVVEKFYAAKL--NAEEFSVQISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTT

Query:  EASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN
        EA+VWLFFIKKKIFP  HDSTI  + A++LYCI AKK  NLG  +  + L WMR PK A PFP+T++ LCLK +      I  I + G  CN
Subjt:  EASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN

TrEMBL top hitse value%identityAlignment
A0A5A7TZE0 Protein MNN4-like5.0e-4139.1Show/hide
Query:  APCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKV-------AGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSLIKVEKGLFPFNGQLPD
        A  E K    EK  +  ++E E+ELE +SPL+DR   ++P+K        A     + KT +   E + S  + +     +     +EKG+FPF GQLP 
Subjt:  APCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKV-------AGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSLIKVEKGLFPFNGQLPD

Query:  FLFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDL-SNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTP
        FL +PI+A  WK FF+G T +R  V+  FY   +N E     + GK V+F  + +N LY L    VE P       P+    + ALE +AW G  W++TP
Subjt:  FLFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDL-SNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTP

Query:  TGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWM
          KYQL+PH L T ASVWL FIKK + P RHD+TI+L+  MLLYCI+ +  +N+ + I   I  W+
Subjt:  TGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWM

A0A5A7U806 Transposase3.1e-3047.85Show/hide
Query:  ISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAML
        +  + V F+ E IN LYDL ND+   PGQ  +    +  A++ +++I W  A    TPT + QL+PHQLT EA+VWLFFIKKKIFP  HDSTI  + A++
Subjt:  ISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAML

Query:  LYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN
        LYCI AKK  NLG  +  + L WMR PK A PFP+T++ LCLK +      I  I + G  CN
Subjt:  LYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN

A0A5A7V6M5 Gag/pol protein6.2e-3948.9Show/hide
Query:  KVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG-KYQLYPHQLTTEASVWL
        K+R+ VV KFY  K N  +  + I  +   F+ E IN LY+  ND E  GQ  V   TK +A+EAL+V+AW G   EV P   +YQLYPH LTT+A+VW+
Subjt:  KVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIPGR
        FF K KIFP  +DSTI++   ++LYCI+ KK +NL + I  +IL WM  PK AMPFPS +E LCLK +P L      +IP R
Subjt:  FFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIPGR

A0A5D3CVL7 Uncharacterized protein1.8e-3043.23Show/hide
Query:  KGHTKVRLGVVEKFYAAKL--NAEEFSVQISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTT
        +G  +  +G  E     K+    E +++ +  + V F+ E IN LYDL ND+   PGQ  +    +  A++ +++I W  A    TPT + QL+PHQLT 
Subjt:  KGHTKVRLGVVEKFYAAKL--NAEEFSVQISGKTVSFSAEAINALYDLSNDV-EMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTT

Query:  EASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN
        EA+VWLFFIKKKIFP  HDSTI  + A++LYCI AKK  NLG  +  + L WMR PK A PFP+T++ LCLK +      I  I + G  CN
Subjt:  EASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGRLCN

A0A5D3DVQ6 Uncharacterized protein3.9e-3333.33Show/hide
Query:  GAPCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKVAGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSL-----IKVEKGLFPFNGQLPDF
        G P  +     +  +    E  EE+   +SPL++    R+PRK      GQ    R   E+     ++      + S        VEKG F F  QL  F
Subjt:  GAPCESKSLEPEKNLEVIAEELEEELEAMSPLDDRPPPRKPRKVAGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSL-----IKVEKGLFPFNGQLPDF

Query:  LFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG
        L  PI+A GW+ F +G   +R GVV+ FY  K++ E+    +  +                             P+    +EALE +AW    W+VT   
Subjt:  LFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGKTVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTG

Query:  KYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGR
        KY+L+ H LTTEASVWL FIKKK+ P RHD+TI+ +  MLLYCI+ +  V++ + I   I  W++ P+GA PFP  IE LCL+    L  + Q   +   
Subjt:  KYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDFIATSILFWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGR

Query:  LCN
        +CN
Subjt:  LCN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTGAATCAGAACGCCCCACATCGCGAGGAAGTGGCGAACAAAAAACGCCACAAGGTAGAGGTTGACAACCTGGATCAAGATGCGGTGCTCGAGTTTCTCAACGA
ACGAGTAAGAAAGAGGAAGGACGCTCACATCAAAAGGACTAAAGAAGCTCGTCGCCGAAAGAACGAGCACTGCCTTAAAAAGGATGGAAGAAAAGAAAAGGATGCTCGCA
GAACTGAGCGAGCAAGTGGCGGAGCTCCCTGCGAAAGTAAATCGCTGGAGCCTGAAAAGAACCTCGAGGTGATCGCAGAAGAACTCGAGGAGGAGCTGGAGGCGATGAGC
CCGCTTGATGATAGGCCACCGCCAAGAAAACCAAGGAAGGTCGCAGGACCATCAAGAGGACAGAAGAAAACTGGGCGTTCTGGACCTGAAGAGCGCCCATCAGGCGGCGA
CACCATGAGCAAAACACCCTCCATTAACTCCCTCATCAAGGTTGAGAAAGGGCTGTTTCCGTTCAATGGTCAACTTCCAGATTTCCTCTTCGCGCCAATCCAGGCATTCG
GATGGAAGTCATTCTTCAAGGGGCACACCAAAGTAAGATTAGGGGTGGTGGAGAAGTTTTATGCGGCTAAGCTCAACGCTGAAGAGTTTAGCGTACAAATCAGCGGAAAA
ACAGTGAGTTTCAGCGCAGAGGCCATCAACGCATTGTATGATTTGTCCAATGATGTTGAGATGCCAGGGCAAATATACGTAGACAGTCCTACGAAGAGGATGGCCCGTGA
AGCGCTGGAAGTCATCGCATGGCATGGGGCTGCATGGGAAGTAACGCCAACGGGGAAGTATCAGTTATATCCACATCAGCTAACCACTGAGGCGAGCGTGTGGTTGTTTT
TTATCAAAAAGAAGATCTTCCCAATGCGCCATGATAGCACCATCAACTTAAAGTCAGCAATGCTACTCTATTGTATCCTGGCGAAGAAGCGCGTCAACCTTGGCGATTTT
ATAGCCACATCCATTCTGTTTTGGATGCGAGCTCCCAAAGGCGCCATGCCCTTCCCTTCAACCATTGAGGCCCTTTGCCTTAAAGTTGTGCCATTCTTATCCGCCATCCA
AACAATTTCAATACCAGGCAGGCTGTGTAATCAAATGGCTTTAAACCGCATGATTACTTTCTATGGACATAAGGAAATGGAAAGGAGGGCAAAGACATTGGGCGACACAC
CTGAAGGAATGGCCCAAGCAGAACGCAAAAGGAAATCCCCAGGCGTCGCATCAACCCCCCCACCGAAAGCCAAAAAAGCAAAGGTTGATGTGACGAAGAAGCCTCCTCTG
AAAATTATCCACCCCTCATCCCACCCAAAACAGCGAGCTCCCCCATCAGTCCAACAAAAATCCAGCAGCTCTCAACCTCCCCGCCCTTCTTCGCCCATCCCAATCACCCC
ACCTTCCCCCAACCTCTCTCCCCGCCATTCACCTCTCCCCCACATTCTTTCCCCTATTCACGTCCCTCACCTTTCCCCCGATCGCCTACCCCTCCGCCTACAAAATCGAC
TTCACCCCTTCCTTCTAAAGCATCCTCACCCCGACAACAAGAGCCTCTTTCGCCTCTCATTCTTTTACCCATCGTGGACCTTACCAACCTTCGCCATGACCAGCCCACGA
TCGCCACTGAGGTCGTTGAGGTTCCCCCCTCATCACTCACCCAAAGAGGACGCCCTTCCCCCACCATCACCACCGCCACCTGTTATGGATACCGCGGAAAATGTTGATGA
CCCACACGTTAAGGACAAAAACGCTATCCTTAATGAAGTTGGCGAGACAGCTTCTTCGGTGCATACACCCACCGCCCAACCTTCCATCCCACCTGGCGAAGATGAAGATT
TGGGCGACATATTAGGTTCCCTTGTGTGTAAGCCAATGATGGAACACTTTGAACAAATATTGGCTAATCAAGGGGACATGGCGACGCAACTTCAGATTTTGCAAAATAAA
GTTGATCAGTTGCAACGTCCCAACGCCGAAAACCCTATGACGCCCGCTTCTCAAGACGACATAAACATTTTGCGAAACGAAATGCGGGACCTTTCGGCGAACAACGCCCA
GATCTCCACCGCCGTATCCAACCTCTCCACTTCCGTCTTCCACCTCACTTCCTGGTCATGGCCCAAGCTGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCTGAATCAGAACGCCCCACATCGCGAGGAAGTGGCGAACAAAAAACGCCACAAGGTAGAGGTTGACAACCTGGATCAAGATGCGGTGCTCGAGTTTCTCAACGA
ACGAGTAAGAAAGAGGAAGGACGCTCACATCAAAAGGACTAAAGAAGCTCGTCGCCGAAAGAACGAGCACTGCCTTAAAAAGGATGGAAGAAAAGAAAAGGATGCTCGCA
GAACTGAGCGAGCAAGTGGCGGAGCTCCCTGCGAAAGTAAATCGCTGGAGCCTGAAAAGAACCTCGAGGTGATCGCAGAAGAACTCGAGGAGGAGCTGGAGGCGATGAGC
CCGCTTGATGATAGGCCACCGCCAAGAAAACCAAGGAAGGTCGCAGGACCATCAAGAGGACAGAAGAAAACTGGGCGTTCTGGACCTGAAGAGCGCCCATCAGGCGGCGA
CACCATGAGCAAAACACCCTCCATTAACTCCCTCATCAAGGTTGAGAAAGGGCTGTTTCCGTTCAATGGTCAACTTCCAGATTTCCTCTTCGCGCCAATCCAGGCATTCG
GATGGAAGTCATTCTTCAAGGGGCACACCAAAGTAAGATTAGGGGTGGTGGAGAAGTTTTATGCGGCTAAGCTCAACGCTGAAGAGTTTAGCGTACAAATCAGCGGAAAA
ACAGTGAGTTTCAGCGCAGAGGCCATCAACGCATTGTATGATTTGTCCAATGATGTTGAGATGCCAGGGCAAATATACGTAGACAGTCCTACGAAGAGGATGGCCCGTGA
AGCGCTGGAAGTCATCGCATGGCATGGGGCTGCATGGGAAGTAACGCCAACGGGGAAGTATCAGTTATATCCACATCAGCTAACCACTGAGGCGAGCGTGTGGTTGTTTT
TTATCAAAAAGAAGATCTTCCCAATGCGCCATGATAGCACCATCAACTTAAAGTCAGCAATGCTACTCTATTGTATCCTGGCGAAGAAGCGCGTCAACCTTGGCGATTTT
ATAGCCACATCCATTCTGTTTTGGATGCGAGCTCCCAAAGGCGCCATGCCCTTCCCTTCAACCATTGAGGCCCTTTGCCTTAAAGTTGTGCCATTCTTATCCGCCATCCA
AACAATTTCAATACCAGGCAGGCTGTGTAATCAAATGGCTTTAAACCGCATGATTACTTTCTATGGACATAAGGAAATGGAAAGGAGGGCAAAGACATTGGGCGACACAC
CTGAAGGAATGGCCCAAGCAGAACGCAAAAGGAAATCCCCAGGCGTCGCATCAACCCCCCCACCGAAAGCCAAAAAAGCAAAGGTTGATGTGACGAAGAAGCCTCCTCTG
AAAATTATCCACCCCTCATCCCACCCAAAACAGCGAGCTCCCCCATCAGTCCAACAAAAATCCAGCAGCTCTCAACCTCCCCGCCCTTCTTCGCCCATCCCAATCACCCC
ACCTTCCCCCAACCTCTCTCCCCGCCATTCACCTCTCCCCCACATTCTTTCCCCTATTCACGTCCCTCACCTTTCCCCCGATCGCCTACCCCTCCGCCTACAAAATCGAC
TTCACCCCTTCCTTCTAAAGCATCCTCACCCCGACAACAAGAGCCTCTTTCGCCTCTCATTCTTTTACCCATCGTGGACCTTACCAACCTTCGCCATGACCAGCCCACGA
TCGCCACTGAGGTCGTTGAGGTTCCCCCCTCATCACTCACCCAAAGAGGACGCCCTTCCCCCACCATCACCACCGCCACCTGTTATGGATACCGCGGAAAATGTTGATGA
CCCACACGTTAAGGACAAAAACGCTATCCTTAATGAAGTTGGCGAGACAGCTTCTTCGGTGCATACACCCACCGCCCAACCTTCCATCCCACCTGGCGAAGATGAAGATT
TGGGCGACATATTAGGTTCCCTTGTGTGTAAGCCAATGATGGAACACTTTGAACAAATATTGGCTAATCAAGGGGACATGGCGACGCAACTTCAGATTTTGCAAAATAAA
GTTGATCAGTTGCAACGTCCCAACGCCGAAAACCCTATGACGCCCGCTTCTCAAGACGACATAAACATTTTGCGAAACGAAATGCGGGACCTTTCGGCGAACAACGCCCA
GATCTCCACCGCCGTATCCAACCTCTCCACTTCCGTCTTCCACCTCACTTCCTGGTCATGGCCCAAGCTGAAATGA
Protein sequenceShow/hide protein sequence
MALNQNAPHREEVANKKRHKVEVDNLDQDAVLEFLNERVRKRKDAHIKRTKEARRRKNEHCLKKDGRKEKDARRTERASGGAPCESKSLEPEKNLEVIAEELEEELEAMS
PLDDRPPPRKPRKVAGPSRGQKKTGRSGPEERPSGGDTMSKTPSINSLIKVEKGLFPFNGQLPDFLFAPIQAFGWKSFFKGHTKVRLGVVEKFYAAKLNAEEFSVQISGK
TVSFSAEAINALYDLSNDVEMPGQIYVDSPTKRMAREALEVIAWHGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPMRHDSTINLKSAMLLYCILAKKRVNLGDF
IATSILFWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIPGRLCNQMALNRMITFYGHKEMERRAKTLGDTPEGMAQAERKRKSPGVASTPPPKAKKAKVDVTKKPPL
KIIHPSSHPKQRAPPSVQQKSSSSQPPRPSSPIPITPPSPNLSPRHSPLPHILSPIHVPHLSPDRLPLRLQNRLHPFLLKHPHPDNKSLFRLSFFYPSWTLPTFAMTSPR
SPLRSLRFPPHHSPKEDALPPPSPPPPVMDTAENVDDPHVKDKNAILNEVGETASSVHTPTAQPSIPPGEDEDLGDILGSLVCKPMMEHFEQILANQGDMATQLQILQNK
VDQLQRPNAENPMTPASQDDINILRNEMRDLSANNAQISTAVSNLSTSVFHLTSWSWPKLK