CuGenDBv2

PI0016699 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0016699
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Gag/pol protein
Genome location: chr03:9150421..9153081
RNA-Seq Expression: PI0016699
Synteny: PI0016699
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0041264.1 hypothetical protein E6C27_scaffold128G002490 [Cucumis melo var. makuwa]; e-value 4.4e-33; identity 43.65%
Query:  IRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-F
        IR  VV  FY A +N  E   ++  K V F  +AINALY L NN    G +  ++P  R  ++ALE I WPG  W+  PT KYQL+P+ L TE SVW  F
Subjt:  IRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-F

Query:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIPGGLCNQMALNRMITFHGHK
        IKK I PTRHDSTI++E  MLLY      + N  E+    ++AW++ P GA PFL   + L +K  P L   Q + +  G+C    L+R IT H +K
Subjt:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIPGGLCNQMALNRMITFHGHK

KAA0048500.1 protein MNN4-like [Cucumis melo var. makuwa]; e-value 1.1e-39; identity 37.67%
Query:  AVKATLKRKEEKKKMF-AELSEQVAELPAKARALEPERNLEAIAEEFEDELEAMSPLDD---GPSPRKPREVAGPS----RGRKKVGRSGPEERPSCGDT
        A KA  K ++ KK++   ++  Q  +  A+ +    E+  +  ++EFE ELE +SPL+D      P+K R + G        + K  +   E + S  + 
Subjt:  AVKATLKRKEEKKKMF-AELSEQVAELPAKARALEPERNLEAIAEEFEDELEAMSPLDD---GPSPRKPREVAGPS----RGRKKVGRSGPEERPSCGDT

Query:  ISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDL-PNNIETPGQIYV
        +     +     +EKG+F F GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N       + GK V+F  + +N LY L    +E P     
Subjt:  ISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDL-PNNIETPGQIYV

Query:  DSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWI
          P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVW  FIKK + PTRHD+TI+LE  MLLYCI+ +  +N+ E+I   I AW+
Subjt:  DSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWI

KAA0049609.1 transposase [Cucumis melo var. makuwa]; e-value 1.3e-29; identity 42.45%
Query:  ISGKTVSFSAEAINALYDLPNNIET-PGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVW-FFIKKKIFPTRHDSTINLESAML
        +  + V F+ E IN LYDLPN++   PGQ  +    +  A++ +++I WP A    TPT + QL+PHQLT EA+VW FFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFSAEAINALYDLPNNIET-PGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVW-FFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVV-PFLSAIQTISIPGGLCNQMALNRMITFHGHKEMERRAKT------LGDTPKGMA
        LYCI AKK  NLG ++  + L+W+R PK A PF +T++ LCLK +      I  I + GG CN      + T      ++R+A T      LG   + + 
Subjt:  LYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVV-PFLSAIQTISIPGGLCNQMALNRMITFHGHKEMERRAKT------LGDTPKGMA

Query:  QAERKKKSPVVA
           RKK+   VA
Subjt:  QAERKKKSPVVA

KAA0054837.1 hypothetical protein E6C27_scaffold406G00150 [Cucumis melo var. makuwa]; e-value 9.8e-33; identity 35.66%
Query:  EEF-EDELEAMSPLDDGPSPRKPREVAGPSRGR------KKVGRSGPEERPSCGDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGH
        +EF E++   +SPL++    R+PR+      G+      KK      EE     D +         + VEKG F F  QL  FL  PI+A GW+ F +G 
Subjt:  EEF-EDELEAMSPLDDGPSPRKPREVAGPSRGR------KKVGRSGPEERPSCGDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGH

Query:  TKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF
          IR GVV+ FY  K++  +    +  +                             P+    +EALE +AW    W+VT   KY+L+ H LTTEASVW 
Subjt:  TKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF

Query:  -FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFL-SAIQTISIPGGLCN
         FIKKK+ PTRHD+TI+ E  MLLYCI+ +  V++ E+I   I AW++ P+GA PF   IE LCL+    L  + Q   +  G+CN
Subjt:  -FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFL-SAIQTISIPGGLCN

KAA0062900.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 1.7e-37; identity 50%
Query:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVW-
        KIR+ VV KFY  K N ++  + I  +   F+ E IN LY+ PN+ E  GQ  V   TK +A+EAL+V+AWPG   EV P   +YQLYPH LTT+A+VW 
Subjt:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVW-

Query:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIP
        FF K KIFPT +DSTI+++  ++LYCI+ KK +NL E+I  +IL W+  PK AMPF S +E LCLK +P L      +IP
Subjt:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIP

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TZE0 Protein MNN4-like; e-value 5.3e-40; identity 37.67%
Query:  AVKATLKRKEEKKKMF-AELSEQVAELPAKARALEPERNLEAIAEEFEDELEAMSPLDD---GPSPRKPREVAGPS----RGRKKVGRSGPEERPSCGDT
        A KA  K ++ KK++   ++  Q  +  A+ +    E+  +  ++EFE ELE +SPL+D      P+K R + G        + K  +   E + S  + 
Subjt:  AVKATLKRKEEKKKMF-AELSEQVAELPAKARALEPERNLEAIAEEFEDELEAMSPLDD---GPSPRKPREVAGPS----RGRKKVGRSGPEERPSCGDT

Query:  ISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDL-PNNIETPGQIYV
        +     +     +EKG+F F GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N       + GK V+F  + +N LY L    +E P     
Subjt:  ISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDL-PNNIETPGQIYV

Query:  DSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWI
          P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVW  FIKK + PTRHD+TI+LE  MLLYCI+ +  +N+ E+I   I AW+
Subjt:  DSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWI

A0A5A7U806 Transposase; e-value 6.4e-30; identity 42.45%
Query:  ISGKTVSFSAEAINALYDLPNNIET-PGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVW-FFIKKKIFPTRHDSTINLESAML
        +  + V F+ E IN LYDLPN++   PGQ  +    +  A++ +++I WP A    TPT + QL+PHQLT EA+VW FFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFSAEAINALYDLPNNIET-PGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVW-FFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVV-PFLSAIQTISIPGGLCNQMALNRMITFHGHKEMERRAKT------LGDTPKGMA
        LYCI AKK  NLG ++  + L+W+R PK A PF +T++ LCLK +      I  I + GG CN      + T      ++R+A T      LG   + + 
Subjt:  LYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVV-PFLSAIQTISIPGGLCNQMALNRMITFHGHKEMERRAKT------LGDTPKGMA

Query:  QAERKKKSPVVA
           RKK+   VA
Subjt:  QAERKKKSPVVA

A0A5A7V6M5 Gag/pol protein; e-value 8.4e-38; identity 50%
Query:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVW-
        KIR+ VV KFY  K N ++  + I  +   F+ E IN LY+ PN+ E  GQ  V   TK +A+EAL+V+AWPG   EV P   +YQLYPH LTT+A+VW 
Subjt:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTG-KYQLYPHQLTTEASVW-

Query:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIP
        FF K KIFPT +DSTI+++  ++LYCI+ KK +NL E+I  +IL W+  PK AMPF S +E LCLK +P L      +IP
Subjt:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIP

A0A5D3CW17 Uncharacterized protein; e-value 2.1e-33; identity 43.65%
Query:  IRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-F
        IR  VV  FY A +N  E   ++  K V F  +AINALY L NN    G +  ++P  R  ++ALE I WPG  W+  PT KYQL+P+ L TE SVW  F
Subjt:  IRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF-F

Query:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIPGGLCNQMALNRMITFHGHK
        IKK I PTRHDSTI++E  MLLY      + N  E+    ++AW++ P GA PFL   + L +K  P L   Q + +  G+C    L+R IT H +K
Subjt:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFLSAIQTISIPGGLCNQMALNRMITFHGHK

A0A5D3DVQ6 Uncharacterized protein; e-value 4.8e-33; identity 35.66%
Query:  EEF-EDELEAMSPLDDGPSPRKPREVAGPSRGR------KKVGRSGPEERPSCGDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGH
        +EF E++   +SPL++    R+PR+      G+      KK      EE     D +         + VEKG F F  QL  FL  PI+A GW+ F +G 
Subjt:  EEF-EDELEAMSPLDDGPSPRKPREVAGPSRGR------KKVGRSGPEERPSCGDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGH

Query:  TKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF
          IR GVV+ FY  K++  +    +  +                             P+    +EALE +AW    W+VT   KY+L+ H LTTEASVW 
Subjt:  TKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNNIETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWF

Query:  -FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFL-SAIQTISIPGGLCN
         FIKKK+ PTRHD+TI+ E  MLLYCI+ +  V++ E+I   I AW++ P+GA PF   IE LCL+    L  + Q   +  G+CN
Subjt:  -FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFLSTIEALCLKVVPFL-SAIQTISIPGGLCN

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTCGGATCTTCAAGTCATCAAGATCAGATTAGAGTACGTTCTGATGGCCCTGAATCAGAACGCCCAACATCACGAGGAAATGGCGAACAAGAAATGCCACAATACAC
CCACTTGAAACACGGTAAGGGAAAGGTCTTACTGAAACCCCCGCCGGAAGCAGCTGACAATTTCTTGGAGGTTGAGGTTGATAATCAGGATACAGAGGCGGTGCTCGAAT
TTCTTAATAAACGAGCAAAGAAGAGGAAGGAAGCCCACATCAAAAGAACTAAGGAAGCTCGTCTCCGAAAGGACGTGCATGAGCGAACAAAAGTTGATGCAATTCGAAAG
GCAAAGAGCACACTAGAAATACGCTCACCGCCTAACGAGGTTGCGGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAAGTCTCGTTCGCTAAAACGAGGAAAACAAT
CGAGGCAGTCAAGGCTACCTTAAAAAGGAAAGAAGAAAAGAAAAAGATGTTCGCAGAGTTGAGCGAACAAGTGGCGGAGCTCCCCGCGAAAGCAAGAGCATTGGAGCCTG
AAAGAAACCTCGAAGCAATCGCGGAAGAATTCGAGGATGAGCTGGAGGCGATGAGCCCACTTGATGACGGACCATCGCCAAGAAAACCACGGGAGGTTGCAGGACCATCA
AGAGGAAGGAAGAAAGTTGGGCGTTCTGGACCTGAAGAACGCCCATCATGCGGCGACACCATAAGCAAGACACCCTCTATCAACTCTCTCATCAAAGTCGAAAAAGGGTT
GTTTTCGTTCAATGGTCAACTCCCAGACTTCCTCTACGCGCCAATCCAGGCGTTTGGATGGAAGTCATTTTTTAAGGGGCACACCAAGATAAGATTAGGAGTGGTAGAAA
AGTTTTACGCGGCTAAGCTCAACGCTGCAGAGTTTAGCGTACAAATAAGTGGGAAGACAGTGAGTTTTAGCGCGGAGGCCATCAATGCGTTGTATGATTTGCCCAATAAC
ATTGAAACCCCAGGGCAAATATACGTAGACAGTCCTACAAAGAGGATGGCCCGTGAAGCGTTGGAAGTCATTGCATGGCCTGGGGCCGCATGGGAGGTAACGCCAACAGG
GAAGTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGCGTGTGGTTCTTTATCAAGAAGAAGATCTTCCCAACACGCCATGATAGCACCATCAATTTAGAGTCAG
CGATGCTACTCTATTGCATCTTAGCGAAGAAGCGTGTTAACCTTGGCGAACTTATAGTCACATCCATTCTAGCATGGATACGAGCTCCCAAAGGCGCGATGCCCTTCCTT
TCAACCATTGAGGCCCTTTGCCTTAAAGTTGTGCCATTCTTATCCGCCATCCAAACCATCTCAATACCAGGCGGGCTGTGTAATCAAATGGCCTTAAACCGCATGATTAC
TTTTCATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATTAGGCGACACACCTAAAGGAATGGCTCAAGCAGAAAGAAAAAAGAAATCCCCAGTCGTCGCATCAACCC
CCCACCTAAAGCCAAAAAAAACAAAGGTTCGTGCGACGAAGCAGCCTCCACTGAAATTTCTCCACTCCTCATCTCGCCCAATACAGCGAGCTCCCCCATCAGTTCAAAAT
TCCAGCAGCTCCAACCCTCCCCGCTCTTCTTCGCCCATTCCATTCACCCCACCATCACCAAACATCTCTCCCCGCCATTCACCTCTCCCCCACATTCGTTCCCCTACCAA
CATCCCCCACCTTTCCCCACGACTGCCTACACCTCTACCTACAAAATCTACTTCTCCCCTTCCTTCCAAATCACCCTCACCAAGGCGAGCTGAACCCCTTTCGCCTTTTC
TTCTTTCGCCCATCATGGACCTGACCACTCTCCGCCATGACCAACCCGCGACCAACACTGCGGTTGTTGAGGTTTCTTCGCCCATCACTCACCCAACCAACCGTCCTCTG
CAAACTTCTCCCATACTTCTAATCTCAGAAGAGGACGCCCCTCCCACCAACCAACTATCCCAACCATCACCACCATCGCCTATTATGGCCGCCGCGGAAAATGTTGATGA
CCCACACGTTAAGGACAAAAACCCCATCCTTAATGAAGTTGGCAAGACTGCTTCCTCTGCGCATACCCCCATCGCCCAACCTTCCACCGCACCTGGAGACGATGAAGATT
TTGGCGAATTGCTGGGTTCCCTTGTATGTAAGCCAATGATGGAGCAGTTCGAACACATTTTGGCTAACCAAGGGGATCAGGCGATGCAACTCCACAATTTGAAAACTCGA
GTTGACCAGTTGCAGCGTCCCAACGCCGAAAATCTTGTAACGCCCGCATCTCAAGACATCAATATCTTGCGAAGCGAAATGAGAGACCTTTCGGCGAACAACGCCCAGAT
CTCCACTGCGGTCTCCAACCTCTTCACTTCCGTCTCCCACCTTACTTCCTTGGTCTTGGCCCAATCAGAAATGATGCGGCAAATAGCTACGCGACATGATAGGAAGTTTC
GCACACAGATGGAATACACGTATGCAGCAATTGTGCAACGCGTGCCTGCCCCAATCATACTGCCAGATCTCGAAGCACCCTTCCCACCACTCACGCGTCCAGGCGATCCT
GCTCCCCGCCAAGACAACTAA
mRNA sequence
ATGGTCGGATCTTCAAGTCATCAAGATCAGATTAGAGTACGTTCTGATGGCCCTGAATCAGAACGCCCAACATCACGAGGAAATGGCGAACAAGAAATGCCACAATACAC
CCACTTGAAACACGGTAAGGGAAAGGTCTTACTGAAACCCCCGCCGGAAGCAGCTGACAATTTCTTGGAGGTTGAGGTTGATAATCAGGATACAGAGGCGGTGCTCGAAT
TTCTTAATAAACGAGCAAAGAAGAGGAAGGAAGCCCACATCAAAAGAACTAAGGAAGCTCGTCTCCGAAAGGACGTGCATGAGCGAACAAAAGTTGATGCAATTCGAAAG
GCAAAGAGCACACTAGAAATACGCTCACCGCCTAACGAGGTTGCGGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAAGTCTCGTTCGCTAAAACGAGGAAAACAAT
CGAGGCAGTCAAGGCTACCTTAAAAAGGAAAGAAGAAAAGAAAAAGATGTTCGCAGAGTTGAGCGAACAAGTGGCGGAGCTCCCCGCGAAAGCAAGAGCATTGGAGCCTG
AAAGAAACCTCGAAGCAATCGCGGAAGAATTCGAGGATGAGCTGGAGGCGATGAGCCCACTTGATGACGGACCATCGCCAAGAAAACCACGGGAGGTTGCAGGACCATCA
AGAGGAAGGAAGAAAGTTGGGCGTTCTGGACCTGAAGAACGCCCATCATGCGGCGACACCATAAGCAAGACACCCTCTATCAACTCTCTCATCAAAGTCGAAAAAGGGTT
GTTTTCGTTCAATGGTCAACTCCCAGACTTCCTCTACGCGCCAATCCAGGCGTTTGGATGGAAGTCATTTTTTAAGGGGCACACCAAGATAAGATTAGGAGTGGTAGAAA
AGTTTTACGCGGCTAAGCTCAACGCTGCAGAGTTTAGCGTACAAATAAGTGGGAAGACAGTGAGTTTTAGCGCGGAGGCCATCAATGCGTTGTATGATTTGCCCAATAAC
ATTGAAACCCCAGGGCAAATATACGTAGACAGTCCTACAAAGAGGATGGCCCGTGAAGCGTTGGAAGTCATTGCATGGCCTGGGGCCGCATGGGAGGTAACGCCAACAGG
GAAGTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGCGTGTGGTTCTTTATCAAGAAGAAGATCTTCCCAACACGCCATGATAGCACCATCAATTTAGAGTCAG
CGATGCTACTCTATTGCATCTTAGCGAAGAAGCGTGTTAACCTTGGCGAACTTATAGTCACATCCATTCTAGCATGGATACGAGCTCCCAAAGGCGCGATGCCCTTCCTT
TCAACCATTGAGGCCCTTTGCCTTAAAGTTGTGCCATTCTTATCCGCCATCCAAACCATCTCAATACCAGGCGGGCTGTGTAATCAAATGGCCTTAAACCGCATGATTAC
TTTTCATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATTAGGCGACACACCTAAAGGAATGGCTCAAGCAGAAAGAAAAAAGAAATCCCCAGTCGTCGCATCAACCC
CCCACCTAAAGCCAAAAAAAACAAAGGTTCGTGCGACGAAGCAGCCTCCACTGAAATTTCTCCACTCCTCATCTCGCCCAATACAGCGAGCTCCCCCATCAGTTCAAAAT
TCCAGCAGCTCCAACCCTCCCCGCTCTTCTTCGCCCATTCCATTCACCCCACCATCACCAAACATCTCTCCCCGCCATTCACCTCTCCCCCACATTCGTTCCCCTACCAA
CATCCCCCACCTTTCCCCACGACTGCCTACACCTCTACCTACAAAATCTACTTCTCCCCTTCCTTCCAAATCACCCTCACCAAGGCGAGCTGAACCCCTTTCGCCTTTTC
TTCTTTCGCCCATCATGGACCTGACCACTCTCCGCCATGACCAACCCGCGACCAACACTGCGGTTGTTGAGGTTTCTTCGCCCATCACTCACCCAACCAACCGTCCTCTG
CAAACTTCTCCCATACTTCTAATCTCAGAAGAGGACGCCCCTCCCACCAACCAACTATCCCAACCATCACCACCATCGCCTATTATGGCCGCCGCGGAAAATGTTGATGA
CCCACACGTTAAGGACAAAAACCCCATCCTTAATGAAGTTGGCAAGACTGCTTCCTCTGCGCATACCCCCATCGCCCAACCTTCCACCGCACCTGGAGACGATGAAGATT
TTGGCGAATTGCTGGGTTCCCTTGTATGTAAGCCAATGATGGAGCAGTTCGAACACATTTTGGCTAACCAAGGGGATCAGGCGATGCAACTCCACAATTTGAAAACTCGA
GTTGACCAGTTGCAGCGTCCCAACGCCGAAAATCTTGTAACGCCCGCATCTCAAGACATCAATATCTTGCGAAGCGAAATGAGAGACCTTTCGGCGAACAACGCCCAGAT
CTCCACTGCGGTCTCCAACCTCTTCACTTCCGTCTCCCACCTTACTTCCTTGGTCTTGGCCCAATCAGAAATGATGCGGCAAATAGCTACGCGACATGATAGGAAGTTTC
GCACACAGATGGAATACACGTATGCAGCAATTGTGCAACGCGTGCCTGCCCCAATCATACTGCCAGATCTCGAAGCACCCTTCCCACCACTCACGCGTCCAGGCGATCCT
GCTCCCCGCCAAGACAACTAA
Protein sequence
MVGSSSHQDQIRVRSDGPESERPTSRGNGEQEMPQYTHLKHGKGKVLLKPPPEAADNFLEVEVDNQDTEAVLEFLNKRAKKRKEAHIKRTKEARLRKDVHERTKVDAIRK
AKSTLEIRSPPNEVAELHKKISDKLAQVSFAKTRKTIEAVKATLKRKEEKKKMFAELSEQVAELPAKARALEPERNLEAIAEEFEDELEAMSPLDDGPSPRKPREVAGPS
RGRKKVGRSGPEERPSCGDTISKTPSINSLIKVEKGLFSFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNN
IETPGQIYVDSPTKRMAREALEVIAWPGAAWEVTPTGKYQLYPHQLTTEASVWFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIVTSILAWIRAPKGAMPFL
STIEALCLKVVPFLSAIQTISIPGGLCNQMALNRMITFHGHKEMERRAKTLGDTPKGMAQAERKKKSPVVASTPHLKPKKTKVRATKQPPLKFLHSSSRPIQRAPPSVQN
SSSSNPPRSSSPIPFTPPSPNISPRHSPLPHIRSPTNIPHLSPRLPTPLPTKSTSPLPSKSPSPRRAEPLSPFLLSPIMDLTTLRHDQPATNTAVVEVSSPITHPTNRPL
QTSPILLISEEDAPPTNQLSQPSPPSPIMAAAENVDDPHVKDKNPILNEVGKTASSAHTPIAQPSTAPGDDEDFGELLGSLVCKPMMEQFEHILANQGDQAMQLHNLKTR
VDQLQRPNAENLVTPASQDINILRSEMRDLSANNAQISTAVSNLFTSVSHLTSLVLAQSEMMRQIATRHDRKFRTQMEYTYAAIVQRVPAPIILPDLEAPFPPLTRPGDP
APRQDN