CuGenDBv2

PI0025053 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0025053
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Gag/pol protein
Genome location: chr04:22160683..22162575
RNA-Seq Expression: PI0025053
Synteny: PI0025053
Gene Ontology terms: NA
InterPro domains: NA
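
The genome location above is a compact `sequence:start..end` string. As a minimal sketch, it can be split into its parts and the span length computed. The `Location` type and `parse_location` helper are hypothetical names introduced here, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr04:22160683..22162575'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr04:22160683..22162575")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span is 1893 bp.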


Homology
GenBank top hits (e value, % identity, alignment)
KAA0041264.1 hypothetical protein E6C27_scaffold128G002490 [Cucumis melo var. makuwa] (e value 2.3e-36, 45.69% identity)
Query:  IRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF
        IR  VV  FY A +N EE   ++  K V F  +AINALY L NN    G L  +NP  R  ++ALE I WPG  W+  PT KYQL+P+ L TE SVWL F
Subjt:  IRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF

Query:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIPGGLCNQMALNRMITFHGHK
        IKK I PTRHDSTI++E  MLLY      + N  E+    ++AW++ P GA PF    + L +KA P L   Q + +  G+C    L+R IT H +K
Subjt:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIPGGLCNQMALNRMITFHGHK

KAA0048500.1 protein MNN4-like [Cucumis melo var. makuwa] (e value 6.1e-45, 39.39% identity)
Query:  EAVKVALKRREEKKKMFAEL-----SEQVAELPAKVRALEPERNLDTIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGPEEHL
        E  KVA K +E+ +K+   L       Q  +  A+ +    E+  D  ++EFE+ELE +SPL+D      P+K R + G        + K  +   E   
Subjt:  EAVKVALKRREEKKKMFAEL-----SEQVAELPAKVRALEPERNLDTIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGPEEHL

Query:  SGGDTISKTPSINSLIKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDL-PNNVETP
        S  + +     +     +EKG+FPF GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N E     + GK V+F  + +N LY L    VE P
Subjt:  SGGDTISKTPSINSLIKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDL-PNNVETP

Query:  GQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM
               P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVWL FIKK + PTRHD+TI+LE  MLLYCI+ +  +N+ E+I   I AW+
Subjt:  GQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM

KAA0049609.1 transposase [Cucumis melo var. makuwa] (e value 1.3e-34, 48.13% identity)
Query:  ISGKTVSFNTEAINALYDLPNNVET-PGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML
        +  + V F TE IN LYDLPN++   PGQ  + +  +  A++ ++ I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFNTEAINALYDLPNNVET-PGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAV-PFLSVIQTISIPGGLCNQMALNRMITFHGHKEMERRAKTSG
        LYCI AKK  NLG ++  + L+WMR PK A PFP+TV+ LCLK +      I  I + GG CN      + T      ++R+A TSG
Subjt:  LYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAV-PFLSVIQTISIPGGLCNQMALNRMITFHGHKEMERRAKTSG

KAA0054837.1 hypothetical protein E6C27_scaffold406G00150 [Cucumis melo var. makuwa] (e value 4.0e-36, 35.79% identity)
Query:  EEF-EEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEEHLSGGDTISKTPSINSL-----IKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHT
        +EF EE+   +SPL++    R+ R+      G+  + R   E+     ++      + S        VEKG F F  QL  FL  PI+A GW+ F +G  
Subjt:  EEF-EEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEEHLSGGDTISKTPSINSL-----IKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHT

Query:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF
         IR GVV+ FY  K++ E+    +  +                             P+    +EALE +AW    W+VT   KY+L+ H LTTEASVWL 
Subjt:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF

Query:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTIS-IPGGLCN
        FIKKK+ PTRHD+TI+ E  MLLYCI+ +  V++ E+I   I AW++ P+GA PFP  +E LCL++   L     I+ +  G+CN
Subjt:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTIS-IPGGLCN

KAA0062900.1 gag/pol protein [Cucumis melo var. makuwa] (e value 5.3e-41, 51.11% identity)
Query:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL
        KIR+ VV KFY  K N  +  + I  +   FN E IN LY+ PN+ E  GQ  V   TK +A+EAL+ +AWPG   EV P   +YQLYPH LTT+A+VW+
Subjt:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIP
        FF K KIFPT +DSTI+++  ++LYCI+ KK +NL E+I  +IL WM  PK AMPFPS +E LCLK +P L      +IP
Subjt:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TZE0 Protein MNN4-like (e value 3.0e-45, 39.39% identity)
Query:  EAVKVALKRREEKKKMFAEL-----SEQVAELPAKVRALEPERNLDTIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGPEEHL
        E  KVA K +E+ +K+   L       Q  +  A+ +    E+  D  ++EFE+ELE +SPL+D      P+K R + G        + K  +   E   
Subjt:  EAVKVALKRREEKKKMFAEL-----SEQVAELPAKVRALEPERNLDTIAEEFEEELEAMSPLDD---GPSPRKSREVAGPS----RGRKKLGRSGPEEHL

Query:  SGGDTISKTPSINSLIKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDL-PNNVETP
        S  + +     +     +EKG+FPF GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N E     + GK V+F  + +N LY L    VE P
Subjt:  SGGDTISKTPSINSLIKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDL-PNNVETP

Query:  GQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM
               P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVWL FIKK + PTRHD+TI+LE  MLLYCI+ +  +N+ E+I   I AW+
Subjt:  GQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWM

A0A5A7U806 Transposase (e value 6.2e-35, 48.13% identity)
Query:  ISGKTVSFNTEAINALYDLPNNVET-PGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML
        +  + V F TE IN LYDLPN++   PGQ  + +  +  A++ ++ I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFNTEAINALYDLPNNVET-PGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAV-PFLSVIQTISIPGGLCNQMALNRMITFHGHKEMERRAKTSG
        LYCI AKK  NLG ++  + L+WMR PK A PFP+TV+ LCLK +      I  I + GG CN      + T      ++R+A TSG
Subjt:  LYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAV-PFLSVIQTISIPGGLCNQMALNRMITFHGHKEMERRAKTSG

A0A5A7V6M5 Gag/pol protein (e value 2.6e-41, 51.11% identity)
Query:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL
        KIR+ VV KFY  K N  +  + I  +   FN E IN LY+ PN+ E  GQ  V   TK +A+EAL+ +AWPG   EV P   +YQLYPH LTT+A+VW+
Subjt:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIP
        FF K KIFPT +DSTI+++  ++LYCI+ KK +NL E+I  +IL WM  PK AMPFPS +E LCLK +P L      +IP
Subjt:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIP

A0A5D3CW17 Uncharacterized protein (e value 1.1e-36, 45.69% identity)
Query:  IRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF
        IR  VV  FY A +N EE   ++  K V F  +AINALY L NN    G L  +NP  R  ++ALE I WPG  W+  PT KYQL+P+ L TE SVWL F
Subjt:  IRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFF

Query:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIPGGLCNQMALNRMITFHGHK
        IKK I PTRHDSTI++E  MLLY      + N  E+    ++AW++ P GA PF    + L +KA P L   Q + +  G+C    L+R IT H +K
Subjt:  IKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTISIPGGLCNQMALNRMITFHGHK

A0A5D3DVQ6 Uncharacterized protein (e value 1.9e-36, 35.79% identity)
Query:  EEF-EEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEEHLSGGDTISKTPSINSL-----IKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHT
        +EF EE+   +SPL++    R+ R+      G+  + R   E+     ++      + S        VEKG F F  QL  FL  PI+A GW+ F +G  
Subjt:  EEF-EEELEAMSPLDDGPSPRKSREVAGPSRGRKKLGRSGPEEHLSGGDTISKTPSINSL-----IKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHT

Query:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF
         IR GVV+ FY  K++ E+    +  +                             P+    +EALE +AW    W+VT   KY+L+ H LTTEASVWL 
Subjt:  KIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNVETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF

Query:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTIS-IPGGLCN
        FIKKK+ PTRHD+TI+ E  MLLYCI+ +  V++ E+I   I AW++ P+GA PFP  +E LCL++   L     I+ +  G+CN
Subjt:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFPSTVETLCLKAVPFLSVIQTIS-IPGGLCN

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCCGGATCTTCAAGTCAAGATAAGATTAGAGTACGTTCTGATGGACCTGAATCAGAACGTCCCACATCGCGGGGAAGTGGCGAACACAAAACGCCACAATACATCCA
CCTGAAGCACGGCAAGGGGAAGGTTTTACTGAAACCTCCGTCGGAAGCAGCTGACAATTTCTTGGAAGTTGAAGTTGATAATCAGGATACAGAGGCGGTGCTCGAATTTC
TTAATGAACGAGCAAAGAAGAGGAAGGAAGCCCACATCAAAAGGACTAAGGAAGCTCGTCGCCGAAAGGACGAGCGTGAGCGAACGAAAGTTGACACAATTCGAAAGGCG
AAGAGCACACTAGAAATACGCTCACCGCCTAACGAGGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAAGTCTCATTCACTAAAACGAGGAAAACAATCGA
GGCAGTTAAGGTTGCCTTAAAAAGGAGAGAGGAAAAGAAAAAGATGTTCGCAGAATTAAGCGAACAAGTGGCGGAGCTCCCCGCGAAAGTAAGAGCATTGGAGCCGGAAA
GAAACCTCGACACAATCGCAGAAGAATTCGAGGAAGAGCTGGAGGCGATGAGCCCACTTGATGACGGGCCATCGCCAAGAAAATCAAGGGAGGTCGCAGGACCATCAAGA
GGAAGAAAGAAACTTGGACGTTCTGGACCTGAAGAACACCTATCAGGCGGCGACACCATAAGCAAGACACCCTCTATCAACTCTCTCATCAAAGTTGAAAAGGGGTTGTT
TCCGTTCAATGGTCAACTTCCTGACTTCCTCTACGCGCCAATTCAGGCGTTCGGATGGAAGTCATTTTTCAAAGGGCACACCAAGATACGATTAGGAGTGGTAGAAAAAT
TTTATGCGGCTAAACTCAACGCTGAAGAGTTTAGCGTACAAATAAGTGGAAAGACAGTGAGTTTCAACACGGAGGCCATCAATGCGTTGTATGATTTGCCCAACAACGTT
GAAACCCCAGGGCAATTATACGTAGATAATCCTACGAAGAGGATGGCCCGTGAAGCGTTGGAAGCCATCGCATGGCCTGGGGCCGCATGGGAAGTAACGCCAACAGGGAA
GTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGTGTGTGGTTGTTCTTTATCAAGAAGAAGATCTTCCCAACACGCCATGATAGCACCATCAATTTGGAGTCAG
CGATGCTACTCTATTGCATCCTAGCAAAGAAGCGTGTTAACCTTGGCGAACTTATAGCCACATCCATTCTGGCATGGATGCGAGCTCCCAAAGGCGCGATGCCCTTCCCG
TCAACCGTTGAGACCCTTTGCCTTAAAGCTGTGCCATTCTTATCTGTCATCCAAACCATCTCAATACCTGGCGGACTGTGTAATCAAATGGCCTTAAACCGCATGATTAC
TTTCCATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATCAGGCGACACGCCTGAAGGAATGGCCCTAGCAGAAAGAAAAAGAAAGGCCCCAGTCATCGCATCAACCC
CCCCACCTAAAGCCAAAAAAACAAAGGTTCTTGTGACGAAGCAACTTCCTCTGAAATTTCCCCACTCCTCATCTCGCCCAATACAGCGAGCTCCACCATCAGTCCAAAAT
TCCAACAACTCCAATCCTCCCCGCTCTTCTTCGCCCATTCCAATCACCCCATCATTCCCAACTATCTCTCCCCACCATTCACCTCTCCCCCACATTCGTTCCCCACCAAC
ATCCCTCACCTTTCCCCACGACCGCCTACACCTCCACCCACAAAATCCACTTCCCCCCTTCACTCCAAATCACCCTCACCAAGGCGAGCTGAACCCCTTTCGCCTTTTCT
TCTTTCGCCCATCATGGACCTGA
mRNA sequence
ATGGCCGGATCTTCAAGTCAAGATAAGATTAGAGTACGTTCTGATGGACCTGAATCAGAACGTCCCACATCGCGGGGAAGTGGCGAACACAAAACGCCACAATACATCCA
CCTGAAGCACGGCAAGGGGAAGGTTTTACTGAAACCTCCGTCGGAAGCAGCTGACAATTTCTTGGAAGTTGAAGTTGATAATCAGGATACAGAGGCGGTGCTCGAATTTC
TTAATGAACGAGCAAAGAAGAGGAAGGAAGCCCACATCAAAAGGACTAAGGAAGCTCGTCGCCGAAAGGACGAGCGTGAGCGAACGAAAGTTGACACAATTCGAAAGGCG
AAGAGCACACTAGAAATACGCTCACCGCCTAACGAGGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAAGTCTCATTCACTAAAACGAGGAAAACAATCGA
GGCAGTTAAGGTTGCCTTAAAAAGGAGAGAGGAAAAGAAAAAGATGTTCGCAGAATTAAGCGAACAAGTGGCGGAGCTCCCCGCGAAAGTAAGAGCATTGGAGCCGGAAA
GAAACCTCGACACAATCGCAGAAGAATTCGAGGAAGAGCTGGAGGCGATGAGCCCACTTGATGACGGGCCATCGCCAAGAAAATCAAGGGAGGTCGCAGGACCATCAAGA
GGAAGAAAGAAACTTGGACGTTCTGGACCTGAAGAACACCTATCAGGCGGCGACACCATAAGCAAGACACCCTCTATCAACTCTCTCATCAAAGTTGAAAAGGGGTTGTT
TCCGTTCAATGGTCAACTTCCTGACTTCCTCTACGCGCCAATTCAGGCGTTCGGATGGAAGTCATTTTTCAAAGGGCACACCAAGATACGATTAGGAGTGGTAGAAAAAT
TTTATGCGGCTAAACTCAACGCTGAAGAGTTTAGCGTACAAATAAGTGGAAAGACAGTGAGTTTCAACACGGAGGCCATCAATGCGTTGTATGATTTGCCCAACAACGTT
GAAACCCCAGGGCAATTATACGTAGATAATCCTACGAAGAGGATGGCCCGTGAAGCGTTGGAAGCCATCGCATGGCCTGGGGCCGCATGGGAAGTAACGCCAACAGGGAA
GTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGTGTGTGGTTGTTCTTTATCAAGAAGAAGATCTTCCCAACACGCCATGATAGCACCATCAATTTGGAGTCAG
CGATGCTACTCTATTGCATCCTAGCAAAGAAGCGTGTTAACCTTGGCGAACTTATAGCCACATCCATTCTGGCATGGATGCGAGCTCCCAAAGGCGCGATGCCCTTCCCG
TCAACCGTTGAGACCCTTTGCCTTAAAGCTGTGCCATTCTTATCTGTCATCCAAACCATCTCAATACCTGGCGGACTGTGTAATCAAATGGCCTTAAACCGCATGATTAC
TTTCCATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATCAGGCGACACGCCTGAAGGAATGGCCCTAGCAGAAAGAAAAAGAAAGGCCCCAGTCATCGCATCAACCC
CCCCACCTAAAGCCAAAAAAACAAAGGTTCTTGTGACGAAGCAACTTCCTCTGAAATTTCCCCACTCCTCATCTCGCCCAATACAGCGAGCTCCACCATCAGTCCAAAAT
TCCAACAACTCCAATCCTCCCCGCTCTTCTTCGCCCATTCCAATCACCCCATCATTCCCAACTATCTCTCCCCACCATTCACCTCTCCCCCACATTCGTTCCCCACCAAC
ATCCCTCACCTTTCCCCACGACCGCCTACACCTCCACCCACAAAATCCACTTCCCCCCTTCACTCCAAATCACCCTCACCAAGGCGAGCTGAACCCCTTTCGCCTTTTCT
TCTTTCGCCCATCATGGACCTGA
Protein sequence
MAGSSSQDKIRVRSDGPESERPTSRGSGEHKTPQYIHLKHGKGKVLLKPPSEAADNFLEVEVDNQDTEAVLEFLNERAKKRKEAHIKRTKEARRRKDERERTKVDTIRKA
KSTLEIRSPPNEVAELHKKISDKLAQVSFTKTRKTIEAVKVALKRREEKKKMFAELSEQVAELPAKVRALEPERNLDTIAEEFEEELEAMSPLDDGPSPRKSREVAGPSR
GRKKLGRSGPEEHLSGGDTISKTPSINSLIKVEKGLFPFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAEEFSVQISGKTVSFNTEAINALYDLPNNV
ETPGQLYVDNPTKRMAREALEAIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILAWMRAPKGAMPFP
STVETLCLKAVPFLSVIQTISIPGGLCNQMALNRMITFHGHKEMERRAKTSGDTPEGMALAERKRKAPVIASTPPPKAKKTKVLVTKQLPLKFPHSSSRPIQRAPPSVQN
SNNSNPPRSSSPIPITPSFPTISPHHSPLPHIRSPPTSLTFPHDRLHLHPQNPLPPFTPNHPHQGELNPFRLFFFRPSWT
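
The first codons of the CDS above (ATG GCC GGA TCT TCA AGT ...) correspond to the first residues of the protein sequence (MAGSSS...) under the standard genetic code. As a minimal sketch of that relationship, the snippet below translates a nucleotide string codon by codon. The `translate` function and `CODON_TABLE` name are illustrative choices, not part of the database, and the use of the standard nuclear code is an assumption that this record does not state.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and whitespace so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed above.
print(translate("ATGGCCGGATCTTCAAGTCAAGATAAGATTAGA"))
```

Pasting the full CDS block into `translate` would allow a direct comparison with the listed protein sequence.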