; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0001503 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0001503
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGag/pol protein
Genome locationchr09:9216210..9218100
RNA-Seq ExpressionPI0001503
SyntenyPI0001503
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048500.1 protein MNN4-like [Cucumis melo var. makuwa]1.6e-3937.46Show/hide
Query:  AVKAALKRKEEKKKMF-AELSEQVAELPAKARALEPERNLKAIAEEFEDELEAMSPLDD---GPPPRKPREVAGPS----RGRKKAGRSGPEECPSSGDT
        A KA  K ++ KK++   ++  Q  +  A+ +    E+     ++EFE ELE +SPL+D      P+K R + G        + K  +   E   S  + 
Subjt:  AVKAALKRKEEKKKMF-AELSEQVAELPAKARALEPERNLKAIAEEFEDELEAMSPLDD---GPPPRKPREVAGPS----RGRKKAGRSGPEECPSSGDT

Query:  INKPPSINSLIKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVN
        +     +     +EKG+F F GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N       + GK V+F  + +N LY L     T       
Subjt:  INKPPSINSLIKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVN

Query:  SPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWM
         P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVWL FIKK + PTRHD+TI+LE  MLLYCI+ +  +N+ E+I   I +W+
Subjt:  SPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWM

KAA0049609.1 transposase [Cucumis melo var. makuwa]6.1e-3450.31Show/hide
Query:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML
        +  + V F+ E IN LYDLPND+    GQ  +    +   ++ +++I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN
        LYCI AKK  NLG ++  + LSWMR PK A PFP+T++ LCLK +      I  I + GG CN
Subjt:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN

KAA0054837.1 hypothetical protein E6C27_scaffold406G00150 [Cucumis melo var. makuwa]1.4e-3535.44Show/hide
Query:  EEF-EDELEAMSPLDDGPPPRKPREVAGPSRGRKKAGRSGPEECPSSGDTINKPPSINSL-----IKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHT
        +EF E++   +SPL++    R+PR+      G+    R   E+     ++      + S        VEKG F+F  QL  FL  PI+A GW+ F +G  
Subjt:  EEF-EDELEAMSPLDDGPPPRKPREVAGPSRGRKKAGRSGPEECPSSGDTINKPPSINSL-----IKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHT

Query:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF
         IR GVV+ FY  K++  +    +  +                             P+    +EALE +AW    W+VT   KY+L+ H LTTEASVWL 
Subjt:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF

Query:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGGLCN
        FIKKK+ PTRHD+TI+ E  MLLYCI+ +  V++ E+I   I +W++ P+GA PFP  IE LCL+    L  + Q   +  G+CN
Subjt:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGGLCN

KAA0062900.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-4150.56Show/hide
Query:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL
        KIR+ VV KFY  K N ++  + I  +   F+ E IN LY+ PND E  GQ  V   TK + +EAL+++AWPG   EV P   +YQLYPH LTT+A+VW+
Subjt:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIP
        FF K KIFPT +DSTI+++  ++LYCI+ KK +NL E+I  +IL+WM  PK AMPFPS +E LCLK +P L      +IP
Subjt:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIP

TYK15967.1 hypothetical protein E5676_scaffold94G00870 [Cucumis melo var. makuwa]6.1e-3450.31Show/hide
Query:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML
        +  + V F+ E IN LYDLPND+    GQ  +    +   ++ +++I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN
        LYCI AKK  NLG ++  + LSWMR PK A PFP+T++ LCLK +      I  I + GG CN
Subjt:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN

TrEMBL top hitse value%identityAlignment
A0A5A7TZE0 Protein MNN4-like8.0e-4037.46Show/hide
Query:  AVKAALKRKEEKKKMF-AELSEQVAELPAKARALEPERNLKAIAEEFEDELEAMSPLDD---GPPPRKPREVAGPS----RGRKKAGRSGPEECPSSGDT
        A KA  K ++ KK++   ++  Q  +  A+ +    E+     ++EFE ELE +SPL+D      P+K R + G        + K  +   E   S  + 
Subjt:  AVKAALKRKEEKKKMF-AELSEQVAELPAKARALEPERNLKAIAEEFEDELEAMSPLDD---GPPPRKPREVAGPS----RGRKKAGRSGPEECPSSGDT

Query:  INKPPSINSLIKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVN
        +     +     +EKG+F F GQLP FL +PI+A  WK FF+G T IR  V+  FY   +N       + GK V+F  + +N LY L     T       
Subjt:  INKPPSINSLIKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVN

Query:  SPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWM
         P+    + ALE +AWPG  W++TP  KYQL+PH L T ASVWL FIKK + PTRHD+TI+LE  MLLYCI+ +  +N+ E+I   I +W+
Subjt:  SPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWM

A0A5A7U806 Transposase2.9e-3450.31Show/hide
Query:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML
        +  + V F+ E IN LYDLPND+    GQ  +    +   ++ +++I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN
        LYCI AKK  NLG ++  + LSWMR PK A PFP+T++ LCLK +      I  I + GG CN
Subjt:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN

A0A5A7V6M5 Gag/pol protein6.5e-4250.56Show/hide
Query:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL
        KIR+ VV KFY  K N ++  + I  +   F+ E IN LY+ PND E  GQ  V   TK + +EAL+++AWPG   EV P   +YQLYPH LTT+A+VW+
Subjt:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTG-KYQLYPHQLTTEASVWL

Query:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIP
        FF K KIFPT +DSTI+++  ++LYCI+ KK +NL E+I  +IL+WM  PK AMPFPS +E LCLK +P L      +IP
Subjt:  FFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIP

A0A5D3CVL7 Uncharacterized protein2.9e-3450.31Show/hide
Query:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML
        +  + V F+ E IN LYDLPND+    GQ  +    +   ++ +++I WP A    TPT + QL+PHQLT EA+VWLFFIKKKIFPT HDSTI  E A++
Subjt:  ISGKTVSFSAEAINALYDLPNDIET-SGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAML

Query:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN
        LYCI AKK  NLG ++  + LSWMR PK A PFP+T++ LCLK +      I  I + GG CN
Subjt:  LYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVV-PFLSAIQTISIPGGLCN

A0A5D3DVQ6 Uncharacterized protein7.0e-3635.44Show/hide
Query:  EEF-EDELEAMSPLDDGPPPRKPREVAGPSRGRKKAGRSGPEECPSSGDTINKPPSINSL-----IKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHT
        +EF E++   +SPL++    R+PR+      G+    R   E+     ++      + S        VEKG F+F  QL  FL  PI+A GW+ F +G  
Subjt:  EEF-EDELEAMSPLDDGPPPRKPREVAGPSRGRKKAGRSGPEECPSSGDTINKPPSINSL-----IKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHT

Query:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF
         IR GVV+ FY  K++  +    +  +                             P+    +EALE +AW    W+VT   KY+L+ H LTTEASVWL 
Subjt:  KIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWPGAAWEVTPTGKYQLYPHQLTTEASVWLF

Query:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGGLCN
        FIKKK+ PTRHD+TI+ E  MLLYCI+ +  V++ E+I   I +W++ P+GA PFP  IE LCL+    L  + Q   +  G+CN
Subjt:  FIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFL-SAIQTISIPGGLCN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGATCTTCAAGTCATCAAGATCACATTAGAGTACGTTCTGATGGCCCTGAATCAGAACGCCCTACATCATGGGGAAGTGGCGAACAAAAAATGCCACAATACAC
CCACTTGAAACACGAGGCGGTGCTCGAATTTCTTAACGAACGAGCAAAGAAGAGGAAAGAAGCCCACATCAAAAGGACTAAGGAAGCTCGTCGCCAAAAGGACGAGCGTG
AGCGAAAAAGAGTTGATGCAATTCGAAAGGGGAAGAGCACACTAGAAATATGTTCACCGCCTAACGAGGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAA
GTCTCGTTCGCTAAAACGAGGAAAACAATCGAAGCAGTTAAGGCTGCCTTAAAAAGGAAAGAAGAGAAGAAAAAGATGTTCGCAGAGTTGAGCGAGCAAGTAGCGGAGCT
CCCAGCGAAAGCAAGAGCATTGGAGCCAGAGAGAAACCTCAAAGCAATCGCAGAAGAATTCGAGGATGAGTTAGAGGCGATGAGCCCACTTGACGACGGGCCACCGCCAA
GAAAACCACGGGAGGTCGCAGGACCATCAAGAGGAAGGAAGAAAGCTGGGCGTTCTGGACCTGAAGAATGTCCATCAAGCGGTGACACCATAAACAAACCACCCTCTATC
AACTCTCTCATCAAAGTTGAGAAGGGGTTGTTTCTGTTCAATGGTCAACTCCCGGATTTCCTCTACGCGCCAATTCAGGCGTTCGGATGGAAATCATTTTTCAAGGGGCA
CACCAAGATAAGATTAGGAGTGGTGGAAAAGTTTTACGCGGCTAAGCTCAACGCTGCAGAGTTTAGCGTACAAATAAGTGGAAAGACAGTGAGTTTCAGCGCGGAGGCCA
TCAATGCGTTGTATGATTTGCCCAATGACATTGAAACCTCAGGGCAAATATACGTAAACAGTCCTACGAAGAGGATGACCCGTGAAGCGCTGGAAATCATCGCATGGCCT
GGGGCCGCATGGGAAGTAACGCCAACAGGGAAGTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGCGTGTGGTTGTTCTTTATCAAGAAAAAGATCTTCCCAAC
ACGCCATGATAGCACCATCAATTTAGAGTCAGCAATGCTACTCTATTGTATCCTAGCGAAGAAGCGTGTTAATCTTGGCGAACTTATAGCCACATCCATTCTGTCATGGA
TGCGGGCTCCCAAAGGCGCGATGCCCTTCCCTTCTACCATTGAGGCCCTTTGCCTTAAAGTTGTGCCATTCTTATCCGCCATCCAAACCATCTCAATACCAGGCGGGCTG
TGTAATCAAATGGCCTTAAACCGCATGATTACTTTCTATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATTAGGCGACACACCTGAAGGAATGGCCCAAGCAGAAAG
AAAAAGGAAGTCCCCAGTCGTCGCATCAACCCCCCCACCTAAAGCCAAAAAAACAAAGGTTCTTGCGACGAAGCAGCTTCCACTGAAATTTCTCCACTCCTCTCGCCCAA
TACAGCGAGCTACCCATCAGTCCAAAATTCCAGCAGCTCCAATCCTCCTCGCTCTTCTTCACCCATTCCAATCACCCCACCATCACCTAACATCTCTCCCCGCCATTCAC
CTCTCCCCCACATTTGTTCCCCTACCAACATCCCTCACCTTTCCCCACGACCGCCTACCCCTCCGCTCACAAAATCTACTTCTCCCCTTCCTTCCAAATCACCCTCACCA
AGGCGGGCTGAACCTCTTTCGCCTTTTATTCTGTCGCCCATCATGGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGATCTTCAAGTCATCAAGATCACATTAGAGTACGTTCTGATGGCCCTGAATCAGAACGCCCTACATCATGGGGAAGTGGCGAACAAAAAATGCCACAATACAC
CCACTTGAAACACGAGGCGGTGCTCGAATTTCTTAACGAACGAGCAAAGAAGAGGAAAGAAGCCCACATCAAAAGGACTAAGGAAGCTCGTCGCCAAAAGGACGAGCGTG
AGCGAAAAAGAGTTGATGCAATTCGAAAGGGGAAGAGCACACTAGAAATATGTTCACCGCCTAACGAGGTTGCAGAGCTTCACAAGAAGATCTCTGACAAGCTCGCACAA
GTCTCGTTCGCTAAAACGAGGAAAACAATCGAAGCAGTTAAGGCTGCCTTAAAAAGGAAAGAAGAGAAGAAAAAGATGTTCGCAGAGTTGAGCGAGCAAGTAGCGGAGCT
CCCAGCGAAAGCAAGAGCATTGGAGCCAGAGAGAAACCTCAAAGCAATCGCAGAAGAATTCGAGGATGAGTTAGAGGCGATGAGCCCACTTGACGACGGGCCACCGCCAA
GAAAACCACGGGAGGTCGCAGGACCATCAAGAGGAAGGAAGAAAGCTGGGCGTTCTGGACCTGAAGAATGTCCATCAAGCGGTGACACCATAAACAAACCACCCTCTATC
AACTCTCTCATCAAAGTTGAGAAGGGGTTGTTTCTGTTCAATGGTCAACTCCCGGATTTCCTCTACGCGCCAATTCAGGCGTTCGGATGGAAATCATTTTTCAAGGGGCA
CACCAAGATAAGATTAGGAGTGGTGGAAAAGTTTTACGCGGCTAAGCTCAACGCTGCAGAGTTTAGCGTACAAATAAGTGGAAAGACAGTGAGTTTCAGCGCGGAGGCCA
TCAATGCGTTGTATGATTTGCCCAATGACATTGAAACCTCAGGGCAAATATACGTAAACAGTCCTACGAAGAGGATGACCCGTGAAGCGCTGGAAATCATCGCATGGCCT
GGGGCCGCATGGGAAGTAACGCCAACAGGGAAGTATCAGTTGTATCCACACCAACTAACCACTGAAGCAAGCGTGTGGTTGTTCTTTATCAAGAAAAAGATCTTCCCAAC
ACGCCATGATAGCACCATCAATTTAGAGTCAGCAATGCTACTCTATTGTATCCTAGCGAAGAAGCGTGTTAATCTTGGCGAACTTATAGCCACATCCATTCTGTCATGGA
TGCGGGCTCCCAAAGGCGCGATGCCCTTCCCTTCTACCATTGAGGCCCTTTGCCTTAAAGTTGTGCCATTCTTATCCGCCATCCAAACCATCTCAATACCAGGCGGGCTG
TGTAATCAAATGGCCTTAAACCGCATGATTACTTTCTATGGACACAAGGAAATGGAAAGGCGGGCAAAGACATTAGGCGACACACCTGAAGGAATGGCCCAAGCAGAAAG
AAAAAGGAAGTCCCCAGTCGTCGCATCAACCCCCCCACCTAAAGCCAAAAAAACAAAGGTTCTTGCGACGAAGCAGCTTCCACTGAAATTTCTCCACTCCTCTCGCCCAA
TACAGCGAGCTACCCATCAGTCCAAAATTCCAGCAGCTCCAATCCTCCTCGCTCTTCTTCACCCATTCCAATCACCCCACCATCACCTAACATCTCTCCCCGCCATTCAC
CTCTCCCCCACATTTGTTCCCCTACCAACATCCCTCACCTTTCCCCACGACCGCCTACCCCTCCGCTCACAAAATCTACTTCTCCCCTTCCTTCCAAATCACCCTCACCA
AGGCGGGCTGAACCTCTTTCGCCTTTTATTCTGTCGCCCATCATGGACCTGA
Protein sequenceShow/hide protein sequence
MAGSSSHQDHIRVRSDGPESERPTSWGSGEQKMPQYTHLKHEAVLEFLNERAKKRKEAHIKRTKEARRQKDERERKRVDAIRKGKSTLEICSPPNEVAELHKKISDKLAQ
VSFAKTRKTIEAVKAALKRKEEKKKMFAELSEQVAELPAKARALEPERNLKAIAEEFEDELEAMSPLDDGPPPRKPREVAGPSRGRKKAGRSGPEECPSSGDTINKPPSI
NSLIKVEKGLFLFNGQLPDFLYAPIQAFGWKSFFKGHTKIRLGVVEKFYAAKLNAAEFSVQISGKTVSFSAEAINALYDLPNDIETSGQIYVNSPTKRMTREALEIIAWP
GAAWEVTPTGKYQLYPHQLTTEASVWLFFIKKKIFPTRHDSTINLESAMLLYCILAKKRVNLGELIATSILSWMRAPKGAMPFPSTIEALCLKVVPFLSAIQTISIPGGL
CNQMALNRMITFYGHKEMERRAKTLGDTPEGMAQAERKRKSPVVASTPPPKAKKTKVLATKQLPLKFLHSSRPIQRATHQSKIPAAPILLALLHPFQSPHHHLTSLPAIH
LSPTFVPLPTSLTFPHDRLPLRSQNLLLPFLPNHPHQGGLNLFRLLFCRPSWT