; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021324 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021324
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCACTA en-spm transposon protein
Genome locationscaffold6:46825125..46832487
RNA-Seq ExpressionSpg021324
SyntenySpg021324
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060512.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]3.6e-4340.51Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLC----
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+R D HRH+++Y  PEEARANPP  L    EDWHFLC    
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLC----

Query:  -------------TKFETPEWKKLE-QGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKRPGHVKGLGWGPKP
                       F   +++  E +G  +  ++LF  TH R G +++Q A DAHNQML LQ  P PEGS+PL+ +EI +KVLG+RPG+ KGLGWGPKP
Subjt:  -------------TKFETPEWKKLE-QGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKRPGHVKGLGWGPKP

Query:  TSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
         +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  TSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

TYK10637.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]1.0e-4237.98Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE

Query:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
        +  +++                                 +G  +  ++LF  TH R G +V+Q A DAHNQML LQ    P+GS+PL+ +EI ++VLGKR
Subjt:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   R    +IE LQA+LQE+   + ++E      ++ T    +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

TYK11230.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]6.1e-4338.33Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE

Query:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
        +  +++                                 +G  +  ++LF  TH R G +V+Q A DAHNQML LQ  P PEGS+PL+ +EI ++VLG+R
Subjt:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

TYK18843.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]3.6e-4339.02Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKF-
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKF-

Query:  -----------ETPEWKKL-------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
                   +    K+L                    +G  +  ++LF  TH R G +V+Q A DAHNQML LQ  P PEGS+PL+ +EI ++VLG+R
Subjt:  -----------ETPEWKKL-------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

TYK21492.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]6.1e-4338.33Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE

Query:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
        +  +++                                 +G  +  ++LF  TH R G +V+Q A DAHNQML LQ  P PEGS+PL+ +EI ++VLG+R
Subjt:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

TrEMBL top hitse value%identityAlignment
A0A5A7UXD1 CACTA en-spm transposon protein1.7e-4340.51Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLC----
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+R D HRH+++Y  PEEARANPP  L    EDWHFLC    
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLC----

Query:  -------------TKFETPEWKKLE-QGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKRPGHVKGLGWGPKP
                       F   +++  E +G  +  ++LF  TH R G +++Q A DAHNQML LQ  P PEGS+PL+ +EI +KVLG+RPG+ KGLGWGPKP
Subjt:  -------------TKFETPEWKKLE-QGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKRPGHVKGLGWGPKP

Query:  TSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
         +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  TSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

A0A5D3CF83 CACTA en-spm transposon protein5.1e-4337.98Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE

Query:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
        +  +++                                 +G  +  ++LF  TH R G +V+Q A DAHNQML LQ    P+GS+PL+ +EI ++VLGKR
Subjt:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   R    +IE LQA+LQE+   + ++E      ++ T    +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

A0A5D3CIP7 CACTA en-spm transposon protein3.0e-4338.33Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE

Query:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
        +  +++                                 +G  +  ++LF  TH R G +V+Q A DAHNQML LQ  P PEGS+PL+ +EI ++VLG+R
Subjt:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

A0A5D3D5L1 CACTA en-spm transposon protein1.7e-4339.02Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKF-
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKF-

Query:  -----------ETPEWKKL-------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
                   +    K+L                    +G  +  ++LF  TH R G +V+Q A DAHNQML LQ  P PEGS+PL+ +EI ++VLG+R
Subjt:  -----------ETPEWKKL-------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

A0A5D3DD39 CACTA en-spm transposon protein3.0e-4338.33Show/hide
Query:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE
        AIG+ VR++FPVRC    +V +E  E+V   +   F  D +  A+ +F++ QM   FKE+RAD HRH+++Y  PEEARANPP  L    EDWHFLC  + 
Subjt:  AIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPRLKEDIEDWHFLCTKFE

Query:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR
        +  +++                                 +G  +  ++LF  TH R G +V+Q A DAHNQML LQ  P PEGS+PL+ +EI ++VLG+R
Subjt:  TPEWKKL-------------------------------EQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKR

Query:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG
        PG+ KGLGWGPKP +R   S+  +S+   +    +IE LQA+LQE+  +I      V++     L   +++E+++ MI++F ++Q G
Subjt:  PGHVKGLGWGPKPTSRNKTSSDEASSQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCATCTCCCGACGGCAAATCCGTAGCGTCGGCAGTTACAATTACATATGGTCGGACTTGAAGAGAACTCCCGACGCTCAGTTTACACATCGTCGGGAGTATCT
CCCGTCTCCTGTCTCCTGTCTCCTGTCTCCCTCCCTCTCCCTCCTTCGATTCGCCGTCTCCCATCTCCTGTCGCCGCCGTTCAACTCTCGCCTCCTCTCACTCAAAGAAG
CCCGCGTCACTGAGGTGGATCGGATCTTAGAGATAACACCTGACGAAAGTTCAAATCATAGTGTCGAGAAGCCAAACATTGAAGCCAGATTATGGCAGTCAAGCTTTGGA
GCTCTGACGTTATTGGCCAACCAAGTAAAAAGATTGGCCTTGGGAGCCCTTTTACTTCGACTCCTTCTTGGTTGCAACGCAATTGGTATTGCTGTGAGGGAATCATTTCC
AGTTCGATGCGCCTCAATCCGAAATGTACCGAAAGAATGTAAGGAACTTGTCATAAGTCGAGTGCTGGAGCATTTTGACTTCGACCTATCCAAACCAGCGGTGAAAAAGT
TCCTTCAACGACAAATGCAGAATTTATTTAAGGAGTATCGGGCAGATTTACATAGACACTACAGACAATACGAAAGTCCTGAAGAGGCACGTGCAAACCCACCACCTCGG
CTGAAAGAAGACATTGAAGATTGGCATTTTCTATGTACAAAGTTCGAGACCCCAGAGTGGAAGAAACTGGAACAAGGTTGCGAAATTGGACCAATTGACCTGTTTGAGCG
GACACACTCTAGAAATGGTGAGTGGGTCAATCAGAAGGCTAATGATGCACATAATCAGATGCTGGTGTTGCAAGATGCTCCTGTTCCAGAAGGGTCTGAACCACTCGCAG
GAGAGGAGATATTGGAGAAAGTGTTGGGAAAACGACCAGGCCATGTTAAAGGGCTTGGTTGGGGACCAAAGCCTACATCACGGAACAAAACCTCATCTGATGAAGCATCC
TCACAAAGAGAACGTGAGCAAGCAACGCAAATAGAATCTTTGCAGGCACAACTTCAGGAATCAGAAGCGAAGATTCACAAGGTAGAAGCAATGGTTGAGGAAGAGAGGAG
TGCAACACTCGAAACGAAAGCTGAGTTGGAGAACTTGAGAGGAATGATCCAACAATTCATGCAATCACAAGGAGGAGAGACATCGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAACCATCTCCCGACGGCAAATCCGTAGCGTCGGCAGTTACAATTACATATGGTCGGACTTGAAGAGAACTCCCGACGCTCAGTTTACACATCGTCGGGAGTATCT
CCCGTCTCCTGTCTCCTGTCTCCTGTCTCCCTCCCTCTCCCTCCTTCGATTCGCCGTCTCCCATCTCCTGTCGCCGCCGTTCAACTCTCGCCTCCTCTCACTCAAAGAAG
CCCGCGTCACTGAGGTGGATCGGATCTTAGAGATAACACCTGACGAAAGTTCAAATCATAGTGTCGAGAAGCCAAACATTGAAGCCAGATTATGGCAGTCAAGCTTTGGA
GCTCTGACGTTATTGGCCAACCAAGTAAAAAGATTGGCCTTGGGAGCCCTTTTACTTCGACTCCTTCTTGGTTGCAACGCAATTGGTATTGCTGTGAGGGAATCATTTCC
AGTTCGATGCGCCTCAATCCGAAATGTACCGAAAGAATGTAAGGAACTTGTCATAAGTCGAGTGCTGGAGCATTTTGACTTCGACCTATCCAAACCAGCGGTGAAAAAGT
TCCTTCAACGACAAATGCAGAATTTATTTAAGGAGTATCGGGCAGATTTACATAGACACTACAGACAATACGAAAGTCCTGAAGAGGCACGTGCAAACCCACCACCTCGG
CTGAAAGAAGACATTGAAGATTGGCATTTTCTATGTACAAAGTTCGAGACCCCAGAGTGGAAGAAACTGGAACAAGGTTGCGAAATTGGACCAATTGACCTGTTTGAGCG
GACACACTCTAGAAATGGTGAGTGGGTCAATCAGAAGGCTAATGATGCACATAATCAGATGCTGGTGTTGCAAGATGCTCCTGTTCCAGAAGGGTCTGAACCACTCGCAG
GAGAGGAGATATTGGAGAAAGTGTTGGGAAAACGACCAGGCCATGTTAAAGGGCTTGGTTGGGGACCAAAGCCTACATCACGGAACAAAACCTCATCTGATGAAGCATCC
TCACAAAGAGAACGTGAGCAAGCAACGCAAATAGAATCTTTGCAGGCACAACTTCAGGAATCAGAAGCGAAGATTCACAAGGTAGAAGCAATGGTTGAGGAAGAGAGGAG
TGCAACACTCGAAACGAAAGCTGAGTTGGAGAACTTGAGAGGAATGATCCAACAATTCATGCAATCACAAGGAGGAGAGACATCGAAATAG
Protein sequenceShow/hide protein sequence
MKTISRRQIRSVGSYNYIWSDLKRTPDAQFTHRREYLPSPVSCLLSPSLSLLRFAVSHLLSPPFNSRLLSLKEARVTEVDRILEITPDESSNHSVEKPNIEARLWQSSFG
ALTLLANQVKRLALGALLLRLLLGCNAIGIAVRESFPVRCASIRNVPKECKELVISRVLEHFDFDLSKPAVKKFLQRQMQNLFKEYRADLHRHYRQYESPEEARANPPPR
LKEDIEDWHFLCTKFETPEWKKLEQGCEIGPIDLFERTHSRNGEWVNQKANDAHNQMLVLQDAPVPEGSEPLAGEEILEKVLGKRPGHVKGLGWGPKPTSRNKTSSDEAS
SQREREQATQIESLQAQLQESEAKIHKVEAMVEEERSATLETKAELENLRGMIQQFMQSQGGETSK