CuGenDBv2

Sed0015257 (gene) of Chayote v1 genome

Gene ID: Sed0015257
Organism: Sechium edule (Chayote v1)
Description: Homeobox Hox-B3-like protein
Genome location: LG05:37973226..37974248
RNA-Seq Expression: Sed0015257
Synteny: Sed0015257
Gene Ontology terms: NA
InterPro domains: NA
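
The genome location above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and it assumes the `SEQID:START..END` string uses 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split 'SEQID:START..END' into (seqid, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: both coordinates are 1-based and inclusive,
    # so the span covers end - start + 1 bases.
    return seqid, start, end, end - start + 1


print(parse_location("LG05:37973226..37974248"))
```

Under the inclusive-coordinate assumption, the span on LG05 works out to 1,023 bp.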


Homology
GenBank top hits (hit, e value, % identity, alignment)
KAG6607056.1 hypothetical protein SDJN03_00398, partial [Cucurbita argyrosperma subsp. sororia]; e value 2.5e-75; identity 80.65%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA  F    QTHL FHSIDPRSLLLHQN+A      PL+LTAETFSMERGPRYRAYAELRE+KLR RN +YRD E PEKS PPAKKQVRFVG ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFSAVLRKENRRPPPGLSPVME  TPP    GKN IGG +SANSRGSKSAS AGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGE++
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  R-GGVRRGKTVLGVRQI
        R GG RRGKTVLGVRQI
Subjt:  R-GGVRRGKTVLGVRQI

KAG7036757.1 hypothetical protein SDJN02_00377, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 8.5e-76; identity 81.11%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA  F    QTHL FHSIDPRSLLLHQN+A      PL+LTAETFSMERGPRYRAYAELRE+KLR RNA+YRD E PEKS PPAKKQVRFVG ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFSAVLRKENRRPPPGLSPVME  TPP    GKN IGG +SANSRGSKSAS AGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGE++
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  R-GGVRRGKTVLGVRQI
        R GG RRGKTVLGVRQI
Subjt:  R-GGVRRGKTVLGVRQI

XP_022948564.1 uncharacterized protein LOC111452204 [Cucurbita moschata]; e value 8.5e-76; identity 81.11%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA  F    QTHL FHSIDPRSLLLHQN+A      PL+LTAETFSMERGPRYRAYAELRE+KLR RNA+YRD E PEKS PPAKKQVRFVG ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFSAVLRKENRRPPPGLSPVME  TPP    GKN IGG +SANSRGSKSAS AGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGE++
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  R-GGVRRGKTVLGVRQI
        R GG RRGKTVLGVRQI
Subjt:  R-GGVRRGKTVLGVRQI

XP_022998093.1 uncharacterized protein LOC111492844 [Cucurbita maxima]; e value 9.1e-78; identity 81.02%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA  F  L QTHLPFHSIDPRSLLLHQN+A      PL+LTAETFSMERGPRYRAYAELRE+KLR RN++YRD E PEKS PPAKKQVRFVG ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFS+VLRKEN+RPPPGLSPVME  TPP    GKN IGG +SANSRGSKSAS AGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGE++
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  RGGVRRGKTVLGVRQI
        RGG RRGKTVLGVRQI
Subjt:  RGGVRRGKTVLGVRQI

XP_023524771.1 uncharacterized protein LOC111788608 [Cucurbita pepo subsp. pepo]; e value 2.7e-77; identity 81.65%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA  F  L QTHLPFHSIDPRSLLLHQN+A      PL+LTAETFSMERGPRYRAYAELRE+KLR RNA+YRD E PEKS PPAKKQVRFVG ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPPGK-----NGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGES
         +A VAQSVPDFSAVLRKENRRPPPGLSPVME  TPPGK     N IGG +SANSRGSKSAS AGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGE+
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPPGK-----NGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGES

Query:  RR-GGVRRGKTVLGVRQI
        +R GG RRGKTVLGVRQI
Subjt:  RR-GGVRRGKTVLGVRQI

TrEMBL top hits (hit, e value, % identity, alignment)
A0A0A0L9P8 Uncharacterized protein; e value 1.4e-68; identity 76.85%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MAHKF      HLPFHSID RSLLLHQN +AA  PI L LT E FSMERGPRYRAYAELRE+KLR RNA+YR  E PEKS PP KKQV+F+G ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPG-LSPVMETKTPPGK---NGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFSAVLRKENR+PPPG LSPVME  TPPGK     IGG+ S  SRGSKSAS AGEKRGGGL A RKSYAGFEELKGFSTAAANAINGE+R
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPG-LSPVMETKTPPGK---NGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  RGGVRRGKTVLGVRQI
        +GG RRGKTVLGVRQI
Subjt:  RGGVRRGKTVLGVRQI

A0A5A7U2K6 Uncharacterized protein; e value 1.6e-67; identity 73.61%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA KFH     HLPFHSID RSLLLHQN+AA   PI L LT E FSMERGPRY+AYAELRE+KLRFRNA+YR  E PEKS PP KKQ++F+G ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPG-LSPVMETKTPPGK---NGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFSAVLRKENR+PPPG LSPVME  TPPGK     +GG+ S NSRGSKSAS AGEKRGGGL   RKSYAGFEELKGFSTA A AINGE+R
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPG-LSPVMETKTPPGK---NGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  RGGVRRGKTVLGVRQI
        +GG R+GKTVLG RQ+
Subjt:  RGGVRRGKTVLGVRQI

A0A6J1D4H7 uncharacterized protein LOC111016939; e value 3.1e-63; identity 74.04%
Query:  HLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRPAAAAVAQSVPD
        HLPFH+IDPRSLLLHQN + A  PI LRLTA +FSMERGPRYRAYAELRE+KLR RNA Y D E PEK  PPAKKQV+F+G ET RKR  +AAVAQSVPD
Subjt:  HLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRPAAAAVAQSVPD

Query:  FSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKR---GGGLAAARKSYAGFEELKGFSTAAANAINGESRRGGVRRGK
        FSA LRKENR+PP    P +   TPP    GKN +G  +S NSRGSKSAS AGEKR   GGGL AARKSYAGFEELKGFSTAAANAINGE RRGG R+GK
Subjt:  FSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKR---GGGLAAARKSYAGFEELKGFSTAAANAINGESRRGGVRRGK

Query:  TVLGVRQI
        TVLG+RQI
Subjt:  TVLGVRQI

A0A6J1GA97 uncharacterized protein LOC111452204; e value 4.1e-76; identity 81.11%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA  F    QTHL FHSIDPRSLLLHQN+A      PL+LTAETFSMERGPRYRAYAELRE+KLR RNA+YRD E PEKS PPAKKQVRFVG ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFSAVLRKENRRPPPGLSPVME  TPP    GKN IGG +SANSRGSKSAS AGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGE++
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  R-GGVRRGKTVLGVRQI
        R GG RRGKTVLGVRQI
Subjt:  R-GGVRRGKTVLGVRQI

A0A6J1KFU5 uncharacterized protein LOC111492844; e value 4.4e-78; identity 81.02%
Query:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP
        MA  F  L QTHLPFHSIDPRSLLLHQN+A      PL+LTAETFSMERGPRYRAYAELRE+KLR RN++YRD E PEKS PPAKKQVRFVG ET RKR 
Subjt:  MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRP

Query:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR
         +A VAQSVPDFS+VLRKEN+RPPPGLSPVME  TPP    GKN IGG +SANSRGSKSAS AGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGE++
Subjt:  AAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP----GKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESR

Query:  RGGVRRGKTVLGVRQI
        RGG RRGKTVLGVRQI
Subjt:  RGGVRRGKTVLGVRQI

SwissProt top hits (hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (hit, e value, % identity, alignment)
AT1G67035.1 unknown protein; e value 1.3e-10; identity 34.97%
Query:  FSMERGPRYRAYAELREAKLRFRNAVYRDVEPPE-----------KSPPPAKKQVRFVGMETARKRPAAAAVAQSVPDFSAVLRKENRRPP--PGLSPVM
        +  ERG RY  YA LRE+KLR +    + ++  +           +  P  K +  F   +      +++++AQSVPDFS+++RKENRRPP    L P  
Subjt:  FSMERGPRYRAYAELREAKLRFRNAVYRDVEPPE-----------KSPPPAKKQVRFVGMETARKRPAAAAVAQSVPDFSAVLRKENRRPP--PGLSPVM

Query:  ETKTPPGKNGIGGMVSANS--RGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESRRGGVRRGKTVLGVRQI
           TPP        V A S  RGS SAS AGEK+GG     RKS                        GG   G+T+LG RQI
Subjt:  ETKTPPGKNGIGGMVSANS--RGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESRRGGVRRGKTVLGVRQI

AT1G67035.2 unknown protein; e value 1.9e-17; identity 38.78%
Query:  FSMERGPRYRAYAELREAKLRFRNAVYRDVEPPE-----------KSPPPAKKQVRFVGMETARKRPAAAAVAQSVPDFSAVLRKENRRPP--PGLSPVM
        +  ERG RY  YA LRE+KLR +    + ++  +           +  P  K +  F   +      +++++AQSVPDFS+++RKENRRPP    L P  
Subjt:  FSMERGPRYRAYAELREAKLRFRNAVYRDVEPPE-----------KSPPPAKKQVRFVGMETARKRPAAAAVAQSVPDFSAVLRKENRRPP--PGLSPVM

Query:  ETKTPPGKNGIGGMVSANS--RGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESRR-------------GGVRRGKTVLGVRQI
           TPP        V A S  RGS SAS AGEK+G G+   RKSYA  ++LK  S AAA+AING   +             GG   G+T+LG RQI
Subjt:  ETKTPPGKNGIGGMVSANS--RGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESRR-------------GGVRRGKTVLGVRQI

AT5G38300.1 unknown protein; e value 2.6e-22; identity 39%
Query:  FHSIDPRSLLLHQNAAAAAAPIPLRLTAETFS-MERGPRYRAYAELREAKLRFRNAVYR------DVEPP------------------EKSPPPAKKQVR
        F S+DP SL+L QN+        L+L  + FS  ERGPRY  Y+ LRE+KLR +    +      +VE P                  +K  P  KKQ R
Subjt:  FHSIDPRSLLLHQNAAAAAAPIPLRLTAETFS-MERGPRYRAYAELREAKLRFRNAVYR------DVEPP------------------EKSPPPAKKQVR

Query:  FVGME---------------------TARKRPAAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP---GKNGIGGMVSAN-SRGSKSASAAGEKRG
        F G+                      +  K+   +++AQSVPDFSAV+RKENRRP       + T TPP    K+  GG+++ + SRGSKSAS AGEK+ 
Subjt:  FVGME---------------------TARKRPAAAAVAQSVPDFSAVLRKENRRPPPGLSPVMETKTPP---GKNGIGGMVSAN-SRGSKSASAAGEKRG

Query:  GG---LAAARKSYAGFEELKGFSTAAANAINGESRRGGVRR--------GKTVLGVRQI
         G   +  ARKSYA  E+LK  S AAA+AING    GG  R         +T+LG RQI
Subjt:  GG---LAAARKSYAGFEELKGFSTAAANAINGESRRGGVRR--------GKTVLGVRQI
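
The percent-identity figures in the hit lines can be approximated from the alignment midlines. The sketch below is illustrative only and assumes the usual BLAST pairwise conventions: a letter on the midline marks an identical residue, `+` a conservative substitution, a space a mismatch or gap, and percent identity is taken over the full alignment length including gap columns. The function name `percent_identity` is hypothetical, and each block is expected as the sequence portion only, with the `Query:` prefix and its padding already removed.

```python
def percent_identity(blocks):
    """Return (identical, aligned_columns, percent) for (query, midline) blocks."""
    identical = 0
    aligned = 0
    for query, midline in blocks:
        width = len(query)
        # Trailing spaces on the midline are often stripped in copied text,
        # so pad it back to the width of the query row.
        padded = midline.ljust(width)[:width]
        # Letters on the midline mark identical positions; '+' and ' ' do not count.
        identical += sum(ch.isalpha() for ch in padded)
        aligned += width
    if aligned == 0:
        raise ValueError("no aligned columns supplied")
    return identical, aligned, 100.0 * identical / aligned


# Last block of the first GenBank hit, sequence portions only.
last_block = [("R-GGVRRGKTVLGVRQI", "R GG RRGKTVLGVRQI")]
print(percent_identity(last_block))
```

To compare against a reported figure, pass every block of one hit in a single call so that the counts are summed over the whole alignment rather than one wrapped segment.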


Sequences
CDS sequence
ATGGCCCACAAATTCCACGATTTGAGACAAACCCATCTCCCCTTTCACTCCATCGACCCCAGATCCCTCCTCCTCCACCAGAACGCCGCCGCCGCCGCCGCCCCAATTCC
TCTCCGTCTCACGGCGGAGACTTTTTCCATGGAAAGAGGCCCCAGATACAGAGCCTACGCCGAGCTCAGAGAAGCCAAGCTCCGATTCAGAAACGCCGTGTACCGCGACG
TGGAACCGCCGGAAAAGTCCCCTCCGCCGGCGAAGAAGCAAGTGAGATTTGTGGGTATGGAGACCGCTCGAAAGAGGCCGGCGGCGGCGGCGGTGGCGCAATCGGTGCCG
GATTTCTCTGCTGTACTGAGGAAGGAGAACAGGAGGCCGCCACCGGGGCTGTCGCCGGTGATGGAGACGAAGACGCCGCCGGGGAAGAACGGCATTGGGGGGATGGTGTC
GGCGAATTCGAGAGGGAGTAAGTCGGCGAGTGCGGCCGGGGAGAAAAGGGGCGGCGGGTTGGCGGCGGCGAGGAAGAGCTACGCCGGATTCGAGGAGCTGAAGGGGTTTT
CGACGGCGGCGGCGAACGCCATTAATGGTGAAAGTAGGAGAGGGGGAGTTAGGAGAGGGAAGACTGTGTTGGGAGTTAGACAGATTTGA
mRNA sequence
TGAGAAACCAAATCCCTTTTTTCAAAACCCCACAATTACTTACCCTTTCAATTTTATCTCTTCATTCAAACAGCACACAACACGAACAGAGCTTCCTCATCCTTCATCAA
TGGCCCACAAATTCCACGATTTGAGACAAACCCATCTCCCCTTTCACTCCATCGACCCCAGATCCCTCCTCCTCCACCAGAACGCCGCCGCCGCCGCCGCCCCAATTCCT
CTCCGTCTCACGGCGGAGACTTTTTCCATGGAAAGAGGCCCCAGATACAGAGCCTACGCCGAGCTCAGAGAAGCCAAGCTCCGATTCAGAAACGCCGTGTACCGCGACGT
GGAACCGCCGGAAAAGTCCCCTCCGCCGGCGAAGAAGCAAGTGAGATTTGTGGGTATGGAGACCGCTCGAAAGAGGCCGGCGGCGGCGGCGGTGGCGCAATCGGTGCCGG
ATTTCTCTGCTGTACTGAGGAAGGAGAACAGGAGGCCGCCACCGGGGCTGTCGCCGGTGATGGAGACGAAGACGCCGCCGGGGAAGAACGGCATTGGGGGGATGGTGTCG
GCGAATTCGAGAGGGAGTAAGTCGGCGAGTGCGGCCGGGGAGAAAAGGGGCGGCGGGTTGGCGGCGGCGAGGAAGAGCTACGCCGGATTCGAGGAGCTGAAGGGGTTTTC
GACGGCGGCGGCGAACGCCATTAATGGTGAAAGTAGGAGAGGGGGAGTTAGGAGAGGGAAGACTGTGTTGGGAGTTAGACAGATTTGAGAGTCACAAAAGAAAGAACAAT
TGGGGTTCTGATTTTTCTTTGGAAGAATTGTTTTGTTTTGATGTTCTTCCATTTTGTTTGAATTTTGTTGTTTTTGTTGTTGAACTTTTGGAGGGTGCTTTATTCTTTGA
TTGTAATGGATAAATTGGGATTGTGGAGTAATGAAGAAAACATGTTAGTGTGTTGTTAATCTCATTGATTGTATTTATTATAGATGTTTCTGAGTGTCCAATTATCAAAA
TCAGAATATTTATTGTTAATAATACTTTGTTAC
Protein sequence
MAHKFHDLRQTHLPFHSIDPRSLLLHQNAAAAAAPIPLRLTAETFSMERGPRYRAYAELREAKLRFRNAVYRDVEPPEKSPPPAKKQVRFVGMETARKRPAAAAVAQSVP
DFSAVLRKENRRPPPGLSPVMETKTPPGKNGIGGMVSANSRGSKSASAAGEKRGGGLAAARKSYAGFEELKGFSTAAANAINGESRRGGVRRGKTVLGVRQI
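
The relationship between the CDS and protein sequences can be checked with a small translation routine. This is a minimal sketch assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 bases of the CDS are used here to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, listed in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, to_stop=True):
    """Translate a CDS in frame 1; whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        # Codons containing anything other than A, C, G, T become 'X'.
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the CDS sequence listed above.
cds_start = "ATGGCCCACAAATTCCACGATTTGAGACAA"
print(translate(cds_start))  # MAHKFHDLRQ
```

The output matches the first ten residues of the protein sequence. Pasting the full CDS (line breaks included) into `translate` should give the complete protein, with translation stopping at the terminal TGA codon.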