CuGenDBv2

CmoCh09G012400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh09G012400
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: angio-associated migratory cell protein
Genome location: Cmo_Chr09:10655142..10663420
RNA-Seq Expression: CmoCh09G012400
Synteny: CmoCh09G012400
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR001680 - WD40 repeat
  IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
  IPR019775 - WD40 repeat, conserved site
  IPR036322 - WD40-repeat-containing domain superfamily
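
The genome location above follows a `seqid:start..end` pattern. The Python sketch below splits such a string into its parts. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Cmo_Chr09:10655142..10663420")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 8279 bp on Cmo_Chr09.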


Homology
GenBank top hits | e value | % identity
KAG6592326.1 Vegetative incompatibility protein HET-E-1, partial [Cucurbita argyrosperma subsp. sororia] | 1.0e-67 | 99.25
Query:  DDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSS
        DDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSS
Subjt:  DDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSS

Query:  LAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        LAFSANGQLLASGSFDGVIQIWDTSSGNLKCTL+
Subjt:  LAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

KAG7025146.1 SPAC25H1.08c, partial [Cucurbita argyrosperma subsp. argyrosperma] | 3.1e-69 | 98.55
Query:  HEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKD
        HEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRG FAQELTGHKD
Subjt:  HEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKD

Query:  SVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        SVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTL+
Subjt:  SVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

XP_008463066.1 PREDICTED: angio-associated migratory cell protein isoform X1 [Cucumis melo] | 1.8e-64 | 90.07
Query:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG
        S +HEEEDDHGEVFLDE DI+HEV VDEEDLPDAVDEEG DDEY DE DDS+HTFTGHTGEVYTV CSPIDATLVATGGGDDKGFMWKIGRGDFAQEL+G
Subjt:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG

Query:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        HKDSVSSLAFSA+GQLLASGSFDG+IQIWDTSSGNLKCTL+
Subjt:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

XP_022925391.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111432693 [Cucurbita moschata] | 1.4e-69 | 99.27
Query:  THEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHK
        +HEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHK
Subjt:  THEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHK

Query:  DSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCT
        DSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCT
Subjt:  DSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCT

XP_023535004.1 angio-associated migratory cell protein-like [Cucurbita pepo subsp. pepo] | 2.5e-66 | 96.35
Query:  EEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS
        EEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEG DDEYVDEDDDS HTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQEL+GHKDS
Subjt:  EEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS

Query:  VSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        VSSLAFSANG LLASGSFDGVIQIWDTSSGNLKCTL+
Subjt:  VSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

TrEMBL top hits | e value | % identity
A0A0A0KBX7 WD_REPEATS_REGION domain-containing protein | 1.1e-64 | 89.36
Query:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG
        S +HEEEDDHGEVFLDE DI+HEV VDEEDLPDAVDEEG DDEY DE DDS+HTFTGHTGEVYTV CSP+DATLVATGGGDDKGFMWKIGRGDFAQEL+G
Subjt:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG

Query:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        HKDSVSSLAFSA+GQLLASGSFDG+IQIWDTSSGNLKCTL+
Subjt:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

A0A1S3CIC4 angio-associated migratory cell protein isoform X1 | 8.7e-65 | 90.07
Query:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG
        S +HEEEDDHGEVFLDE DI+HEV VDEEDLPDAVDEEG DDEY DE DDS+HTFTGHTGEVYTV CSPIDATLVATGGGDDKGFMWKIGRGDFAQEL+G
Subjt:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG

Query:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        HKDSVSSLAFSA+GQLLASGSFDG+IQIWDTSSGNLKCTL+
Subjt:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

A0A1S3CID2 angio-associated migratory cell protein isoform X2 | 8.7e-65 | 90.07
Query:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG
        S +HEEEDDHGEVFLDE DI+HEV VDEEDLPDAVDEEG DDEY DE DDS+HTFTGHTGEVYTV CSPIDATLVATGGGDDKGFMWKIGRGDFAQEL+G
Subjt:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG

Query:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        HKDSVSSLAFSA+GQLLASGSFDG+IQIWDTSSGNLKCTL+
Subjt:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

A0A5D3DTH3 Angio-associated migratory cell protein isoform X1 | 8.7e-65 | 90.07
Query:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG
        S +HEEEDDHGEVFLDE DI+HEV VDEEDLPDAVDEEG DDEY DE DDS+HTFTGHTGEVYTV CSPIDATLVATGGGDDKGFMWKIGRGDFAQEL+G
Subjt:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG

Query:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK
        HKDSVSSLAFSA+GQLLASGSFDG+IQIWDTSSGNLKCTL+
Subjt:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLK

A0A6J1EC32 LOW QUALITY PROTEIN: uncharacterized protein LOC111432693 | 6.8e-70 | 99.27
Query:  THEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHK
        +HEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHK
Subjt:  THEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHK

Query:  DSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCT
        DSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCT
Subjt:  DSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCT

SwissProt top hits | e value | % identity
O13982 Uncharacterized WD repeat-containing protein C25H1.08c | 7.4e-13 | 30.6
Query:  EEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG
        EE+++ E+++ E+++   ++ +++ +   V E+  + E  D       D S+  F  H   V++V+ +P+ + L A+GGGDD G++W I  G+   +LTG
Subjt:  EEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTG

Query:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSG
        HKDS+ ++ +S +G  +A+G  D  +++W +S+G
Subjt:  HKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSG

Q13685 Angio-associated migratory cell protein | 6.3e-12 | 34.11
Query:  DEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----------DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS
        D DD+  E  +++ D  +  +EEG ++ +V E            DDS  TF  H+  V+ V+  P   TL  TGG DDK F+W++  G+   E  GHKDS
Subjt:  DEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----------DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS

Query:  VSSLAFSANGQLLASGSFDGVIQIWDTSS
        V+   FS +  L+A+G   G++++W   +
Subjt:  VSSLAFSANGQLLASGSFDGVIQIWDTSS

Q3SZK1 Angio-associated migratory cell protein | 3.7e-12 | 34.11
Query:  DEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----------DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS
        D DD++ E  +++ D  +  +EEG ++ +V E            DDS  TF  H+  V+ V+  P   TL  TGG DDK F+W++  G+   E  GHKDS
Subjt:  DEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----------DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS

Query:  VSSLAFSANGQLLASGSFDGVIQIWDTSS
        V+   FS +  L+A+G   G++++W   +
Subjt:  VSSLAFSANGQLLASGSFDGVIQIWDTSS

Q5RCG7 Angio-associated migratory cell protein | 6.3e-12 | 34.11
Query:  DEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----------DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS
        D DD+  E  +++ D  +  +EEG ++ +V E            DDS  TF  H+  V+ V+  P   TL  TGG DDK F+W++  G+   E  GHKDS
Subjt:  DEDDILHEVQVDEEDLPDAVDEEGYDDEYVDE-----------DDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDS

Query:  VSSLAFSANGQLLASGSFDGVIQIWDTSS
        V+   FS +  L+A+G   G++++W   +
Subjt:  VSSLAFSANGQLLASGSFDGVIQIWDTSS

Q7YR70 Angio-associated migratory cell protein | 1.1e-11 | 34.38
Query:  DEDDI---LHEVQVDEEDLPDAVDEEGYDDE-------YVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSV
        D DD+   + +V  +EE+  +  +EEG+  E        ++  DDS  TF  H+  V+ V+  P   TL  TGG DDK F+W++  G+   E  GHKDSV
Subjt:  DEDDI---LHEVQVDEEDLPDAVDEEGYDDE-------YVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSV

Query:  SSLAFSANGQLLASGSFDGVIQIWDTSS
        +   FS +  L+A+G   G++++W   +
Subjt:  SSLAFSANGQLLASGSFDGVIQIWDTSS

Arabidopsis top hits | e value | % identity
AT1G11160.1 Transducin/WD40 repeat-like superfamily protein | 3.3e-08 | 28.71
Query:  EDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLKVLEGE
        E+   +  FTGH      V   P     +A+G  D    +W   +    Q   GH   +S++ FS +G+ + SG  D V+++WD ++G L    K  EG 
Subjt:  EDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTLKVLEGE

Query:  L
        +
Subjt:  L

AT1G71840.1 transducin family protein / WD-40 repeat family protein | 9.5e-48 | 68.09
Query:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDD-EYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELT
        +N   EE+D GE+FL E D+L E+ VDEEDLP+A D++  DD E  DE+DDS+HTFTGH GE+Y +ACSP DATLVATGGGDDK F+WKIG GD+A EL 
Subjt:  SNTHEEEDDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDD-EYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELT

Query:  GHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTL
        GHKDSVS LAFS +GQLLASG  DGV+QI+D SSG LKC L
Subjt:  GHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTL

AT3G49660.1 Transducin/WD40 repeat-like superfamily protein | 1.1e-08 | 30.85
Query:  EDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTL
        E    I T  GHT   + V  +P  + ++ +G  D+   +W +  G   + L  H D V+++ F+ +G L+ S S+DG+ +IWD+ +G+   TL
Subjt:  EDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNLKCTL

AT5G25150.1 TBP-associated factor 5 | 6.7e-09 | 36.9
Query:  DSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSS
        + +  F GH   V ++A SP D   +A+G  D    MW +        L GH   V SL++S  G LLASGS D  +++WD +S
Subjt:  DSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSS

AT5G67320.1 WD-40 repeat family protein | 2.5e-08 | 33.33
Query:  IHTFTGHTGEVYTVACSPI--------DATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNL
        +H    HT E+YT+  SP             +A+   D    +W    G       GH++ V SLAFS NG+ +ASGS D  I IW    G +
Subjt:  IHTFTGHTGEVYTVACSPI--------DATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLASGSFDGVIQIWDTSSGNL
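
In the alignments above, the unlabeled middle line marks each column. Under the usual BLAST-style convention, a letter is an identical residue, `+` is a conservative substitution, and a space is a mismatch or gap. The Python sketch below recomputes % identity from that middle line for the first GenBank hit (KAG6592326.1), assuming that convention holds for this page. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline_chunks):
    """Percent of alignment columns whose midline character is a letter."""
    midline = "".join(midline_chunks)
    # Assumption: letters mark identities; '+' and spaces do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline chunks of the KAG6592326.1 alignment, copied at full width.
kag6592326_midline = [
    "DDHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSS",
    "LAFSANGQLLASGSFDGVIQIWDTSSGNLKCTL+",
]
print(percent_identity(kag6592326_midline))
```

This gives 133 identical columns out of 134, which agrees with the 99.25 listed for that hit. Each chunk must keep its full alignment width, because stripping leading or trailing spaces would drop mismatch columns.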


Sequences
CDS sequence
ATGGTAGCGGACTTGCGATTGTACGACACTCACTTCCCAGCAACAAGTGAGTTAAGCATATTCGTTCTTGCGTTGCCTACGGCCAAGGGACGTATGCTCGAGCTTCTTGC
CCCGGTTCAAGAAGAAAGTTTTAGAAAACAGGGACAGAGAGATTTCGAAAGAGGCTCAGTGTCGCCGCCGCCATCTTTCTGCCCAATTCCAGTTATTGTACCACGAGAGA
TCTATGAACATTCCAGCCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTGTTCCTCTAACACACATGAGGAGGAAGAT
GATCACGGTGAGGTCTTCCTCGACGAAGATGACATACTCCACGAAGTCCAAGTTGATGAGGAAGATCTTCCTGATGCTGTTGATGAAGAAGGCTATGATGATGAATATGT
TGATGAGGATGATGATTCAATTCATACATTTACTGGCCATACCGGTGAGGTTTACACGGTGGCTTGTAGTCCAATCGATGCCACATTAGTTGCGACGGGTGGTGGAGATG
ACAAAGGTTTTATGTGGAAGATCGGTCGAGGAGATTTTGCACAGGAGCTTACCGGTCACAAGGATTCTGTGTCTAGCTTAGCATTTAGTGCAAATGGACAACTACTTGCA
TCAGGGAGTTTTGATGGTGTTATCCAAATTTGGGATACTTCATCCGGGAATCTTAAATGCACACTGAAGGTCCTGGAGGGGGAATTGAGTGGGTAA
mRNA sequence
ATGGTAGCGGACTTGCGATTGTACGACACTCACTTCCCAGCAACAAGTGAGTTAAGCATATTCGTTCTTGCGTTGCCTACGGCCAAGGGACGTATGCTCGAGCTTCTTGC
CCCGGTTCAAGAAGAAAGTTTTAGAAAACAGGGACAGAGAGATTTCGAAAGAGGCTCAGTGTCGCCGCCGCCATCTTTCTGCCCAATTCCAGTTATTGTACCACGAGAGA
TCTATGAACATTCCAGCCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTGTTCCTCTAACACACATGAGGAGGAAGAT
GATCACGGTGAGGTCTTCCTCGACGAAGATGACATACTCCACGAAGTCCAAGTTGATGAGGAAGATCTTCCTGATGCTGTTGATGAAGAAGGCTATGATGATGAATATGT
TGATGAGGATGATGATTCAATTCATACATTTACTGGCCATACCGGTGAGGTTTACACGGTGGCTTGTAGTCCAATCGATGCCACATTAGTTGCGACGGGTGGTGGAGATG
ACAAAGGTTTTATGTGGAAGATCGGTCGAGGAGATTTTGCACAGGAGCTTACCGGTCACAAGGATTCTGTGTCTAGCTTAGCATTTAGTGCAAATGGACAACTACTTGCA
TCAGGGAGTTTTGATGGTGTTATCCAAATTTGGGATACTTCATCCGGGAATCTTAAATGCACACTGAAGGTCCTGGAGGGGGAATTGAGTGGGTAA
Protein sequence
MVADLRLYDTHFPATSELSIFVLALPTAKGRMLELLAPVQEESFRKQGQRDFERGSVSPPPSFCPIPVIVPREIYEHSSPKQSHESLCSKWTISYHCGESCSSNTHEEED
DHGEVFLDEDDILHEVQVDEEDLPDAVDEEGYDDEYVDEDDDSIHTFTGHTGEVYTVACSPIDATLVATGGGDDKGFMWKIGRGDFAQELTGHKDSVSSLAFSANGQLLA
SGSFDGVIQIWDTSSGNLKCTLKVLEGELSG
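
The protein sequence can be checked against the CDS by translating it codon by codon. The Python sketch below assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical. It is applied only to the first 21 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (assumption:
# the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS sequence listed above.
print(translate("ATGGTAGCGGACTTGCGATTG"))
```

The result, MVADLRL, matches the start of the protein sequence. Passing the full CDS with its line breaks removed extends the same check to the whole protein.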