; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg18926 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg18926
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionGolgin family A protein
Genome locationCarg_Chr17:5652490..5653233
RNA-Seq ExpressionCarg18926
SyntenyCarg18926
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575392.1 hypothetical protein SDJN03_26031, partial [Cucurbita argyrosperma subsp. sororia]5.0e-13698.8Show/hide
Query:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG
        MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFA NLLSGG
Subjt:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG

Query:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSP--GGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA
        DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSP  GGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA
Subjt:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSP--GGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA

Query:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
        ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
Subjt:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD

KAG7013932.1 hypothetical protein SDJN02_24101, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-138100Show/hide
Query:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG
        MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG
Subjt:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG

Query:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAER
        DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAER
Subjt:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAER

Query:  SEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
        SEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
Subjt:  SEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD

XP_022953699.1 uncharacterized protein LOC111456149 [Cucurbita moschata]2.7e-12994.33Show/hide
Query:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG
        MSLCLQAKALPVCAWRSDG RNRA PSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG
Subjt:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG

Query:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAER
        DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGG    G GGGG+RSERR+RLRVKREKSKGV ER
Subjt:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAER

Query:  SEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
        SE + ENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAK+HLLD
Subjt:  SEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD

XP_022992375.1 uncharacterized protein LOC111488702 [Cucurbita maxima]4.9e-12391.97Show/hide
Query:  MSLCLQAKALPVCAWRS--DGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS
        MSLCLQAKALPVCAWRS  DG RNRAPP+PNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS
Subjt:  MSLCLQAKALPVCAWRS--DGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS

Query:  GGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA
        GG GR GT RDGVVSELLEISDRFGWDWDG    GGWRDVNFELLGTSKGGRIPRR +PTAQKVSPGG      GGGGGGRRSERRNRLRVKREKSKGV 
Subjt:  GGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA

Query:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
        ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
Subjt:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD

XP_023548140.1 uncharacterized protein LOC111806868 [Cucurbita pepo subsp. pepo]2.0e-12994.38Show/hide
Query:  MSLCLQAKALPVCAWRS--DGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS
        MSLCLQAKALPVCAWRS  DG RNRAPP+PNPIPPRDRVIGFGRHKGKMLGAL SSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAH+L+S
Subjt:  MSLCLQAKALPVCAWRS--DGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS

Query:  GGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA
        GGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRG+P  QKVSP  GGGGGSGGGGGGRRSERR+RLRVKREKSKGVA
Subjt:  GGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA

Query:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
        ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGR+TLLNRFTTAKSHLLD
Subjt:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD

TrEMBL top hitse value%identityAlignment
A0A0A0K7H6 Uncharacterized protein7.4e-6963.05Show/hide
Query:  SLCLQAKALP---VCAWRSDGTRNRA-PPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLL
        SLCL   AL    V   RS G   R+     NP+P RDRVIGFG+HKGKMLG LPS+YLKWISKNLRAR+ EEWAILADQVLEDP+YQDR+QWEFAHN+L
Subjt:  SLCLQAKALP---VCAWRSDGTRNRA-PPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLL

Query:  SGGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGV
        +G  GR  +GRD VVSEL EISDRFGWDWD   +  GWR V+FELLGTSKGGRIPRR +PT +  S       G GGGGGGRR ERR+RLR KREKS G 
Subjt:  SGGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGV

Query:  AERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLL
         E+SE K E           +NPV   N  FPGRQ LL R  T KS LL
Subjt:  AERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLL

A0A1S3CGX9 uncharacterized protein LOC1035006172.3e-7065.46Show/hide
Query:  SLCLQAKA-LPVCAW--RSDGTRNRA-PPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLL
        SLCL   A    C W  RS G   R+     NPIP RDRVIGFG+HKGKMLG LPS+YLKWISKNLRAR+ EEWAILADQVLEDPVYQDR+QWEFAHN+L
Subjt:  SLCLQAKA-LPVCAW--RSDGTRNRA-PPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLL

Query:  SGGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGV
        +G  GR  +GRD VVSEL EISDRFGWDWD   +  GWRDV+FELLGTSKGGRIPRR +PT    S        SGGGGGGRR ERR+RLR KREKSKG 
Subjt:  SGGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGV

Query:  AERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLL
         E+SE K EN          +NPV   N  FPGRQ LL R TT KS LL
Subjt:  AERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLL

A0A5D3BWA7 Uncharacterized protein1.1e-6965.06Show/hide
Query:  SLCLQAKA-LPVCAW--RSDGTRNRA-PPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLL
        SLCL   A    C W  RS G   R+     NPIP RDRVIGFG+HKGKMLG LPS+YLKWISKNLRAR+ EEWA LADQVLEDPVYQDR+QWEFAHN+L
Subjt:  SLCLQAKA-LPVCAW--RSDGTRNRA-PPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLL

Query:  SGGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGV
        +G  GR  +GRD VVSEL EISDRFGWDWD   +  GWRDV+FELLGTSKGGRIPRR +PT    S        SGGGGGGRR ERR+RLR KREKSKG 
Subjt:  SGGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGV

Query:  AERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLL
         E+SE K EN          +NPV   N  FPGRQ LL R TT KS LL
Subjt:  AERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLL

A0A6J1GQE9 uncharacterized protein LOC1114561491.3e-12994.33Show/hide
Query:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG
        MSLCLQAKALPVCAWRSDG RNRA PSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG
Subjt:  MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGG

Query:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAER
        DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGG    G GGGG+RSERR+RLRVKREKSKGV ER
Subjt:  DGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAER

Query:  SEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
        SE + ENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAK+HLLD
Subjt:  SEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD

A0A6J1JVI9 uncharacterized protein LOC1114887022.4e-12391.97Show/hide
Query:  MSLCLQAKALPVCAWRS--DGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS
        MSLCLQAKALPVCAWRS  DG RNRAPP+PNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS
Subjt:  MSLCLQAKALPVCAWRS--DGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLS

Query:  GGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA
        GG GR GT RDGVVSELLEISDRFGWDWDG    GGWRDVNFELLGTSKGGRIPRR +PTAQKVSPGG      GGGGGGRRSERRNRLRVKREKSKGV 
Subjt:  GGDGRRGTGRDGVVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVA

Query:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
        ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD
Subjt:  ERSEMKAENNPLPTPKAQPDNPVTGMNRRFPGRQTLLNRFTTAKSHLLD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G51080.1 unknown protein4.5e-3444.55Show/hide
Query:  RDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGGDG--------RRGTGRDGVVSELLEISDRFGWD
        RD +I FG+HKGKMLG LPSSYLKW+SKNLRA   E WA LAD+VLED VY+DR +WEFA  +L G D         R+       VS LLEIS+RFGWD
Subjt:  RDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGGDG--------RRGTGRDGVVSELLEISDRFGWD

Query:  WDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGG-------GGGGGSGGGGGGRRSERRNRLRVKREKSKG-VAERSEMKAENNPLPTPKAQP
         +      GW  +NFELLGTSKGGRIPR      ++   G                  G RR +RR R+R    +  G    RSE K     L   + Q 
Subjt:  WDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGG-------GGGGGSGGGGGGRRSERRNRLRVKREKSKG-VAERSEMKAENNPLPTPKAQP

Query:  DNPVTGMNRRFPGRQTLLNR
        +  +      FPGR++LL +
Subjt:  DNPVTGMNRRFPGRQTLLNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTGTGTTTGCAGGCGAAGGCGTTGCCGGTGTGTGCGTGGAGAAGCGATGGGACAAGAAACAGGGCACCTCCCAGCCCTAACCCTATTCCACCCAGAGACCGCGT
CATAGGGTTTGGGCGGCACAAGGGCAAGATGCTGGGAGCCCTGCCTTCATCCTACCTCAAATGGATCTCCAAAAATCTCAGAGCTAGGGACACGGAGGAGTGGGCCATTT
TGGCAGACCAGGTTCTGGAGGACCCGGTTTACCAAGACCGCATCCAGTGGGAGTTCGCCCACAACCTTCTGAGCGGCGGGGATGGGAGAAGGGGAACTGGCCGCGACGGC
GTCGTTTCTGAGCTGTTGGAGATCAGTGATAGGTTTGGCTGGGATTGGGACGGTGGCGCCCACGACGGCGGTTGGAGAGACGTTAACTTCGAGCTCTTGGGGACCTCTAA
AGGTGGAAGAATCCCTCGGCGAGGCGATCCGACGGCGCAGAAGGTGTCGCCGGGCGGGGGCGGAGGCGGAGGCAGTGGCGGCGGCGGTGGCGGAAGAAGGAGCGAGAGAA
GAAATCGGCTGAGGGTGAAGCGAGAGAAATCGAAGGGAGTGGCGGAGAGAAGTGAAATGAAAGCGGAGAACAATCCTCTTCCCACTCCCAAAGCTCAACCGGATAATCCG
GTTACCGGGATGAACCGTCGCTTCCCTGGTCGTCAAACTCTGCTAAACCGGTTTACCACAGCCAAATCACATTTATTGGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTGTGTTTGCAGGCGAAGGCGTTGCCGGTGTGTGCGTGGAGAAGCGATGGGACAAGAAACAGGGCACCTCCCAGCCCTAACCCTATTCCACCCAGAGACCGCGT
CATAGGGTTTGGGCGGCACAAGGGCAAGATGCTGGGAGCCCTGCCTTCATCCTACCTCAAATGGATCTCCAAAAATCTCAGAGCTAGGGACACGGAGGAGTGGGCCATTT
TGGCAGACCAGGTTCTGGAGGACCCGGTTTACCAAGACCGCATCCAGTGGGAGTTCGCCCACAACCTTCTGAGCGGCGGGGATGGGAGAAGGGGAACTGGCCGCGACGGC
GTCGTTTCTGAGCTGTTGGAGATCAGTGATAGGTTTGGCTGGGATTGGGACGGTGGCGCCCACGACGGCGGTTGGAGAGACGTTAACTTCGAGCTCTTGGGGACCTCTAA
AGGTGGAAGAATCCCTCGGCGAGGCGATCCGACGGCGCAGAAGGTGTCGCCGGGCGGGGGCGGAGGCGGAGGCAGTGGCGGCGGCGGTGGCGGAAGAAGGAGCGAGAGAA
GAAATCGGCTGAGGGTGAAGCGAGAGAAATCGAAGGGAGTGGCGGAGAGAAGTGAAATGAAAGCGGAGAACAATCCTCTTCCCACTCCCAAAGCTCAACCGGATAATCCG
GTTACCGGGATGAACCGTCGCTTCCCTGGTCGTCAAACTCTGCTAAACCGGTTTACCACAGCCAAATCACATTTATTGGATTAA
Protein sequenceShow/hide protein sequence
MSLCLQAKALPVCAWRSDGTRNRAPPSPNPIPPRDRVIGFGRHKGKMLGALPSSYLKWISKNLRARDTEEWAILADQVLEDPVYQDRIQWEFAHNLLSGGDGRRGTGRDG
VVSELLEISDRFGWDWDGGAHDGGWRDVNFELLGTSKGGRIPRRGDPTAQKVSPGGGGGGGSGGGGGGRRSERRNRLRVKREKSKGVAERSEMKAENNPLPTPKAQPDNP
VTGMNRRFPGRQTLLNRFTTAKSHLLD