CuGenDBv2

CsGy7G020060 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy7G020060
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Golgin family A protein
Genome location: Gy14Chr7:22553173..22554093
RNA-Seq Expression: CsGy7G020060
Synteny: CsGy7G020060
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAA0059334.1 uncharacterized protein E6C27_scaffold242G00590 [Cucumis melo var. makuwa] | 7.80e-161 | 93.09
Query:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
        MV VFAGSTSLCLPGNA AAACV RRRS+G TIRSEAEG+NPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWA LADQVLEDPVYQDRV
Subjt:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV

Query:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG
        QWEFAHNVLTGAGRSGRD+VVSELSEISDRFGWDWDNKSGWR VDFELLGTSKGGRIPRRIEPTQ SE KTA+NVSGGGGGGRRRERRDRLREKR+KS G
Subjt:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG

Query:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN
        GGEKSE KTE ENPVPRFNNPFPGRQALLKRV TIKSDLLVKKKPN
Subjt:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN

XP_004141831.3 uncharacterized protein LOC101215921 [Cucumis sativus] | 3.12e-173 | 97.99
Query:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
        MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNP+PGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDP+YQDRV
Subjt:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV

Query:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGG--RRRERRDRLREKRDKS
        QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGG  RRRERRDRLREKR+KS
Subjt:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGG--RRRERRDRLREKRDKS

Query:  TGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS
        TGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS
Subjt:  TGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS

XP_008462203.1 PREDICTED: uncharacterized protein LOC103500617 [Cucumis melo] | 9.50e-162 | 93.5
Query:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
        MV VFAGSTSLCLPGNA AAACV RRRS+G TIRSEAEG+NPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
Subjt:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV

Query:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG
        QWEFAHNVLTGAGRSGRD+VVSELSEISDRFGWDWDNKSGWR VDFELLGTSKGGRIPRRIEPTQ SE KTA+NVSGGGGGGRRRERRDRLREKR+KS G
Subjt:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG

Query:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN
        GGEKSE KTE ENPVPRFNNPFPGRQALLKRV TIKSDLLVKKKPN
Subjt:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN

XP_022992375.1 uncharacterized protein LOC111488702 [Cucurbita maxima] | 2.15e-91 | 64.78
Query:  SLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVL
        SLCL   AL   C  R    GI  R+     NPIP RDRVIGFG+HKGKMLG LPS+YLKWISKNLRAR+ EEWAILADQVLEDPVYQDR+QWEFAHN+L
Subjt:  SLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVL

Query:  ---TGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS---GGGGGGRRRERRDRLREKRDKSTGGGE
           TG G + RD VVSEL EISDRFGWDWD   GWR V+FELLGTSKGGRIPRR EPT       AQ VS   GGGGGGRR ERR+RLR KR+KS G  E
Subjt:  ---TGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS---GGGGGGRRRERRDRLREKRDKSTGGGE

Query:  KSERKTEI----------ENPVPRFNNPFPGRQALLKRVATIKSDLL
        +SE K E           +NPV   N  FPGRQ LL R  T KS LL
Subjt:  KSERKTEI----------ENPVPRFNNPFPGRQALLKRVATIKSDLL

XP_038898819.1 uncharacterized protein LOC120086315 [Benincasa hispida] | 8.36e-137 | 79.13
Query:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGIT--IRSEAEGKNP--IPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVY
        MV + AGST LCL G  LAA C+ RRRS GI   IRSEAEG+N   +PGRDRV+GFGKHKGKMLGTLPSTYLKW+SKNLRAREFEEWAILADQVLEDPVY
Subjt:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGIT--IRSEAEGKNP--IPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVY

Query:  QDRVQWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS---GGGGGGRRRERRDRLRE
        QDR+QWEFAHN+LTGAGR+GRD+VVSEL EISDRFGWDW+NKSGWR V+FELLGTSKGGRIPRRIE TQKSESKT  NVS   GGGGGGRRRERR+RLR 
Subjt:  QDRVQWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS---GGGGGGRRRERRDRLRE

Query:  KRDKSTGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS
        KR+KS G  EKSE KTE ENP PRFNNPFPGRQ LL R  T KS L +KK PNS
Subjt:  KRDKSTGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K7H6 Uncharacterized protein | 1.51e-173 | 97.99
Query:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
        MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNP+PGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDP+YQDRV
Subjt:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV

Query:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGG--RRRERRDRLREKRDKS
        QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGG  RRRERRDRLREKR+KS
Subjt:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGG--RRRERRDRLREKRDKS

Query:  TGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS
        TGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS
Subjt:  TGGGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPNS

A0A1S3CGX9 uncharacterized protein LOC103500617 | 4.60e-162 | 93.5
Query:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
        MV VFAGSTSLCLPGNA AAACV RRRS+G TIRSEAEG+NPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
Subjt:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV

Query:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG
        QWEFAHNVLTGAGRSGRD+VVSELSEISDRFGWDWDNKSGWR VDFELLGTSKGGRIPRRIEPTQ SE KTA+NVSGGGGGGRRRERRDRLREKR+KS G
Subjt:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG

Query:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN
        GGEKSE KTE ENPVPRFNNPFPGRQALLKRV TIKSDLLVKKKPN
Subjt:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN

A0A5D3BWA7 Uncharacterized protein | 3.78e-161 | 93.09
Query:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV
        MV VFAGSTSLCLPGNA AAACV RRRS+G TIRSEAEG+NPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWA LADQVLEDPVYQDRV
Subjt:  MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRV

Query:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG
        QWEFAHNVLTGAGRSGRD+VVSELSEISDRFGWDWDNKSGWR VDFELLGTSKGGRIPRRIEPTQ SE KTA+NVSGGGGGGRRRERRDRLREKR+KS G
Subjt:  QWEFAHNVLTGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTG

Query:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN
        GGEKSE KTE ENPVPRFNNPFPGRQALLKRV TIKSDLLVKKKPN
Subjt:  GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN

A0A6J1GQE9 uncharacterized protein LOC111456149 | 1.29e-88 | 63.39
Query:  SLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVL
        SLCL   AL    V   RS GI  R+ A   NPIP RDRVIGFG+HKGKMLG LPS+YLKWISKNLRAR+ EEWAILADQVLEDPVYQDR+QWEFAHN+L
Subjt:  SLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVL

Query:  TGA-GR--SGRDNVVSELSEISDRFGWDWD---NKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS-----GGGGGGRRRERRDRLREKRDKS
        +G  GR  +GRD VVSEL EISDRFGWDWD   +  GWR V+FELLGTSKGGRIPRR +PT       AQ VS     G GGGG+R ERRDRLR KR+KS
Subjt:  TGA-GR--SGRDNVVSELSEISDRFGWDWD---NKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS-----GGGGGGRRRERRDRLREKRDKS

Query:  TGGGEKSERKTEIEN------------PVPRFNNPFPGRQALLKRVATIKSDLL
         G  E+SE  TE+EN            PV   N  FPGRQ LL R  T K+ LL
Subjt:  TGGGEKSERKTEIEN------------PVPRFNNPFPGRQALLKRVATIKSDLL

A0A6J1JVI9 uncharacterized protein LOC111488702 | 1.04e-91 | 64.78
Query:  SLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVL
        SLCL   AL   C  R    GI  R+     NPIP RDRVIGFG+HKGKMLG LPS+YLKWISKNLRAR+ EEWAILADQVLEDPVYQDR+QWEFAHN+L
Subjt:  SLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVL

Query:  ---TGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS---GGGGGGRRRERRDRLREKRDKSTGGGE
           TG G + RD VVSEL EISDRFGWDWD   GWR V+FELLGTSKGGRIPRR EPT       AQ VS   GGGGGGRR ERR+RLR KR+KS G  E
Subjt:  ---TGAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVS---GGGGGGRRRERRDRLREKRDKSTGGGE

Query:  KSERKTEI----------ENPVPRFNNPFPGRQALLKRVATIKSDLL
        +SE K E           +NPV   N  FPGRQ LL R  T KS LL
Subjt:  KSERKTEI----------ENPVPRFNNPFPGRQALLKRVATIKSDLL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G51080.1 unknown protein | 5.7e-45 | 46.79
Query:  IPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVLTGAGRSGRD-----------NVVSELSEISDRF
        +  RD +I FGKHKGKMLGTLPS+YLKW+SKNLRA  FE WA LAD+VLED VY+DR +WEFA  +L G+  S R            N VS L EIS+RF
Subjt:  IPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVLTGAGRSGRD-----------NVVSELSEISDRF

Query:  GWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESK---------TAQNVSGGGGGGRRRERRDRLREKRDKSTGGG-EKSERK------TEIENPV
        GWD ++K GW  ++FELLGTSKGGRIPR  +  ++ E +           +       G RR++RR+R+R+   +  G    +SE+K       ++E  +
Subjt:  GWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESK---------TAQNVSGGGGGGRRRERRDRLREKRDKSTGGG-EKSERK------TEIENPV

Query:  -PRFNNPFPGRQALLKRV
         P+  +PFPGR++LLK+V
Subjt:  -PRFNNPFPGRQALLKRV
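
The % identity values can be related to the alignment rows. In BLAST-style output the middle row repeats the residue where query and subject agree, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The Python sketch below assumes that convention holds for this page. `percent_identity` is a hypothetical helper, not part of CuGenDB, and the example uses only the last block of the KAA0059334.1 alignment, so it gives a per-block figure rather than the 93.09 reported for the whole hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue.

    Assumes BLAST-style rows of equal length: the midline carries the residue
    for an identity, '+' for a positive, and a space for a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # Gap columns ('-' in the query) never count: their midline is a space.
    identical = sum(1 for q, m in zip(query, midline) if q != "-" and q == m)
    return 100.0 * identical / len(query)


# Last alignment block of the KAA0059334.1 hit, copied from the record above.
query_block = "GGEKSERKTEIENPVPRFNNPFPGRQALLKRVATIKSDLLVKKKPN"
midline_block = "GGEKSE KTE ENPVPRFNNPFPGRQALLKRV TIKSDLLVKKKPN"

print(f"{percent_identity(query_block, midline_block):.2f}% identity in this block")
```

To reproduce a whole-hit figure, the same count would have to be summed over all blocks of one alignment and divided by the total number of columns, including gap columns.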


Sequences
CDS sequence
ATGGTTGTTGTTTTTGCCGGTTCCACGAGTCTGTGTTTGCCGGGAAATGCTTTAGCGGCGGCATGCGTGCGGAGAAGACGAAGCCAGGGAATTACTATAAGAAGCGAGGC
AGAAGGGAAAAATCCGATTCCAGGTAGAGACCGGGTGATAGGGTTTGGAAAACACAAGGGCAAAATGCTTGGAACCCTTCCTTCAACCTATCTGAAATGGATCTCCAAAA
ACCTTCGAGCAAGAGAGTTCGAAGAGTGGGCGATTTTAGCAGACCAAGTTCTGGAAGACCCGGTTTACCAAGACCGGGTCCAGTGGGAGTTCGCTCACAACGTTTTGACC
GGCGCTGGTCGGAGTGGCCGTGACAACGTCGTTTCTGAGTTATCGGAGATCAGTGACCGGTTTGGCTGGGATTGGGATAATAAATCCGGTTGGAGAGGTGTGGATTTTGA
GCTCTTGGGGACCTCGAAAGGTGGAAGAATTCCACGGCGAATTGAACCAACTCAGAAATCGGAATCGAAGACTGCGCAAAATGTATCAGGTGGCGGCGGCGGCGGAAGGA
GGAGGGAGAGAAGAGATCGGTTGAGAGAGAAGCGAGATAAATCGACGGGAGGCGGCGAGAAAAGTGAAAGGAAAACGGAGATTGAGAATCCCGTTCCCAGGTTTAACAAT
CCCTTCCCTGGCCGTCAAGCTCTTCTAAAACGGGTTGCCACAATTAAATCAGATTTGTTAGTTAAAAAGAAACCCAATTCCTAA
mRNA sequence
GTTCGTGCGTCTAGTCGAGGTTTAAAAATGGTTGTTGTTTTTGCCGGTTCCACGAGTCTGTGTTTGCCGGGAAATGCTTTAGCGGCGGCATGCGTGCGGAGAAGACGAAG
CCAGGGAATTACTATAAGAAGCGAGGCAGAAGGGAAAAATCCGATTCCAGGTAGAGACCGGGTGATAGGGTTTGGAAAACACAAGGGCAAAATGCTTGGAACCCTTCCTT
CAACCTATCTGAAATGGATCTCCAAAAACCTTCGAGCAAGAGAGTTCGAAGAGTGGGCGATTTTAGCAGACCAAGTTCTGGAAGACCCGGTTTACCAAGACCGGGTCCAG
TGGGAGTTCGCTCACAACGTTTTGACCGGCGCTGGTCGGAGTGGCCGTGACAACGTCGTTTCTGAGTTATCGGAGATCAGTGACCGGTTTGGCTGGGATTGGGATAATAA
ATCCGGTTGGAGAGGTGTGGATTTTGAGCTCTTGGGGACCTCGAAAGGTGGAAGAATTCCACGGCGAATTGAACCAACTCAGAAATCGGAATCGAAGACTGCGCAAAATG
TATCAGGTGGCGGCGGCGGCGGAAGGAGGAGGGAGAGAAGAGATCGGTTGAGAGAGAAGCGAGATAAATCGACGGGAGGCGGCGAGAAAAGTGAAAGGAAAACGGAGATT
GAGAATCCCGTTCCCAGGTTTAACAATCCCTTCCCTGGCCGTCAAGCTCTTCTAAAACGGGTTGCCACAATTAAATCAGATTTGTTAGTTAAAAAGAAACCCAATTCCTA
AGAGAAAGCAACTGTATATAGAGACCATAACCTTGAGAAAATTGATAGCAGAATCTTCGTTCATTACTGAGAATTATGTAGAGTCACTAAAATAATATGTTTATTATTGA
GTGAGCGTTTGGAACAAGACAATATAAAGTCTCAGCTCAAT
Protein sequence
MVVVFAGSTSLCLPGNALAAACVRRRRSQGITIRSEAEGKNPIPGRDRVIGFGKHKGKMLGTLPSTYLKWISKNLRAREFEEWAILADQVLEDPVYQDRVQWEFAHNVLT
GAGRSGRDNVVSELSEISDRFGWDWDNKSGWRGVDFELLGTSKGGRIPRRIEPTQKSESKTAQNVSGGGGGGRRRERRDRLREKRDKSTGGGEKSERKTEIENPVPRFNN
PFPGRQALLKRVATIKSDLLVKKKPNS
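
The protein sequence can be checked against the CDS by translating it codon by codon. The Python sketch below assumes the standard nuclear genetic code applies to this gene. `translate` is a hypothetical helper written for illustration, and only the first 30 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS sequence shown above.
cds_head = "ATGGTTGTTGTTTTTGCCGGTTCCACGAGT"
print(translate(cds_head))  # MVVVFAGSTS, the first ten residues of the protein
```

Running `translate` on the full CDS, with its lines joined into one string, should reproduce the whole protein sequence if the two records are consistent.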