CuGenDBv2

Cucsat.G83 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G83
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Ubiquitin-like domain-containing protein
Genome location: ctg1:1305935..1312384
RNA-Seq Expression: Cucsat.G83
Synteny: Cucsat.G83
Gene Ontology terms:
    GO:0016021 - integral component of membrane (cellular component)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR000626 - Ubiquitin-like domain
    IPR019387 - Uncharacterised domain SAYSvFN
    IPR029071 - Ubiquitin-like domain superfamily
    IPR039159 - SAYSvFN domain-containing protein 1
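
The genome location field in this record uses a `contig:start..end` form. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "ctg1:1305935..1312384".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("ctg1:1305935..1312384")
print(loc.seqid, loc.start, loc.end, loc.length)  # ctg1 1305935 1312384 6450
```

Under that assumption the gene spans 6450 bp on ctg1, which is much longer than the 723 nt CDS listed further down, so the locus presumably contains introns or untranslated regions.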


Homology
GenBank top hits    e value    %identity    Alignment
KAA0053058.1 ubiquitin family protein [Cucumis melo var. makuwa]    8.11e-151    88.54
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKV----------------CDLRKLVAESNRLPIGNLKLILRGKILDD
        MAEIVEDSNFNR        +ISNR PPQDTVEVIVRTIGPTRPSRLLTPSTIKV                CDLRKLVAES+RLPIGNLKLILRGKILDD
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKV----------------CDLRKLVAESNRLPIGNLKLILRGKILDD

Query:  CKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDL
        CKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILL+VVFSLSLKGWAAIL+WFIMAPVAHSWDL
Subjt:  CKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDL

Query:  GPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRL
        GPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRL
Subjt:  GPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRL

XP_004148728.1 uncharacterized protein LOC101223066 [Cucumis sativus]    1.27e-169    100
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

XP_016900685.1 PREDICTED: uncharacterized protein LOC103490803 [Cucumis melo]    3.67e-158    94.58
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVEDSNFNR        +ISNR PPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAES+RLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILL+VVFSLSLKGWAAIL+WFIMAPVAHSWDLGPLYILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

XP_022948058.1 uncharacterized protein LOC111451749 [Cucurbita moschata]    3.39e-149    87.08
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVE  NFNR+I+GG +SSI+   PP++T+E+ +RTIGP RPSRLL PS IKVCDLRKLVAE ++LPIGNLKLILRGKILDDCKN+DDV+VRLNHGDS
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPKPPAEHLRD+ D+DEDDLKFRLPESSSRLKKK+Y FLREKLKFPDILLMV+FSLSLK WAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

XP_038905341.1 uncharacterized protein LOC120091406 isoform X1 [Benincasa hispida]    8.88e-152    90.83
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVE+SNFNR+I GGA+SSI N   PQDTVE+IVRTIGP RPSRLL PS IKV DLRKL+AES+RLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPK PAEHLRDEFDEDEDDLKFRLPESS+ LKKKVYTFLREKLKFPDILLMV+FSLSLK WAAILIWFIMAPVAHSWDLGP+YILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHR+SGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

TrEMBL top hits    e value    %identity    Alignment
A0A0A0L1Y1 Ubiquitin-like domain-containing protein    6.15e-170    100
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

A0A1S4DY93 uncharacterized protein LOC103490803    1.78e-158    94.58
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVEDSNFNR        +ISNR PPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAES+RLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILL+VVFSLSLKGWAAIL+WFIMAPVAHSWDLGPLYILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

A0A5D3CHX8 Ubiquitin family protein    3.93e-151    88.54
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKV----------------CDLRKLVAESNRLPIGNLKLILRGKILDD
        MAEIVEDSNFNR        +ISNR PPQDTVEVIVRTIGPTRPSRLLTPSTIKV                CDLRKLVAES+RLPIGNLKLILRGKILDD
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKV----------------CDLRKLVAESNRLPIGNLKLILRGKILDD

Query:  CKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDL
        CKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILL+VVFSLSLKGWAAIL+WFIMAPVAHSWDL
Subjt:  CKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDL

Query:  GPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRL
        GPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRL
Subjt:  GPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRL

A0A6J1CW45 uncharacterized protein LOC111014834    1.11e-147    87.08
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVE+ NFNR+I GG +SS SN  PP++TV++IVRTIGP RPSRL  PS +KV DLRKLVAESN+LPIGNLKLILRGKILDD KN+DDVYVRLN G+S
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPKPPAEHLRD FD+DEDDLKFRLPESSSRLKKK+Y FLREKLKFPDILLMV+FSLSLK WAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHR+SGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

A0A6J1G8B5 uncharacterized protein LOC111451749    1.64e-149    87.08
Query:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS
        MAEIVE  NFNR+I+GG +SSI+   PP++T+E+ +RTIGP RPSRLL PS IKVCDLRKLVAE ++LPIGNLKLILRGKILDDCKN+DDV+VRLNHGDS
Subjt:  MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDS

Query:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
        LTVAVKPKPPAEHLRD+ D+DEDDLKFRLPESSSRLKKK+Y FLREKLKFPDILLMV+FSLSLK WAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN
Subjt:  LTVAVKPKPPAEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLN

Query:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
        LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF
Subjt:  LGHRRSGEMSAYSIFNEGFRELPGTLNADRLDRDVRLGQF

SwissProt top hits    e value    %identity    Alignment
No hits found

Arabidopsis top hits    e value    %identity    Alignment
AT2G35360.1 ubiquitin family protein    7.2e-69    58.72
Query:  ISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDED
        +S+ +    TVE+  +TIGP RPS++   S IK+ DLR  +AE  + P+  L++ILRGK L D ++ DD+YV L   DS  VAV P PPA  +    D+D
Subjt:  ISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDSLTVAVKPKPPAEHLRDEFDED

Query:  EDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRE
        +DDLKF+LP S+SR K+K+Y FLR KLK PDI+LM +FSLSLK W  I +WFI+AP+AH WDLGP++ILGTGF IILLNLG R+ G++SAYSIFNE FRE
Subjt:  EDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFRE

Query:  LPGTLNADRLDRDVRLGQ
        LPGT NADR+DRD+R GQ
Subjt:  LPGTLNADRLDRDVRLGQ
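
The %identity column in the hit tables summarizes how many aligned positions carry the same residue. As a rough illustration only, the sketch below counts identities from the middle line of one alignment block. It assumes the usual BLAST midline convention (the residue letter for an identity, `+` for a conservative substitution, a space otherwise). The function name `percent_identity` is hypothetical, and this is not presented as the procedure CuGenDB used to produce its figures.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns in which the midline repeats the query residue.

    Both strings must start at the same alignment column, i.e. the
    'Query:  ' label and the matching midline indent are already removed.
    """
    if not query:
        raise ValueError("empty alignment block")
    # Trailing mismatch columns may have been trimmed as whitespace, so pad.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    # Gap columns stay in the denominator, as in alignment-length-based identity.
    return 100.0 * identical / len(query)


# Toy block: 7 of 8 columns are identical, '+' marks a similar residue.
print(percent_identity("MAEIVEDS", "MAEIVE+S"))  # 87.5
```

To apply this to a full hit, the per-block counts would need to be summed over every block of the alignment before dividing by the total alignment length.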


Sequences
CDS sequence
ATGGCGGAGATCGTTGAGGATTCCAATTTCAATCGGAGCATCACCGGAGGAGCCGATTCGAGCATCAGTAATCGCACTCCTCCACAAGACACAGTCGAAGTCATTGTTCG
AACCATCGGGCCAACACGCCCCTCTCGCCTTCTCACCCCTTCCACGATCAAGGTGTGCGATTTGCGGAAGCTGGTTGCAGAGAGTAATCGATTGCCGATTGGGAATTTGA
AGCTTATTTTACGGGGAAAAATTTTGGATGACTGTAAAAATGAAGATGATGTATATGTGCGGCTCAATCATGGCGATTCATTGACTGTTGCTGTGAAGCCAAAGCCTCCA
GCAGAACATCTTCGTGATGAATTTGATGAAGATGAGGATGACCTGAAGTTTAGGCTTCCAGAGTCTTCAAGTCGTTTGAAGAAAAAAGTCTACACTTTTCTACGTGAGAA
GTTGAAGTTTCCTGATATCCTTTTGATGGTGGTTTTCTCTCTCAGTCTGAAGGGTTGGGCTGCTATTCTCATCTGGTTTATCATGGCACCTGTTGCCCATAGCTGGGACC
TTGGACCTTTATATATACTTGGAACTGGTTTTTGCATCATTCTACTAAATCTTGGACATCGGCGATCTGGGGAAATGAGTGCATATTCCATCTTCAATGAAGGTTTCAGA
GAGCTTCCAGGGACACTGAATGCCGACCGTCTCGATAGAGATGTTCGGCTGGGTCAGTTCTGA
mRNA sequence
ATGGCGGAGATCGTTGAGGATTCCAATTTCAATCGGAGCATCACCGGAGGAGCCGATTCGAGCATCAGTAATCGCACTCCTCCACAAGACACAGTCGAAGTCATTGTTCG
AACCATCGGGCCAACACGCCCCTCTCGCCTTCTCACCCCTTCCACGATCAAGGTGTGCGATTTGCGGAAGCTGGTTGCAGAGAGTAATCGATTGCCGATTGGGAATTTGA
AGCTTATTTTACGGGGAAAAATTTTGGATGACTGTAAAAATGAAGATGATGTATATGTGCGGCTCAATCATGGCGATTCATTGACTGTTGCTGTGAAGCCAAAGCCTCCA
GCAGAACATCTTCGTGATGAATTTGATGAAGATGAGGATGACCTGAAGTTTAGGCTTCCAGAGTCTTCAAGTCGTTTGAAGAAAAAAGTCTACACTTTTCTACGTGAGAA
GTTGAAGTTTCCTGATATCCTTTTGATGGTGGTTTTCTCTCTCAGTCTGAAGGGTTGGGCTGCTATTCTCATCTGGTTTATCATGGCACCTGTTGCCCATAGCTGGGACC
TTGGACCTTTATATATACTTGGAACTGGTTTTTGCATCATTCTACTAAATCTTGGACATCGGCGATCTGGGGAAATGAGTGCATATTCCATCTTCAATGAAGGTTTCAGA
GAGCTTCCAGGGACACTGAATGCCGACCGTCTCGATAGAGATGTTCGGCTGGGTCAGTTCTGA
Protein sequence
MAEIVEDSNFNRSITGGADSSISNRTPPQDTVEVIVRTIGPTRPSRLLTPSTIKVCDLRKLVAESNRLPIGNLKLILRGKILDDCKNEDDVYVRLNHGDSLTVAVKPKPP
AEHLRDEFDEDEDDLKFRLPESSSRLKKKVYTFLREKLKFPDILLMVVFSLSLKGWAAILIWFIMAPVAHSWDLGPLYILGTGFCIILLNLGHRRSGEMSAYSIFNEGFR
ELPGTLNADRLDRDVRLGQF
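
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `translate` is a hypothetical helper written for this illustration, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGCGGAGATCGTTGAGGATTCCAATTTC"))  # MAEIVEDSNF
print(translate("GGTCAGTTCTGA"))                    # GQF
```

Both fragments agree with the start (`MAEIVEDSNF`) and end (`GQF`) of the listed protein sequence. Passing the full 723 nt CDS, with its line breaks, to `translate` would be the way to check all 240 residues.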