; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G03430 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G03430
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUBA domain-containing protein
Genome locationChr5:3657449..3660203
RNA-Seq ExpressionCSPI05G03430
SyntenyCSPI05G03430
Gene Ontology termsGO:0043162 - ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
GO:0000813 - ESCRT I complex (cellular component)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsIPR009060 - UBA-like superfamily
IPR038870 - Ubiquitin-associated protein 1
IPR042575 - Ubiquitin-associated protein 1, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145917.1 uncharacterized protein LOC101219735 [Cucumis sativus]2.2e-122100Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLGGTS
        VVEALLMYDNDTDKAVAHFLGGTS
Subjt:  VVEALLMYDNDTDKAVAHFLGGTS

XP_008437562.1 PREDICTED: uncharacterized protein LOC103482940 isoform X1 [Cucumis melo]6.4e-11495.54Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSH PMY STASS+PSPSPHPMYS SMYPRIGQQAPSSTPPVARLSSHH+SSSPS SPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNF FDFEFEKKVLAEAEKEAPNWNRFGLE  PPKPVESTSSMGSIGDP VSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLGGTS
        VVEALLMYDNDTDKAVAHFLGGTS
Subjt:  VVEALLMYDNDTDKAVAHFLGGTS

XP_008437563.1 PREDICTED: uncharacterized protein LOC103482940 isoform X2 [Cucumis melo]4.6e-11295.48Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSH PMY STASS+PSPSPHPMYS SMYPRIGQQAPSSTPPVARLSSHH+SSSPS SPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNF FDFEFEKKVLAEAEKEAPNWNRFGLE  PPKPVESTSSMGSIGDP VSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLG
        VVEALLMYDNDTDKAVAHFLG
Subjt:  VVEALLMYDNDTDKAVAHFLG

XP_008437564.1 PREDICTED: uncharacterized protein LOC103482940 isoform X3 [Cucumis melo]3.0e-11194.64Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSH PMY STASS+PSPSPHPMYS SMYPRIGQQAPSSTPPVARLSSHH+SSSPS SPSSS  LGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNF FDFEFEKKVLAEAEKEAPNWNRFGLE  PPKPVESTSSMGSIGDP VSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLGGTS
        VVEALLMYDNDTDKAVAHFLGGTS
Subjt:  VVEALLMYDNDTDKAVAHFLGGTS

XP_038874870.1 uncharacterized protein LOC120067367 [Benincasa hispida]1.4e-10890.75Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSS--SPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQV
        MAYD+RN  GHYD+H PMY  TASSSPSPSPHPMYS SMYPRIGQQAPS+T PVARLSSHH+SS  SPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQ+
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSS--SPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQV

Query:  GDIPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVES-TSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFS
        GDIPRSNFQFDFEFEKKVLAEAEKE PNWNRFGLEH P KPVES TSSM SIGDPVVSKYVASGL+REAVSFAVANYGDNPTKVQEFVKGYTLLREMGFS
Subjt:  GDIPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVES-TSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFS

Query:  SIKVVEALLMYDNDTDKAVAHFLGGTS
        S+KVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  SIKVVEALLMYDNDTDKAVAHFLGGTS

TrEMBL top hitse value%identityAlignment
A0A0A0KNB8 Uncharacterized protein1.1e-122100Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLGGTS
        VVEALLMYDNDTDKAVAHFLGGTS
Subjt:  VVEALLMYDNDTDKAVAHFLGGTS

A0A1S3AUC1 uncharacterized protein LOC103482940 isoform X22.2e-11295.48Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSH PMY STASS+PSPSPHPMYS SMYPRIGQQAPSSTPPVARLSSHH+SSSPS SPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNF FDFEFEKKVLAEAEKEAPNWNRFGLE  PPKPVESTSSMGSIGDP VSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLG
        VVEALLMYDNDTDKAVAHFLG
Subjt:  VVEALLMYDNDTDKAVAHFLG

A0A1S3AUG0 uncharacterized protein LOC103482940 isoform X13.1e-11495.54Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSH PMY STASS+PSPSPHPMYS SMYPRIGQQAPSSTPPVARLSSHH+SSSPS SPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNF FDFEFEKKVLAEAEKEAPNWNRFGLE  PPKPVESTSSMGSIGDP VSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLGGTS
        VVEALLMYDNDTDKAVAHFLGGTS
Subjt:  VVEALLMYDNDTDKAVAHFLGGTS

A0A1S3AUY5 uncharacterized protein LOC103482940 isoform X31.4e-11194.64Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD
        MAYDFRNNSGHYDSH PMY STASS+PSPSPHPMYS SMYPRIGQQAPSSTPPVARLSSHH+SSSPS SPSSS  LGIRVTIKPEYRITPPPQLSPQVGD
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGD

Query:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
        IPRSNF FDFEFEKKVLAEAEKEAPNWNRFGLE  PPKPVESTSSMGSIGDP VSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK
Subjt:  IPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIK

Query:  VVEALLMYDNDTDKAVAHFLGGTS
        VVEALLMYDNDTDKAVAHFLGGTS
Subjt:  VVEALLMYDNDTDKAVAHFLGGTS

A0A6J1H2H7 uncharacterized protein LOC1114594992.7e-10285.84Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPS--SSSGLGIRVTIKPEYRITPPPQLSPQV
        MAYD+RN  G+Y++H PMY   ASSSPSPS HPMY+ SMYPRIGQQ  S  PPVAR+SSHH+SSS +PSPS  SSSGLGIRVTIKPEYRITPPPQLSPQV
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPS--SSSGLGIRVTIKPEYRITPPPQLSPQV

Query:  GDIPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSS
        GDIPRSNFQFDFEFEKKVLAEAEKE PNWNRFGLEHPP +PVES+SSMGS GDPVVSKYVASGL+REAVS AVANYGDNPTKVQEFVKGYTLLREMGF S
Subjt:  GDIPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSS

Query:  IKVVEALLMYDNDTDKAVAHFLGGTS
         KVVEALLMYDNDTDKAVAHFLGGTS
Subjt:  IKVVEALLMYDNDTDKAVAHFLGGTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53330.1 Ubiquitin-associated/translation elongation factor EF1B protein2.7e-6259.31Show/hide
Query:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSP-HPMYSHSMYPRIGQQ---APSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSP
        M YD+RN SG     +PMY    S+SPSPS  HPMY    YP+IGQQ    P    P  R SS  +++SP      SSG+GIRV +KPEYRITPPPQL P
Subjt:  MAYDFRNNSGHYDSHQPMYTSTASSSPSPSP-HPMYSHSMYPRIGQQ---APSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSP

Query:  QVGDIPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVE-STSSMGSIG--DPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLRE
        +VGDI RS+FQFDF  E+KVLAEAEK+ P+W++FG E+PP K  E S SS+G +   D VV KY ASGLNREAV+ AVANYGDNPTKVQEF  G+T +RE
Subjt:  QVGDIPRSNFQFDFEFEKKVLAEAEKEAPNWNRFGLEHPPPKPVE-STSSMGSIG--DPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLRE

Query:  MGFSSIKVVEALLMYDNDTDKAVAHFLGGTS
        MGF +  V +AL M++NDTDKA+AH L G+S
Subjt:  MGFSSIKVVEALLMYDNDTDKAVAHFLGGTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTACGATTTCAGAAACAACTCCGGCCACTACGATTCTCACCAGCCGATGTACACCTCAACCGCTTCTTCTTCCCCATCTCCTTCTCCCCATCCCATGTATTCACA
TTCAATGTACCCCAGAATCGGTCAACAAGCTCCTTCATCAACCCCTCCGGTGGCCCGTCTCTCATCCCATCATTATTCTTCTTCTCCATCCCCATCACCTTCTTCTTCGT
CAGGATTGGGCATCAGGGTTACTATTAAACCGGAATATCGAATTACTCCTCCGCCTCAATTATCTCCACAAGTTGGAGATATTCCTCGGAGCAATTTCCAATTTGATTTT
GAGTTTGAGAAAAAGGTTTTAGCTGAAGCAGAGAAAGAAGCTCCAAATTGGAATCGGTTTGGGTTGGAACACCCTCCTCCTAAACCAGTGGAGTCCACATCTTCAATGGG
TTCAATTGGAGATCCAGTTGTGAGCAAGTATGTTGCATCTGGCTTAAATCGGGAAGCTGTTTCATTTGCAGTTGCTAACTATGGAGACAATCCAACCAAGGTTCAAGAAT
TTGTAAAAGGCTACACACTCCTACGAGAAATGGGATTTTCTTCTATCAAGGTGGTTGAGGCATTACTCATGTATGACAATGACACCGATAAGGCTGTAGCTCATTTTCTT
GGTGGTACGTCTTAA
mRNA sequenceShow/hide mRNA sequence
AACAAATCTTAATTGGAGAAATTTGTTGAAAATGAAATCCAAAATTTGAAAATATAAAATGGGAATGATTAGATTAATATTTTGCGTTAAAGTTTGAAACATATTTGGTT
TGACAATCGTCCACCGTCTTTCCCTTGAGCCTTTTCCTCTGTTCTCCAATGGCGTACGATTTCAGAAACAACTCCGGCCACTACGATTCTCACCAGCCGATGTACACCTC
AACCGCTTCTTCTTCCCCATCTCCTTCTCCCCATCCCATGTATTCACATTCAATGTACCCCAGAATCGGTCAACAAGCTCCTTCATCAACCCCTCCGGTGGCCCGTCTCT
CATCCCATCATTATTCTTCTTCTCCATCCCCATCACCTTCTTCTTCGTCAGGATTGGGCATCAGGGTTACTATTAAACCGGAATATCGAATTACTCCTCCGCCTCAATTA
TCTCCACAAGTTGGAGATATTCCTCGGAGCAATTTCCAATTTGATTTTGAGTTTGAGAAAAAGGTTTTAGCTGAAGCAGAGAAAGAAGCTCCAAATTGGAATCGGTTTGG
GTTGGAACACCCTCCTCCTAAACCAGTGGAGTCCACATCTTCAATGGGTTCAATTGGAGATCCAGTTGTGAGCAAGTATGTTGCATCTGGCTTAAATCGGGAAGCTGTTT
CATTTGCAGTTGCTAACTATGGAGACAATCCAACCAAGGTTCAAGAATTTGTAAAAGGCTACACACTCCTACGAGAAATGGGATTTTCTTCTATCAAGGTGGTTGAGGCA
TTACTCATGTATGACAATGACACCGATAAGGCTGTAGCTCATTTTCTTGGTGGTACGTCTTAATTGAAAATTATGAAAATGATTGTGTCAAGAGTTCTGTTGTACTTCAA
CTATGACATCTTTGTATATGTAATATGACTATATAGAGTCCTCTTGGGATTGTAAATGTTATAAATAATATAGGCATAGGCATTTGTTGTGTATTGTAGTCGTTTACCTA
TTGGTGTTTGGCCAACACTTGCAGATCCTCTTAATGACAGCTGATGATGATTACATCTTATATTTTTCATGTCTTTGGATTGAATTCTGCTGCAGAATTAACTGTAGGGT
TGATTGATTGC
Protein sequenceShow/hide protein sequence
MAYDFRNNSGHYDSHQPMYTSTASSSPSPSPHPMYSHSMYPRIGQQAPSSTPPVARLSSHHYSSSPSPSPSSSSGLGIRVTIKPEYRITPPPQLSPQVGDIPRSNFQFDF
EFEKKVLAEAEKEAPNWNRFGLEHPPPKPVESTSSMGSIGDPVVSKYVASGLNREAVSFAVANYGDNPTKVQEFVKGYTLLREMGFSSIKVVEALLMYDNDTDKAVAHFL
GGTS