; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G018590 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G018590
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110414781
Genome locationchr01:20305989..20307703
RNA-Seq ExpressionLsi01G018590
SyntenyLsi01G018590
Gene Ontology termsGO:0010380 - regulation of chlorophyll biosynthetic process (biological process)
GO:0010581 - regulation of starch biosynthetic process (biological process)
GO:0019430 - removal of superoxide radicals (biological process)
GO:0042744 - hydrogen peroxide catabolic process (biological process)
GO:0043085 - positive regulation of catalytic activity (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004791 - thioredoxin-disulfide reductase activity (molecular function)
GO:0008047 - enzyme activator activity (molecular function)
GO:0016671 - oxidoreductase activity, acting on a sulfur group of donors, disulfide as acceptor (molecular function)
GO:0042802 - identical protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043532.1 uncharacterized protein E6C27_scaffold335G00260 [Cucumis melo var. makuwa]3.4e-9280.51Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR
        MA FPSPLERTVASALLLLS S    PSPP TPIS+ EWLFE+N I GKCS E+SAFCDYSN+SSSILT SD SS +P  + LLF T PC H++ LNVVR
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR

Query:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV
        KSRSK++RISE +N S TD+VTLSSGS SSET SCLSSSSSVVTSAPIH +VTRAEKKL MIRHAWRK+Q+ASAHMRRRAEAILSYLS GCSSEVKIRQV
Subjt:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV

Query:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        IGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK+
Subjt:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

KGN53858.2 hypothetical protein Csa_019116 [Cucumis sativus]3.6e-9481.36Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR
        MA FPSPLERTVASALLLLS S    PSPP TPIS+ EWLFEE  I GKCS EMSAFC++SNSSSSILT SD SS +P  + LLFSTSPC H++KLNVVR
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR

Query:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV
        KSRSK++RISE +N S TD+VTLSSGS SSET SCLSSSSSVVTS PIH +VTRAEKKL MIRHAWRK+Q+ASAHMRRRAEAILSYLS GCSSEVKIR+V
Subjt:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV

Query:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        IGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYKV
Subjt:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

XP_004149405.1 uncharacterized protein LOC101216264 [Cucumis sativus]4.7e-9480.93Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR
        MA FPSPLERTVASALLLLS S    PSPP TPIS+ EWLFEE  I GKCS EMSAFC++SNSSSSILT SD SS +P  + LLFSTSPC H++KLNVVR
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR

Query:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV
        KSRSK++RISE +N S TD+VTLSSGS SSET SCLSSSSSVVTS PIH +VTRAEKKL MIRHAWRK+Q+ASAHMRRRAEAILSYLS GCSSEVKIR+V
Subjt:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV

Query:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        IGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK+
Subjt:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

XP_016902598.1 PREDICTED: uncharacterized protein LOC103499533 [Cucumis melo]1.2e-9280.33Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR
        MA FPSPLERTVASALLLLS S    PSPP TPIS+ EWLFE+N I GKCS E+SAFCDYSN+SSSILT SD SS +P  + LLF T PC H++ LNVVR
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR

Query:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV
        KSRSK++RISE +N S TD+VTLSSGS SSET SCLSSSSSVVTSAPIH +VTRAEKKL MIRHAWRK+Q+ASAHMRRRAEAILSYLS GCSSEVKIRQV
Subjt:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV

Query:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKVTLN
        IGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYKV  N
Subjt:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKVTLN

XP_038882356.1 uncharacterized protein LOC120073620 [Benincasa hispida]9.8e-10084.58Show/hide
Query:  MAHFPSPLERTVASALLLLST----SPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKL
        MA FPSPLERTVASALLLLST     PPPPPSPP TP SQ +WLFE N+IGGKCSTE+S FCDYSNSSSSILT S+ESSE+ A++PLLFST PCL E+KL
Subjt:  MAHFPSPLERTVASALLLLST----SPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKL

Query:  NVVRKSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVK
         VVRKSRSKIIRISEK N S TDDVTLSS S SSET SCLSSSSSVVT APIH +V RAEKKL MIRHAWRK+QVASAHMRRRAEAILSYLSGGCSSEVK
Subjt:  NVVRKSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVK

Query:  IRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        IRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYK+
Subjt:  IRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

TrEMBL top hitse value%identityAlignment
A0A0A0L130 Uncharacterized protein2.3e-9480.93Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR
        MA FPSPLERTVASALLLLS S    PSPP TPIS+ EWLFEE  I GKCS EMSAFC++SNSSSSILT SD SS +P  + LLFSTSPC H++KLNVVR
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR

Query:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV
        KSRSK++RISE +N S TD+VTLSSGS SSET SCLSSSSSVVTS PIH +VTRAEKKL MIRHAWRK+Q+ASAHMRRRAEAILSYLS GCSSEVKIR+V
Subjt:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV

Query:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        IGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK+
Subjt:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

A0A1S4E302 uncharacterized protein LOC1034995335.6e-9380.33Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR
        MA FPSPLERTVASALLLLS S    PSPP TPIS+ EWLFE+N I GKCS E+SAFCDYSN+SSSILT SD SS +P  + LLF T PC H++ LNVVR
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR

Query:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV
        KSRSK++RISE +N S TD+VTLSSGS SSET SCLSSSSSVVTSAPIH +VTRAEKKL MIRHAWRK+Q+ASAHMRRRAEAILSYLS GCSSEVKIRQV
Subjt:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV

Query:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKVTLN
        IGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYKV  N
Subjt:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKVTLN

A0A5A7TJT5 Uncharacterized protein1.6e-9280.51Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR
        MA FPSPLERTVASALLLLS S    PSPP TPIS+ EWLFE+N I GKCS E+SAFCDYSN+SSSILT SD SS +P  + LLF T PC H++ LNVVR
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVR

Query:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV
        KSRSK++RISE +N S TD+VTLSSGS SSET SCLSSSSSVVTSAPIH +VTRAEKKL MIRHAWRK+Q+ASAHMRRRAEAILSYLS GCSSEVKIRQV
Subjt:  KSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQV

Query:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        IGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYMYK+
Subjt:  IGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

A0A6J1EL58 uncharacterized protein LOC1114355592.6e-9079.08Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSP---PLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLN
        MA FP  LERTVASALLLLSTSPPPPPSP   P  PISQ EWLFEE I+GGKCS+EMS FCD S S SS+LT SDESSE+ AQ+ LLFSTS    E+KLN
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSP---PLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLN

Query:  VVRKSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKI
        VVRKSRS+ +RIS  +N + TDDVTLSSGS SSET  CLSSSSSV TSAPI  +VTRAEKKL MIRHAWRK+ VASAHMRRRAEAILSYLSGGCSSEVKI
Subjt:  VVRKSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKI

Query:  RQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        RQV+GDSPDTSKALRMLLKLEEIKRSGTGGRQDPY+Y +
Subjt:  RQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

A0A6J1HWL4 uncharacterized protein LOC1114673041.7e-8978.66Show/hide
Query:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLT---PISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLN
        MA FP  LERTVASALLLLSTSPPPP SPP +   PISQ EWLFEE I+GGKCS+EMS FCD S S SS+LT SDESSE+ AQ+ LLFSTS    E+KLN
Subjt:  MAHFPSPLERTVASALLLLSTSPPPPPSPPLT---PISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLN

Query:  VVRKSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKI
        VVRKSRS+ +RIS  +N + TDDVTLSSGS SSET  CLSSSSSV TSAPI  +VTRAEKKL MIRHAWRK+ VASAHMRRRAEAILSYLSGGCSSEVKI
Subjt:  VVRKSRSKIIRISEKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKI

Query:  RQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        RQV+GDSPDTSKALRMLLKLEEIKRSGTGGRQDPY+Y +
Subjt:  RQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G57440.1 unknown protein2.6e-2640.96Show/hide
Query:  MAHFPSPLERTVASALLLLSTSP---PPPPSPPLTPISQV-EWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKL
        MA +PS +ERTVAS+LLLLS  P    P  S  +   S V +W  E     G  +  +      S S  S L+       S  ++  +  T      +  
Subjt:  MAHFPSPLERTVASALLLLSTSP---PPPPSPPLTPISQV-EWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKL

Query:  NVVRKSRSKIIRISEKQNFSYTD-------DVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWR--KEQVASAHMRRRAEAILSYL
           RK RS++I  S   NF  T        DV LS+ SV S+  SCLS+ SS V+S     +  R +K    +R   +  KE   S+ +RRRA+ IL +L
Subjt:  NVVRKSRSKIIRISEKQNFSYTD-------DVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWR--KEQVASAHMRRRAEAILSYL

Query:  SGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV
        S   SSEV IRQ++GDSPDTSKALRMLLK+EE+KR GTGGR DP++YK+
Subjt:  SGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYMYKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACATTTCCCTTCCCCACTTGAACGCACTGTCGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCTCCTCCGCCCTCTCCTCCACTTACACCGATTTCTCAAGT
CGAGTGGCTGTTTGAGGAGAACATTATTGGAGGAAAATGCTCCACAGAGATGTCCGCGTTTTGTGATTATTCGAACTCTTCCTCTTCGATACTCACTGGATCAGATGAAT
CGTCCGAGAGTCCTGCTCAGGATCCGTTGCTGTTTTCTACTTCGCCTTGTCTCCACGAGATAAAGCTTAATGTCGTGAGAAAGAGTCGTTCGAAGATAATACGGATTTCC
GAGAAGCAGAATTTCAGTTATACAGACGACGTTACCTTGTCTTCAGGCTCCGTGTCCTCAGAGACGGCTTCTTGTTTATCAAGCAGCTCAAGCGTGGTCACAAGCGCGCC
GATCCATCACGTGGTTACGAGAGCAGAGAAGAAGTTAGGAATGATTCGTCACGCGTGGAGGAAAGAGCAGGTGGCATCGGCTCATATGCGGCGGCGTGCGGAAGCCATTC
TGAGTTACCTCTCCGGTGGTTGTTCCTCTGAAGTGAAGATACGGCAAGTGATTGGTGACAGCCCTGACACAAGCAAAGCTCTCAGAATGCTGTTGAAACTGGAAGAGATC
AAAAGATCCGGAACAGGTGGGCGTCAAGATCCCTATATGTACAAGGTAACACTCAACTCCCTGCAGCTTCCTCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACATTTCCCTTCCCCACTTGAACGCACTGTCGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCTCCTCCGCCCTCTCCTCCACTTACACCGATTTCTCAAGT
CGAGTGGCTGTTTGAGGAGAACATTATTGGAGGAAAATGCTCCACAGAGATGTCCGCGTTTTGTGATTATTCGAACTCTTCCTCTTCGATACTCACTGGATCAGATGAAT
CGTCCGAGAGTCCTGCTCAGGATCCGTTGCTGTTTTCTACTTCGCCTTGTCTCCACGAGATAAAGCTTAATGTCGTGAGAAAGAGTCGTTCGAAGATAATACGGATTTCC
GAGAAGCAGAATTTCAGTTATACAGACGACGTTACCTTGTCTTCAGGCTCCGTGTCCTCAGAGACGGCTTCTTGTTTATCAAGCAGCTCAAGCGTGGTCACAAGCGCGCC
GATCCATCACGTGGTTACGAGAGCAGAGAAGAAGTTAGGAATGATTCGTCACGCGTGGAGGAAAGAGCAGGTGGCATCGGCTCATATGCGGCGGCGTGCGGAAGCCATTC
TGAGTTACCTCTCCGGTGGTTGTTCCTCTGAAGTGAAGATACGGCAAGTGATTGGTGACAGCCCTGACACAAGCAAAGCTCTCAGAATGCTGTTGAAACTGGAAGAGATC
AAAAGATCCGGAACAGGTGGGCGTCAAGATCCCTATATGTACAAGGTAACACTCAACTCCCTGCAGCTTCCTCTCTAA
Protein sequenceShow/hide protein sequence
MAHFPSPLERTVASALLLLSTSPPPPPSPPLTPISQVEWLFEENIIGGKCSTEMSAFCDYSNSSSSILTGSDESSESPAQDPLLFSTSPCLHEIKLNVVRKSRSKIIRIS
EKQNFSYTDDVTLSSGSVSSETASCLSSSSSVVTSAPIHHVVTRAEKKLGMIRHAWRKEQVASAHMRRRAEAILSYLSGGCSSEVKIRQVIGDSPDTSKALRMLLKLEEI
KRSGTGGRQDPYMYKVTLNSLQLPL