; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh08G004810 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh08G004810
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionUnknown protein
Genome locationCma_Chr08:2728044..2728754
RNA-Seq ExpressionCmaCh08G004810
SyntenyCmaCh08G004810
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593251.1 WRKY transcription factor 23, partial [Cucurbita argyrosperma subsp. sororia]6.4e-9284.07Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------
        M LREGDLTFDLESGAKIVKEEAGNVE SSIKREVKNI WGRLTEDTLPRDERAAASSSYVANII DANIKLLI+KNLKGEEGR                
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------

Query:  ---QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSS
           +KKAPKPPRPPKGPSLD ADQMLVKE AELAMKKRSRIER KA+KKAKAEKT SCNTYIPAL+IT LFFLVVIIQ ISSRSSSLFQGSP+PAVGGSS
Subjt:  ---QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSS

Query:  GFVLVQYIMNFPRNESYIPNSTTSVK
        GF+ VQYIMNFPRNESYIPNSTTSVK
Subjt:  GFVLVQYIMNFPRNESYIPNSTTSVK

XP_022959630.1 uncharacterized protein LOC111460650 isoform X1 [Cucurbita moschata]4.7e-9584.89Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------
        M LREGDLTFDLESGAKIVKE++GNVEPSSIKR VKNIFWGRLT+DTLPRDERAAASSSYVANIIRDANIKLLI+KNLKGEEGR                
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------

Query:  --QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSG
          +KKAPKPPRPPKGPSLDAADQMLVKE A+LAMKKRSRIERMKA+KKAKAEKTSSCNTYIPAL+IT LFFLVVIIQ ISSRSSSLFQGSP+PAVGGSSG
Subjt:  --QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSG

Query:  FVLVQYIMNFPRNESYIPNSTTSVK
        F+ VQYIMNFPRNESYIPNSTTSVK
Subjt:  FVLVQYIMNFPRNESYIPNSTTSVK

XP_022959631.1 uncharacterized protein LOC111460650 isoform X2 [Cucurbita moschata]7.0e-9992.75Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL
        M LREGDLTFDLESGAKIVKE++GNVEPSSIKR VKNIFWGRLT+DTLPRDERAAASSSYVANIIRDANIKLLI+KNLKGEEGRQKKAPKPPRPPKGPSL
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL

Query:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP
        DAADQMLVKE A+LAMKKRSRIERMKA+KKAKAEKTSSCNTYIPAL+IT LFFLVVIIQ ISSRSSSLFQGSP+PAVGGSSGF+ VQYIMNFPRNESYIP
Subjt:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP

Query:  NSTTSVK
        NSTTSVK
Subjt:  NSTTSVK

XP_023004656.1 uncharacterized protein LOC111497885 [Cucurbita maxima]2.0e-106100Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL
        MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL

Query:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP
        DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP
Subjt:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP

Query:  NSTTSVK
        NSTTSVK
Subjt:  NSTTSVK

XP_023513698.1 uncharacterized protein LOC111778230 isoform X2 [Cucurbita pepo subsp. pepo]2.8e-9585.78Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------
        M LREGDLT DLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDA+IKLLI+KNLKGEEGR                
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------

Query:  --QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSG
          +KKAPKPPRPPKGPSLDAADQMLVKE AELAMKKRSRIERMK LKKAKAEKTSSCNTYIPAL+IT LFFLVVI+Q ISSRSSSLFQGSP+PAVGGSSG
Subjt:  --QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSG

Query:  FVLVQYIMNFPRNESYIPNSTTSVK
        F+ VQYIMNFPRNESYIPNSTTSVK
Subjt:  FVLVQYIMNFPRNESYIPNSTTSVK

TrEMBL top hitse value%identityAlignment
A0A1S4DYQ5 uncharacterized protein LOC1034927411.8e-6366.22Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSS----YVANIIRDANIKLLINKNLKGEEGRQ-----------
        M LRE DLTFDLE+G KIV EE G+ EPSS KR+VKNI W RLTED+L +DERA AS+S     VA+II D ++ LLINKNL+GE+  +           
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSS----YVANIIRDANIKLLINKNLKGEEGRQ-----------

Query:  ------KKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGG
              KKA KPPRPPKGPSLDAAD+M+VKE A LAMKKR+R ERMKALKKAKAEKTSS N+ IPAL+ITFLFFLV+IIQGIS RSSS+ QGSP+PAVGG
Subjt:  ------KKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGG

Query:  SSGFVLVQYIMNFPRNESYIPN
        SSGF+ VQYI +FP +ES + N
Subjt:  SSGFVLVQYIMNFPRNESYIPN

A0A5D3D3X4 Putative transmembrane protein6.7e-6365.77Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSS----YVANIIRDANIKLLINKNLKGEEGRQ-----------
        M LRE DLTFDLE+G KIV EE G+ EPSS KR+VKNI W RLTED+L +DERA AS+S     VA+II D ++ LLI+KNL+GE+  +           
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSS----YVANIIRDANIKLLINKNLKGEEGRQ-----------

Query:  ------KKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGG
              KKA KPPRPPKGPSLDAAD+M+VKE A LAMKKR+R ERMKALKKAKAEKTSS N+ IPAL+ITFLFFLV+IIQGIS RSSS+ QGSP+PAVGG
Subjt:  ------KKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGG

Query:  SSGFVLVQYIMNFPRNESYIPN
        SSGF+ VQYI +FP +ES + N
Subjt:  SSGFVLVQYIMNFPRNESYIPN

A0A6J1H5E1 uncharacterized protein LOC111460650 isoform X23.4e-9992.75Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL
        M LREGDLTFDLESGAKIVKE++GNVEPSSIKR VKNIFWGRLT+DTLPRDERAAASSSYVANIIRDANIKLLI+KNLKGEEGRQKKAPKPPRPPKGPSL
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL

Query:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP
        DAADQMLVKE A+LAMKKRSRIERMKA+KKAKAEKTSSCNTYIPAL+IT LFFLVVIIQ ISSRSSSLFQGSP+PAVGGSSGF+ VQYIMNFPRNESYIP
Subjt:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP

Query:  NSTTSVK
        NSTTSVK
Subjt:  NSTTSVK

A0A6J1H6U5 uncharacterized protein LOC111460650 isoform X12.3e-9584.89Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------
        M LREGDLTFDLESGAKIVKE++GNVEPSSIKR VKNIFWGRLT+DTLPRDERAAASSSYVANIIRDANIKLLI+KNLKGEEGR                
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGR----------------

Query:  --QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSG
          +KKAPKPPRPPKGPSLDAADQMLVKE A+LAMKKRSRIERMKA+KKAKAEKTSSCNTYIPAL+IT LFFLVVIIQ ISSRSSSLFQGSP+PAVGGSSG
Subjt:  --QKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSG

Query:  FVLVQYIMNFPRNESYIPNSTTSVK
        F+ VQYIMNFPRNESYIPNSTTSVK
Subjt:  FVLVQYIMNFPRNESYIPNSTTSVK

A0A6J1KSQ4 uncharacterized protein LOC1114978859.9e-107100Show/hide
Query:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL
        MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL
Subjt:  MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSL

Query:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP
        DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP
Subjt:  DAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIP

Query:  NSTTSVK
        NSTTSVK
Subjt:  NSTTSVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein3.7e-2136.15Show/hide
Query:  EGDLTFDLESGAKIVKEE--AGNVEPSSIKREVKNIFWGRLTEDTLP----RDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGP
        E DL  D+E+G   V +E  +  V  + +  E  N        D L     RDE    +SS         ++ L   K   G+  + +KA KPPRPPKGP
Subjt:  EGDLTFDLESGAKIVKEE--AGNVEPSSIKREVKNIFWGRLTEDTLP----RDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGP

Query:  SLDAADQMLVKEFAELAMKKRSRIERM-KALKKAKAEKTSSCNTYIP--ALVITFLFFLVVIIQGISSRSSSL-FQGSPKPAVGGSSGFVLVQYIMNFPR
        SL   D+ ++++  ELAM+KR+RIERM K+LK+ KA KTS  +  I   +++IT +FF  ++ QG S+ SSS+    SP P V  ++  + VQ+  +F  
Subjt:  SLDAADQMLVKEFAELAMKKRSRIERM-KALKKAKAEKTSSCNTYIP--ALVITFLFFLVVIIQGISSRSSSL-FQGSPKPAVGGSSGFVLVQYIMNFPR

Query:  NESYIPNSTTSVK
         E   P+ TTS++
Subjt:  NESYIPNSTTSVK

AT3G17120.1 unknown protein3.1e-2050.43Show/hide
Query:  EEGRQKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNT---YIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAV
        +E R+K A KPPRPP+GPSLDAADQ L++E AELAM KR+RIERM+ALKK++A K +S  +    + A + T +FF V++ QG+S R++     S     
Subjt:  EEGRQKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNT---YIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAV

Query:  G-GSSGFVLVQYIMN
        G  + GFV VQY  N
Subjt:  G-GSSGFVLVQYIMN

AT3G17120.2 unknown protein3.1e-2050.43Show/hide
Query:  EEGRQKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNT---YIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAV
        +E R+K A KPPRPP+GPSLDAADQ L++E AELAM KR+RIERM+ALKK++A K +S  +    + A + T +FF V++ QG+S R++     S     
Subjt:  EEGRQKKAPKPPRPPKGPSLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNT---YIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAV

Query:  G-GSSGFVLVQYIMN
        G  + GFV VQY  N
Subjt:  G-GSSGFVLVQYIMN

AT4G01960.1 unknown protein6.3e-2135.71Show/hide
Query:  EGDLTFDLESGAKIVKEE--AGNVEPSSIKREVKNIFW-GRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKA---PKPPRPPKGP
        E DL  D+E G     +E     V        V N  W GRL+ D   +               R+ + + L   ++K +  + KK     KPPRPPKGP
Subjt:  EGDLTFDLESGAKIVKEE--AGNVEPSSIKREVKNIFW-GRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKA---PKPPRPPKGP

Query:  SLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSL-FQGSPKPAVGGSSGFVLVQYIMNFPRNES
         L A DQ L++E  ELAM+KR+RIERMK L++ KA K+SS  + I A+++T +FF+ +I QG  + ++SL    SP P    ++  V VQ+   F   E 
Subjt:  SLDAADQMLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSL-FQGSPKPAVGGSSGFVLVQYIMNFPRNES

Query:  YIPNSTTSVK
          P+ TTS +
Subjt:  YIPNSTTSVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTAAGAGAAGGGGATCTCACATTTGATCTGGAAAGCGGGGCGAAGATTGTTAAAGAGGAGGCTGGAAATGTGGAGCCAAGTTCAATCAAACGAGAGGTG
AAGAACATTTTTTGGGGTAGGTTGACAGAGGATACACTGCCACGAGACGAACGAGCTGCAGCCTCGAGCAGTTACGTTGCCAATATCATCCGTGATGCGAACATA
AAATTGTTGATAAATAAGAATTTGAAAGGAGAAGAGGGTCGTCAGAAAAAGGCTCCGAAGCCACCTCGGCCGCCCAAGGGACCTTCACTCGACGCTGCTGACCAA
ATGCTGGTGAAGGAATTCGCTGAGCTTGCCATGAAAAAACGATCGAGAATCGAGCGAATGAAAGCATTGAAGAAGGCAAAAGCTGAGAAAACATCTTCTTGCAAT
ACTTACATACCAGCTTTGGTTATCACATTCCTCTTCTTCCTTGTAGTAATCATTCAAGGTATAAGCTCCAGAAGCAGTTCATTGTTCCAGGGGTCGCCCAAACCG
GCCGTTGGTGGTAGTAGCGGTTTCGTTTTGGTTCAGTACATTATGAACTTTCCCAGAAATGAAAGCTACATACCCAATTCCACCACCTCTGTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTAAGAGAAGGGGATCTCACATTTGATCTGGAAAGCGGGGCGAAGATTGTTAAAGAGGAGGCTGGAAATGTGGAGCCAAGTTCAATCAAACGAGAGGTG
AAGAACATTTTTTGGGGTAGGTTGACAGAGGATACACTGCCACGAGACGAACGAGCTGCAGCCTCGAGCAGTTACGTTGCCAATATCATCCGTGATGCGAACATA
AAATTGTTGATAAATAAGAATTTGAAAGGAGAAGAGGGTCGTCAGAAAAAGGCTCCGAAGCCACCTCGGCCGCCCAAGGGACCTTCACTCGACGCTGCTGACCAA
ATGCTGGTGAAGGAATTCGCTGAGCTTGCCATGAAAAAACGATCGAGAATCGAGCGAATGAAAGCATTGAAGAAGGCAAAAGCTGAGAAAACATCTTCTTGCAAT
ACTTACATACCAGCTTTGGTTATCACATTCCTCTTCTTCCTTGTAGTAATCATTCAAGGTATAAGCTCCAGAAGCAGTTCATTGTTCCAGGGGTCGCCCAAACCG
GCCGTTGGTGGTAGTAGCGGTTTCGTTTTGGTTCAGTACATTATGAACTTTCCCAGAAATGAAAGCTACATACCCAATTCCACCACCTCTGTTAAGTGA
Protein sequenceShow/hide protein sequence
MGLREGDLTFDLESGAKIVKEEAGNVEPSSIKREVKNIFWGRLTEDTLPRDERAAASSSYVANIIRDANIKLLINKNLKGEEGRQKKAPKPPRPPKGPSLDAADQ
MLVKEFAELAMKKRSRIERMKALKKAKAEKTSSCNTYIPALVITFLFFLVVIIQGISSRSSSLFQGSPKPAVGGSSGFVLVQYIMNFPRNESYIPNSTTSVK