; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh19G006540 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh19G006540
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationCma_Chr19:6893087..6893788
RNA-Seq ExpressionCmaCh19G006540
SyntenyCmaCh19G006540
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572022.1 hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sororia]1.1e-10886.7Show/hide
Query:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL
        MDL+LSLALPSAATAAT             +T  +AAAAYELHLLSSLRTPN LGVRQTSLRRRK NSPTTGPIEPPYPWSTDRIAVV TL YLT NQIL
Subjt:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL

Query:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY
        TITG+VKCQQCRRIYE+EY+VVSKFNEIG FVEH MESFRDRAPK+WMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEM+GALKLNHLKYFCSY
Subjt:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY

Query:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI
        TKNHRTGSKDRLVYLTYITLCRQIDPSGRFS I
Subjt:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI

KAG7011694.1 hypothetical protein SDJN02_26600, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-8794.44Show/hide
Query:  GPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKE
        GPIEPPYPWSTDRIAVVHTL YLT NQILTITG+VKCQQCRRIYE+EY+VVSKFNEIG FVEH MESFRDRAPK+WMQPNYPTCRFCGAEKGVKPVIPKE
Subjt:  GPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKE

Query:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI
        WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFS I
Subjt:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]5.1e-11491.85Show/hide
Query:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL
        MDL+LSLALPSAATAAT+AA         AATVAAAAAAYELHLLSSLRTPN LGVRQTSLR RK NSPTTGPIEPPYPWSTDRIAVVHTLHYLT NQIL
Subjt:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL

Query:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY
        TITG+VKCQQCRRIYEIEY+VVSKFNEIGSFVEHNMESFRDRAPK+WMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY
Subjt:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY

Query:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI
        TKNHRTGSKDRLVYLTYITLCRQIDPSGRFS I
Subjt:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI

XP_022972401.1 uncharacterized protein LOC111470968 [Cucurbita maxima]1.3e-117100Show/hide
Query:  ATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVV
        ATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVV
Subjt:  ATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVV

Query:  SKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCR
        SKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCR
Subjt:  SKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCR

Query:  QIDPSGRFSRI
        QIDPSGRFSRI
Subjt:  QIDPSGRFSRI

XP_023511615.1 uncharacterized protein LOC111776409 [Cucurbita pepo subsp. pepo]4.6e-10784.98Show/hide
Query:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL
        MDL+LSLALPSAATAA                 + AAAAYELHLLSSLRTPN LGVRQTSLRRRKCNSPTTG IEPPYPWSTDRIAVVHTLHYLT NQI+
Subjt:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL

Query:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY
        TITG+VKCQQCRRIYE+EY+VVSKFNEIG FVE+ MESFRDRAPK+WMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGAL+LNHLKYFCSY
Subjt:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY

Query:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI
        TKNHRTGSKDRLVYLTYITLCRQI PSGRF+ I
Subjt:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein1.2e-6558.12Show/hide
Query:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSP-TTGPIEPPYPWSTDRIAVVHTLHYLTLNQI
        +DL+LSL  PS   ++  +AA        A T              ++R    LG R++S +R    SP TT  IEPPYPWST+R A+V TL+ L  NQI
Subjt:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSP-TTGPIEPPYPWSTDRIAVVHTLHYLTLNQI

Query:  LTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCS
        L ITGDV+C+QC+  Y IEY++ SKF EI SFVE N  SFRDRAP+ WM PNYPTCRFCG E G +PVIPK+W KINW+FLLLGEM+G L LNHLKYFCS
Subjt:  LTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCS

Query:  YTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI
         T NHRTG+K+RL+YLTYITLC Q+DPSGRF+R+
Subjt:  YTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI

A0A1S3BHR1 uncharacterized protein LOC1034897701.3e-6759.4Show/hide
Query:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSP-TTGPIEPPYPWSTDRIAVVHTLHYLTLNQI
        +DLQLSL  PS    +  +  A          +  A A    + L++ R    LG R++SLRR    SP TT  IEPPYPWST+R A+V TL+ L  +QI
Subjt:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSP-TTGPIEPPYPWSTDRIAVVHTLHYLTLNQI

Query:  LTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCS
        L ITGDV+C+QC+  Y IEY++VSKF EI SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+G L LNHLKYFCS
Subjt:  LTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCS

Query:  YTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI
        YT NHRTG+K+RL+YLTYITLC Q+DPSGRF+R+
Subjt:  YTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI

A0A5A7T547 Uncharacterized protein3.3e-5857.99Show/hide
Query:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSP-TTGPIEPPYPWSTDRIAVVHTLHYLTLNQI
        +DLQLSL  PS    +  +  A          +  A A    + L++ R    LG R++SLRR    SP TT  IEPPYPWST+R A+V TL+ L  +QI
Subjt:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSP-TTGPIEPPYPWSTDRIAVVHTLHYLTLNQI

Query:  LTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCS
        L ITGDV+C+QC+  Y IEY++VSKF EI SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+G L LNHLKYFCS
Subjt:  LTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCS

Query:  YTKNHRTGSKDRLVYLTYI
        YT NHRTG+K+RL+YLT I
Subjt:  YTKNHRTGSKDRLVYLTYI

A0A6J1GLD4 uncharacterized protein LOC1114553882.5e-11491.85Show/hide
Query:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL
        MDL+LSLALPSAATAAT+AA         AATVAAAAAAYELHLLSSLRTPN LGVRQTSLR RK NSPTTGPIEPPYPWSTDRIAVVHTLHYLT NQIL
Subjt:  MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQIL

Query:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY
        TITG+VKCQQCRRIYEIEY+VVSKFNEIGSFVEHNMESFRDRAPK+WMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY
Subjt:  TITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSY

Query:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI
        TKNHRTGSKDRLVYLTYITLCRQIDPSGRFS I
Subjt:  TKNHRTGSKDRLVYLTYITLCRQIDPSGRFSRI

A0A6J1I5V9 uncharacterized protein LOC1114709686.3e-118100Show/hide
Query:  ATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVV
        ATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVV
Subjt:  ATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVV

Query:  SKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCR
        SKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCR
Subjt:  SKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCR

Query:  QIDPSGRFSRI
        QIDPSGRFSRI
Subjt:  QIDPSGRFSRI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein7.7e-4446.06Show/hide
Query:  RRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAE
        R R   S  +  I PP+PW+T+R   + +L YL  NQI TITG+V+C+ C ++Y++ YN+  +F E+  F        RDRA K W  P    C  CG E
Subjt:  RRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAE

Query:  KGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDP
        K VKPVI +   +INW+FLLLG+ +G   L  LK FC ++KNHRTG+KDR++YLTY+ LC+ + P
Subjt:  KGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.0e-4042.94Show/hide
Query:  RRKCNSPTTG--------PIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPT
        RR    P  G         I PPYPW+T +   + +   L+ N I  I+G V C+ C R   +EYN+  KF+E+  +++ N E  R RAP  W  P    
Subjt:  RRKCNSPTTG--------PIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPT

Query:  CRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFS
        CR C +E  +KPV+ +  E+INW+FLLLG+M+G   L+ L+YFC     HRTGSKDR+VY+TY++LC+Q+DP G F+
Subjt:  CRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFS

AT2G16190.2 FUNCTIONS IN: molecular_function unknown2.3e-2438.16Show/hide
Query:  RRKCNSPTTG--------PIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPT
        RR    P  G         I PPYPW+T +   + +   L+ N I  I+G V C+ C R   +EYN+  KF+E+  +++ N E  R RAP  W  P    
Subjt:  RRKCNSPTTG--------PIEPPYPWSTDRIAVVHTLHYLTLNQILTITGDVKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPT

Query:  CRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRT
        CR C +E  +KPV+ +  E+INW+FLLLG+M+G   L+ L    S  K+H T
Subjt:  CRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCTTCAACTCTCTCTTGCCCTCCCGTCCGCCGCCACCGCCGCCACTACCGCCGCCGCCGCCGCCACCGCCACCACCACCGCCGCCGCCACGGTGGCTGCC
GCCGCTGCAGCCTACGAGCTACACTTACTATCCTCATTGAGAACCCCCAACATTTTAGGAGTCCGTCAAACATCTCTCCGCCGTAGGAAGTGTAATTCTCCAACG
ACGGGGCCGATCGAGCCACCATATCCATGGTCGACGGACCGAATAGCGGTGGTTCATACGCTACACTATTTGACATTGAACCAAATCCTGACGATCACCGGGGAT
GTCAAGTGCCAGCAATGTCGGAGAATTTACGAGATCGAATACAACGTTGTTTCGAAGTTTAACGAGATTGGGAGCTTCGTAGAGCACAACATGGAGTCGTTCCGG
GACCGGGCGCCAAAGAAGTGGATGCAGCCGAACTATCCGACGTGTCGGTTTTGCGGGGCGGAAAAAGGAGTGAAGCCGGTGATTCCAAAGGAATGGGAGAAGATC
AATTGGGTGTTCTTGCTTTTGGGGGAAATGGTTGGAGCTTTGAAACTGAATCATTTGAAGTACTTTTGCAGTTACACGAAGAATCATCGAACAGGTTCAAAGGAT
CGTCTTGTTTATCTCACTTACATCACTTTGTGCCGCCAAATTGATCCTTCTGGCCGTTTCAGTCGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCTTCAACTCTCTCTTGCCCTCCCGTCCGCCGCCACCGCCGCCACTACCGCCGCCGCCGCCGCCACCGCCACCACCACCGCCGCCGCCACGGTGGCTGCC
GCCGCTGCAGCCTACGAGCTACACTTACTATCCTCATTGAGAACCCCCAACATTTTAGGAGTCCGTCAAACATCTCTCCGCCGTAGGAAGTGTAATTCTCCAACG
ACGGGGCCGATCGAGCCACCATATCCATGGTCGACGGACCGAATAGCGGTGGTTCATACGCTACACTATTTGACATTGAACCAAATCCTGACGATCACCGGGGAT
GTCAAGTGCCAGCAATGTCGGAGAATTTACGAGATCGAATACAACGTTGTTTCGAAGTTTAACGAGATTGGGAGCTTCGTAGAGCACAACATGGAGTCGTTCCGG
GACCGGGCGCCAAAGAAGTGGATGCAGCCGAACTATCCGACGTGTCGGTTTTGCGGGGCGGAAAAAGGAGTGAAGCCGGTGATTCCAAAGGAATGGGAGAAGATC
AATTGGGTGTTCTTGCTTTTGGGGGAAATGGTTGGAGCTTTGAAACTGAATCATTTGAAGTACTTTTGCAGTTACACGAAGAATCATCGAACAGGTTCAAAGGAT
CGTCTTGTTTATCTCACTTACATCACTTTGTGCCGCCAAATTGATCCTTCTGGCCGTTTCAGTCGAATTTGA
Protein sequenceShow/hide protein sequence
MDLQLSLALPSAATAATTAAAAATATTTAAATVAAAAAAYELHLLSSLRTPNILGVRQTSLRRRKCNSPTTGPIEPPYPWSTDRIAVVHTLHYLTLNQILTITGD
VKCQQCRRIYEIEYNVVSKFNEIGSFVEHNMESFRDRAPKKWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKD
RLVYLTYITLCRQIDPSGRFSRI