; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003985 (gene) of Snake gourd v1 genome

Gene IDTan0003985
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUncharacterised protein family (UPF0114)
Genome locationLG01:105820891..105824378
RNA-Seq ExpressionTan0003985
SyntenyTan0003985
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591856.1 hypothetical protein SDJN03_14202, partial [Cucurbita argyrosperma subsp. sororia]7.5e-9781.25Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP P F+              R FPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

XP_022937012.1 uncharacterized protein LOC111443436 [Cucurbita moschata]3.1e-9882.08Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP P FN              RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

XP_022976828.1 uncharacterized protein LOC111477089 [Cucurbita maxima]3.4e-9781.67Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP   FN              RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

XP_023523344.1 uncharacterized protein LOC111787566 [Cucurbita pepo subsp. pepo]2.9e-9681.67Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNFNS------------RRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSPPLI+  +RT+TTT   GPSTMIIQAY QY Q YP FNS            RRFPA A+ASSGP VPAASAP IQSD+G ASRTS LEK   IEE 
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNFNS------------RRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGV GSLVGS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LG+ERTLSKRN+EHRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

XP_023536456.1 uncharacterized protein LOC111797628 [Cucurbita pepo subsp. pepo]1.2e-9781.67Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+ TTAR  PST+IIQAYQ++QP P FN              RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

TrEMBL top hitse value%identityAlignment
A0A1S3BA03 uncharacterized protein LOC103487632 isoform X14.5e-9576.68Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT TTT R  PST+IIQAYQY+QP P FN             SR FPACAS S GPQVPAASAPLIQ+ +GAASRTS LEK++TIEE 
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GS+LCF+KGCVHVAASFSEYFVNRGKVIM+LVEAIDVYLLGTVMLVFGTGLYELFISQLG+ R+LSK NVEH+SNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKRQLYK--LQVICFA
        F LKERPKWM+V TVNELKTKLGHVIVMLLLIGFFDKSK+ + +  + ++C A
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKRQLYK--LQVICFA

A0A6J1F9X5 uncharacterized protein LOC1114434361.5e-9882.08Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP P FN              RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R+V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

A0A6J1FPY6 uncharacterized protein LOC1114459069.0e-9681.25Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNFNS------------RRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSPPLI+  +RT+TTT R  PSTMII AY QY Q YP FNS            RRFPA A+ASSGP VPAASAP IQSD+G ASRTS LEK   IEE 
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNFNS------------RRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGV GSLVGS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LG+ERTLSKRN+EHRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

A0A6J1INA6 uncharacterized protein LOC1114770891.6e-9781.67Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSP LIT   RT+TTTAR  PST+IIQAYQ++QP   FN              RRFPACAS SSGPQVPAASAP +QSD+GAASRTSALEKLDT+EEG
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFN-------------SRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGVLGSL+GSVLCFVKGCVHVAAS SEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LGS R+ S+R V HRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

A0A6J1J2R4 uncharacterized protein LOC1114807451.8e-9682.08Show/hide
Query:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNFNS------------RRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG
        MQPSPPLI+  +R++TTT R  PSTMIIQAY QY Q YP FNS            RRFPA A+ASSGP VPAASAP IQSDIG ASRTSALEK   IEE 
Subjt:  MQPSPPLITDLTRTITTTARPGPSTMIIQAY-QYRQPYPNFNS------------RRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEG

Query:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL
        LEKAIYRCRFMAFLGV GSLVGS+LCF+KGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFIS LG+ERTLSKRN+EHRSNLFGL
Subjt:  LEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGL

Query:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        FTLKERPKWM++TTVNELKTKLGHVIVMLLLIGFFDKSK+
Subjt:  FTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)2.0e-5561.93Show/hide
Query:  SSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTV
        S G    A+++    +   AA  +++  + + +EEG+EK IY CRFM FLG LGSL+GSVLCF+KGC++V  SF +Y VNRGKVI LLVEAID+YLLGTV
Subjt:  SSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTV

Query:  MLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR
        MLVFG GLYELFIS L +  + +   V +RS+LFG+FTLKERP+W++V +V+ELKTKLGHVIVMLLLIG FDKSKR
Subjt:  MLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSKR

AT5G13720.1 Uncharacterised protein family (UPF0114)2.2e-3342.02Show/hide
Query:  PACASASSGPQVPAASAPLIQSDIGAASRTSALEK-LDTIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVN------RGKVIML
        P  +++SS P     +   + S  G     S   +   + E  +E+ I+  RF+A L V GSL GS+LCF+ GCV++  ++  Y+ N       G++++ 
Subjt:  PACASASSGPQVPAASAPLIQSDIGAASRTSALEK-LDTIEEGLEKAIYRCRFMAFLGVLGSLVGSVLCFVKGCVHVAASFSEYFVN------RGKVIML

Query:  LVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSK
        LVEAIDVYL GTVML+F  GLY LFIS    +           S+LFG+F +KERPKWM +++++ELKTK+GHVIVM+LL+  F++SK
Subjt:  LVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIGFFDKSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCGTCTCCACCGTTGATTACTGACCTTACCAGAACTATAACGACCACCGCCCGACCTGGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAACCTTA
TCCAAATTTTAATAGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGACCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATATTGGCGCTGCGTCCC
GGACGTCGGCACTGGAAAAGTTGGATACCATAGAGGAGGGCCTGGAAAAGGCCATTTATCGATGCCGATTCATGGCATTTTTGGGCGTCTTAGGATCTTTGGTTGGTTCT
GTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCAGCATCTTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCT
CTTAGGAACTGTGATGCTAGTCTTTGGTACGGGTCTCTATGAGCTGTTTATCAGTCAGCTTGGAAGTGAACGCACTTTATCAAAGAGAAACGTTGAGCATAGATCCAACC
TATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGGACGTAACGACCGTTAACGAGCTGAAAACAAAGCTCGGGCATGTCATAGTGATGCTGCTTCTAATTGGG
TTCTTCGACAAGAGTAAAAGGCAGTTATACAAACTCCAGGTGATTTGCTTTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAACCGTCTCCACCGTTGATTACTGACCTTACCAGAACTATAACGACCACCGCCCGACCTGGACCTTCCACGATGATCATCCAAGCCTACCAGTACCGGCAACCTTA
TCCAAATTTTAATAGCCGTAGATTTCCTGCCTGTGCAAGTGCCAGCTCAGGACCTCAAGTTCCGGCTGCTTCTGCTCCTTTAATCCAATCCGATATTGGCGCTGCGTCCC
GGACGTCGGCACTGGAAAAGTTGGATACCATAGAGGAGGGCCTGGAAAAGGCCATTTATCGATGCCGATTCATGGCATTTTTGGGCGTCTTAGGATCTTTGGTTGGTTCT
GTACTCTGTTTCGTCAAGGGGTGCGTTCATGTAGCAGCATCTTTCTCAGAATATTTTGTAAATCGTGGAAAAGTGATAATGTTGCTAGTTGAGGCCATAGATGTGTATCT
CTTAGGAACTGTGATGCTAGTCTTTGGTACGGGTCTCTATGAGCTGTTTATCAGTCAGCTTGGAAGTGAACGCACTTTATCAAAGAGAAACGTTGAGCATAGATCCAACC
TATTTGGCTTGTTCACTTTAAAGGAACGACCAAAATGGATGGACGTAACGACCGTTAACGAGCTGAAAACAAAGCTCGGGCATGTCATAGTGATGCTGCTTCTAATTGGG
TTCTTCGACAAGAGTAAAAGGCAGTTATACAAACTCCAGGTGATTTGCTTTGCTTAG
Protein sequenceShow/hide protein sequence
MQPSPPLITDLTRTITTTARPGPSTMIIQAYQYRQPYPNFNSRRFPACASASSGPQVPAASAPLIQSDIGAASRTSALEKLDTIEEGLEKAIYRCRFMAFLGVLGSLVGS
VLCFVKGCVHVAASFSEYFVNRGKVIMLLVEAIDVYLLGTVMLVFGTGLYELFISQLGSERTLSKRNVEHRSNLFGLFTLKERPKWMDVTTVNELKTKLGHVIVMLLLIG
FFDKSKRQLYKLQVICFA