; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000249 (gene) of Snake gourd v1 genome

Gene IDTan0000249
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionOleosin
Genome locationLG09:70467244..70467810
RNA-Seq ExpressionTan0000249
SyntenyTan0000249
Gene Ontology termsGO:0019915 - lipid storage (biological process)
GO:0022414 - reproductive process (biological process)
GO:0012511 - monolayer-surrounded lipid storage body (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000136 - Oleosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138225.1 oleosin 1 [Cucumis sativus]4.8e-5278.43Show/hide
Query:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS
        MAD  S T  +   LLRR+Q HAP SPQ+LGFLTLFIS SILIFL GLTLTAAVLA IFLTPFLLLT+PIW P++FFLFLA   +LSLA  ALA AA  S
Subjt:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS

Query:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        WAY+YFKGMHPPGSDRLEYA  RIYDTA+HVKDYAREYGGYLQSKVKDAAPGA
Subjt:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

XP_008453362.1 PREDICTED: oleosin 1 [Cucumis melo]8.7e-5480.13Show/hide
Query:  DRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWA
        DRP +  T    LLRR+Q HAP SPQ+LGFLTLFIS SILIFL GLTLTAAVLA IFLTPFLLLT+PIW P++FFLFLA   +LSLA  ALA AA  SWA
Subjt:  DRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWA

Query:  YRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        Y+YFKGMHPPGSDRLEYARSRIYDTA+HVKDYAREYGGYLQSKVKDAAPGA
Subjt:  YRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

XP_022933252.1 oleosin 1-like [Cucurbita moschata]1.1e-5378.43Show/hide
Query:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS
        MADRPS    TAT L+RRLQEHAP SPQ+LG LTLFIS SILIFLIGLT TAA++ LIFL+P +LLT+PIWAPL FFLF+ A  LLSL  FA+AAAA  S
Subjt:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS

Query:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        WAYRYFKGMHPPGS+++EYARSRIYDTA+ VKDYAREYGGYLQSKVKDAAPGA
Subjt:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

XP_023001817.1 oleosin 1-like [Cucurbita maxima]4.3e-5377.78Show/hide
Query:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS
        MADRPS    TAT L+RRLQEHAP SPQ+LG LTLFIS SILIFLIGLT TAA++ LIFL+P +LLT+PIWAPL FFLF+ A  LL L  FA+AAAA  S
Subjt:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS

Query:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        WAYRYFKGMHPPGS+++EYARSRIYDTA+ VKDYAREYGGYLQSKVKDAAPGA
Subjt:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

XP_038879769.1 oleosin G-like [Benincasa hispida]1.5e-5381.82Show/hide
Query:  TATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMH
        T   L RR QEHAP SPQ++GFLTLFIS SILIFL GLTLTAAVLALIFLTPF+LLT+PIW P++FFLFLA   +LSL   ALAAAA FSWAYRYFKGMH
Subjt:  TATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMH

Query:  PPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        PPGSDRLEYA  RIYDTA+HVKDYAREYGGYLQSKVKDAAPGA
Subjt:  PPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

TrEMBL top hitse value%identityAlignment
A0A0A0LRZ7 Uncharacterized protein2.3e-5278.43Show/hide
Query:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS
        MAD  S T  +   LLRR+Q HAP SPQ+LGFLTLFIS SILIFL GLTLTAAVLA IFLTPFLLLT+PIW P++FFLFLA   +LSLA  ALA AA  S
Subjt:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS

Query:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        WAY+YFKGMHPPGSDRLEYA  RIYDTA+HVKDYAREYGGYLQSKVKDAAPGA
Subjt:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

A0A1S3BVH6 oleosin 14.2e-5480.13Show/hide
Query:  DRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWA
        DRP +  T    LLRR+Q HAP SPQ+LGFLTLFIS SILIFL GLTLTAAVLA IFLTPFLLLT+PIW P++FFLFLA   +LSLA  ALA AA  SWA
Subjt:  DRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWA

Query:  YRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        Y+YFKGMHPPGSDRLEYARSRIYDTA+HVKDYAREYGGYLQSKVKDAAPGA
Subjt:  YRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

A0A5A7UTJ5 Oleosin 14.2e-5480.13Show/hide
Query:  DRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWA
        DRP +  T    LLRR+Q HAP SPQ+LGFLTLFIS SILIFL GLTLTAAVLA IFLTPFLLLT+PIW P++FFLFLA   +LSLA  ALA AA  SWA
Subjt:  DRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWA

Query:  YRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        Y+YFKGMHPPGSDRLEYARSRIYDTA+HVKDYAREYGGYLQSKVKDAAPGA
Subjt:  YRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

A0A6J1EZ90 oleosin 1-like5.5e-5478.43Show/hide
Query:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS
        MADRPS    TAT L+RRLQEHAP SPQ+LG LTLFIS SILIFLIGLT TAA++ LIFL+P +LLT+PIWAPL FFLF+ A  LLSL  FA+AAAA  S
Subjt:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS

Query:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        WAYRYFKGMHPPGS+++EYARSRIYDTA+ VKDYAREYGGYLQSKVKDAAPGA
Subjt:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

A0A6J1KHP7 oleosin 1-like2.1e-5377.78Show/hide
Query:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS
        MADRPS    TAT L+RRLQEHAP SPQ+LG LTLFIS SILIFLIGLT TAA++ LIFL+P +LLT+PIWAPL FFLF+ A  LL L  FA+AAAA  S
Subjt:  MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS

Query:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        WAYRYFKGMHPPGS+++EYARSRIYDTA+ VKDYAREYGGYLQSKVKDAAPGA
Subjt:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

SwissProt top hitse value%identityAlignment
A0A060L102 Oleosin G1.8e-3858.7Show/hide
Query:  LRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSD
        ++++ +H P   Q+LGF+TLF+SG++L+FL GLTLT  V+ L+ LTP L+  +PI  PL+  LF+A    LS   F LAA +A SW Y Y KG HPPG+D
Subjt:  LRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSD

Query:  RLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        +++YAR RI DTATHVKDYAREYGGYLQSK++DAAPGA
Subjt:  RLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

A0A1I9R3Y6 Oleosin5.3e-3857.14Show/hide
Query:  GLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPG
        G+++++ +H P   Q+LGF+TLF+SG+IL+ L GLTLT  V+ L+ LTP L+  +PI  P++  LF+A    LS   F LAA +A SW Y Y KG HPPG
Subjt:  GLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPG

Query:  SDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        +D+++YAR RI DTA+HVKDYAREYGGYLQSK++DAAPGA
Subjt:  SDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

A6MGW7 Oleosin6.9e-3858.87Show/hide
Query:  TGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPP
        + LLRR+Q+H P S Q++GFLTL ISG IL+ L GLT T+A +  +   P +LLT+P+W PL    FL     LS   F +AA A  +W YRY  G HP 
Subjt:  TGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPP

Query:  GSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        GS+R++YARSRI DTA+HVKDYAREYGGYLQS+VKDAAPGA
Subjt:  GSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

Q647G3 Oleosin Ara h 15.01011.0e-1241.6Show/hide
Query:  QVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEYARSRIYDT
        Q + F+T    G   + L GL LT  V+ LI  TP L++ +PI  P +  L LAA   L      +AA AA SW Y Y  G HP GSDRL+YA+  I D 
Subjt:  QVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEYARSRIYDT

Query:  ATHVKDYAREYGGYLQSKVKDAAPG
        A  VKD A++Y G    + ++  PG
Subjt:  ATHVKDYAREYGGYLQSKVKDAAPG

Q9XHP2 Oleosin L2.5e-1136.52Show/hide
Query:  PQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEYARSR
        P++ +V+   T   +G  L+ L GLTL   V+AL   TP L++ +P+  P    +FL     L+   F +AA +  SW YRY  G HPPG+D+LE A+++
Subjt:  PQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEYARSR

Query:  IYDTATHVKDYAREY
        +   A  +KD A ++
Subjt:  IYDTATHVKDYAREY

Arabidopsis top hitse value%identityAlignment
AT1G48990.1 Oleosin family protein3.2e-3054Show/hide
Query:  STTTTTATGLLRR-LQEHAP-QSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAY
        +TT    + LLR+ LQ  +P  S Q+ GFL  FISG IL+ L G+T+TA VL  I   P +++++PIW P    LFL  T  LSLA   LA  A  SW Y
Subjt:  STTTTTATGLLRR-LQEHAP-QSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAY

Query:  RYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        RYFKGMHP  SD+++YARSRI+DTA HVKDYA   GGY    +KDAAPGA
Subjt:  RYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

AT2G25890.1 Oleosin family protein3.9e-1236.67Show/hide
Query:  LLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGS
        ++R L E +P + Q++ F+T    G  L+ L GLTLT  V+ LI  TP ++L +P+  P    + L     L      +AAA A +W Y+Y  G HP G+
Subjt:  LLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGS

Query:  DRLEYARSRIYDTATHVKDY
        D+++YAR RI + A  +  Y
Subjt:  DRLEYARSRIYDTATHVKDY

AT3G18570.1 Oleosin family protein4.3e-3550.98Show/hide
Query:  STTTTTA-----TGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS
        ST TTT      +    +L+ H+P S Q+ GFL LFIS  IL+FL+G+++TAAVL  I   P +++++PIW P    +F+     L+++ F +   A  S
Subjt:  STTTTTA-----TGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFS

Query:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA
        W YRYF+GMHP GS++++YARSRIYDTA+HVKDYAREYGGY   + KDAAPGA
Subjt:  WAYRYFKGMHPPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA

AT4G25140.1 oleosin 15.7e-1139.13Show/hide
Query:  QSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEYARSRI
        +S Q+    T   +G  L+ L  LTL   V+AL   TP L++ +PI  P    + L  T  LS   F +AA   FSW Y+Y  G HP GSD+L+ AR ++
Subjt:  QSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEYARSRI

Query:  YDTATHVKDYAREYG
           A  +KD A+ YG
Subjt:  YDTATHVKDYAREYG

AT5G51210.1 oleosin31.7e-0738.39Show/hide
Query:  QEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEY
        QE  P++ Q++   T   +G  L+ L GLTL   V+AL   TP L++ +P+  P    + L  T  L+   F +AA  AFSW YR+  G    GSD++E 
Subjt:  QEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMHPPGSDRLEY

Query:  AR----SRIYDT
        AR    SR+ DT
Subjt:  AR----SRIYDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACCGTCCGTCCACCACCACCACAACCGCCACCGGTCTCCTCCGGCGATTACAAGAACATGCACCTCAGTCCCCGCAGGTCCTGGGCTTCCTTACCCTCTTCAT
CTCCGGCTCGATTCTGATCTTCCTGATCGGCCTCACACTCACCGCCGCCGTCCTCGCCCTAATCTTCCTCACCCCATTCCTCCTCTTAACCACTCCCATCTGGGCTCCTC
TTTCCTTTTTCCTCTTCCTCGCTGCCACCGCCCTCCTCTCTCTCGCCGCCTTTGCACTCGCTGCCGCCGCCGCCTTCTCATGGGCCTACCGCTACTTCAAAGGGATGCAC
CCACCTGGATCCGACCGCCTTGAGTACGCCAGAAGCCGTATTTACGACACTGCCACTCATGTTAAGGATTATGCTAGAGAGTATGGAGGGTATTTGCAGAGTAAGGTGAA
AGATGCAGCTCCTGGAGCTTAA
mRNA sequenceShow/hide mRNA sequence
AAAAATTAAACTAGTACTCCCCCAATAACCCCAACCACTCCGATGGCCGACCGTCCGTCCACCACCACCACAACCGCCACCGGTCTCCTCCGGCGATTACAAGAACATGC
ACCTCAGTCCCCGCAGGTCCTGGGCTTCCTTACCCTCTTCATCTCCGGCTCGATTCTGATCTTCCTGATCGGCCTCACACTCACCGCCGCCGTCCTCGCCCTAATCTTCC
TCACCCCATTCCTCCTCTTAACCACTCCCATCTGGGCTCCTCTTTCCTTTTTCCTCTTCCTCGCTGCCACCGCCCTCCTCTCTCTCGCCGCCTTTGCACTCGCTGCCGCC
GCCGCCTTCTCATGGGCCTACCGCTACTTCAAAGGGATGCACCCACCTGGATCCGACCGCCTTGAGTACGCCAGAAGCCGTATTTACGACACTGCCACTCATGTTAAGGA
TTATGCTAGAGAGTATGGAGGGTATTTGCAGAGTAAGGTGAAAGATGCAGCTCCTGGAGCTTAATTCTATTTTTTTTTTTTAATTCTAATTATTATTTCTAGCTGTCTCC
CGATTATCTCAGCCGGC
Protein sequenceShow/hide protein sequence
MADRPSTTTTTATGLLRRLQEHAPQSPQVLGFLTLFISGSILIFLIGLTLTAAVLALIFLTPFLLLTTPIWAPLSFFLFLAATALLSLAAFALAAAAAFSWAYRYFKGMH
PPGSDRLEYARSRIYDTATHVKDYAREYGGYLQSKVKDAAPGA