; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0016297 (gene) of Chayote v1 genome

Gene IDSed0016297
OrganismSechium edule (Chayote v1)
DescriptionPhytocyanin domain-containing protein
Genome locationLG05:6828210..6829530
RNA-Seq ExpressionSed0016297
SyntenySed0016297
Gene Ontology termsGO:0031224 - intrinsic component of membrane (cellular component)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin
IPR039391 - Phytocyanin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579459.1 hypothetical protein SDJN03_23907, partial [Cucurbita argyrosperma subsp. sororia]2.2e-5265.64Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MAISTSHL +FL++ A FAP + ATNYTVG  AGW+T V+YTAWA+ +TFYV D LIF YP G+HNV+KVN + FQNCT+P DQ   STG+D I LAKSG
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA
        KKWYICGKEGHC  + KL ITVMDMAP++      PS AT+AV+S + GFV VVV ILGMM+A
Subjt:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA

XP_022922161.1 stellacyanin-like [Cucurbita moschata]6.4e-5265.64Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MAISTSHL +FL++ A FAP A ATNYTVG  AGW+T V+YTAWA+ +TFYV D LIF Y  G+HNV+KVN + FQNCT+P DQ   STG+D I LAKSG
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA
        KKWYICGK+GHC  + KL ITVMDMAP++      PS AT+AV+S + GFV VVV ILGMMMA
Subjt:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA

XP_022969776.1 stellacyanin-like [Cucurbita maxima]1.5e-5366.87Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MAISTSHL +FL++ A FAPSA ATNYTVG  AGW+T V+YTAWA+ +TFYV D LIF YP G+HNV+KVN + FQNCT+P DQ   STG+D I LAKSG
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA
        KKWYICGKEGHC  + KL ITVMDMAP++      PS AT+AV+S + GFV VVV ILG+MMA
Subjt:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA

XP_023550536.1 stellacyanin-like [Cucurbita pepo subsp. pepo]3.8e-5265.03Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MAIS SHL +FL++ + FAPSA ATNYTVG  AGW+T V+YTAWA+ +TFYV D LIF YP G+HNV+KVN + FQNCT+P DQ   STG+D I LAKSG
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA
        KKWYICGK+GHC  + KL ITVMDMAP++      PS AT+AV+S + GFV VV+ ILGMMMA
Subjt:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA

XP_038874887.1 blue copper protein 1a-like [Benincasa hispida]7.1e-5164.5Show/hide
Query:  MAISTSHLLVFLTVAA-TFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTD-QNALSTGSDAIVLAK
        MAISTSHL VFL++AA  FAPSA ATNYTVG  AGW+ GV+YT WA D+ F V D+LIFNYP G+HNVFKVN + F +C++P D QNAL+TG+DAIVLAK
Subjt:  MAISTSHLLVFLTVAA-TFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTD-QNALSTGSDAIVLAK

Query:  SGKKWYICGKEGHCASNMKLAITVMDMAPSSQ----------PSGATRAVVSTRFGFVAVVVGILGMMM
         GKKWYICGKEGHC    KL ITVM+MAP++           PS AT+AVVS  FGF+A++V +LGMMM
Subjt:  SGKKWYICGKEGHCASNMKLAITVMDMAPSSQ----------PSGATRAVVSTRFGFVAVVVGILGMMM

TrEMBL top hitse value%identityAlignment
A0A1S3ATX0 mavicyanin-like9.4e-4962.5Show/hide
Query:  MAISTSHLLVFLTVAATF-APSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKS
        MAISTSHL V L+ A    APSA ATNYTVG  AGW   V+YT WA+ + F V D+LIFNYP G+HNVFKVN + F++CT+P DQNAL+TGSD IVLAK 
Subjt:  MAISTSHLLVFLTVAATF-APSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKS

Query:  GKKWYICGKEGHCASNMKLAITVMDMAPSSQ----------PSGATRAVVSTRFGFVAVVVGILGMMM
        G+KWYICGKEGHC    KL I VMDM P++           PS AT+AVVS +FGFVA+VV +LGMMM
Subjt:  GKKWYICGKEGHCASNMKLAITVMDMAPSSQ----------PSGATRAVVSTRFGFVAVVVGILGMMM

A0A6J1DZM9 mavicyanin-like5.7e-4657.47Show/hide
Query:  ISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKK
        IS SHL V     A F PS  AT Y VG  AGWDTGV+YT WA+D+ F V D L+F Y QG+HNV+KVN TQF NCTIP DQNALSTG+D I L   G+K
Subjt:  ISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKK

Query:  WYICGKEGHCASNMKLAITVMDMAPS-----------------SQPSGATRAVVSTRFGFVAVVVGILGMMMAA
        WYICGKEGHC  N KL ITVMDMAP                  S PSGAT+A  S  FG +A V G LGM++ A
Subjt:  WYICGKEGHCASNMKLAITVMDMAPS-----------------SQPSGATRAVVSTRFGFVAVVVGILGMMMAA

A0A6J1E3D3 stellacyanin-like3.1e-5265.64Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MAISTSHL +FL++ A FAP A ATNYTVG  AGW+T V+YTAWA+ +TFYV D LIF Y  G+HNV+KVN + FQNCT+P DQ   STG+D I LAKSG
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA
        KKWYICGK+GHC  + KL ITVMDMAP++      PS AT+AV+S + GFV VVV ILGMMMA
Subjt:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA

A0A6J1H417 stellacyanin-like4.7e-4859.66Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MAISTSHL VFL++AA FAPSA ATNYTVG  AGW  GV+YT WA+++TFYV D LIF YP  + NV+ V   QF NCTIPTD+NA +TG D + L + G
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITVMDMAPSSQP------------------SGATRAVVSTRFGFVAVVVGILGMMMA
        +KW+I GKEGHCA N KL ITVM MAP+S P                  SGATRAV+S +FG VA+VVGILG+M+A
Subjt:  KKWYICGKEGHCASNMKLAITVMDMAPSSQP------------------SGATRAVVSTRFGFVAVVVGILGMMMA

A0A6J1HYR4 stellacyanin-like7.4e-5466.87Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MAISTSHL +FL++ A FAPSA ATNYTVG  AGW+T V+YTAWA+ +TFYV D LIF YP G+HNV+KVN + FQNCT+P DQ   STG+D I LAKSG
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA
        KKWYICGKEGHC  + KL ITVMDMAP++      PS AT+AV+S + GFV VVV ILG+MMA
Subjt:  KKWYICGKEGHCASNMKLAITVMDMAPSSQ-----PSGATRAVVSTRFGFVAVVVGILGMMMA

SwissProt top hitse value%identityAlignment
A0A072U307 Blue copper protein 1b1.8e-2844.44Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MA S   L++ +++    + +  AT+Y VG   GW    DYT WA+D+ F V D L+FNY    HNVFKVN T FQ+CT P    ALSTG D I L   G
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHC-ASNMKLAITVM---DMAPSSQPSGATRAVVSTRFGFVAVVVGILGMMMA
        +KWY+CG   HC A  MKL ITV+     APS  PS    +VVS+ FG V  ++  + ++ A
Subjt:  KKWYICGKEGHC-ASNMKLAITVM---DMAPSSQPSGATRAVVSTRFGFVAVVVGILGMMMA

A0A0M4FTF3 Blue copper protein1.1e-2553.51Show/hide
Query:  APSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASN-MK
        A  A  T Y VG   GW   VDY AWAK +TF V D L+F Y +G HNVFKVN T FQNC  P     L++G D I LA  GKKWYICG   HC+ +  K
Subjt:  APSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASN-MK

Query:  LAITVMDMAPSSQP
        LAITV + AP+  P
Subjt:  LAITVMDMAPSSQP

G7L0H3 Blue copper protein 1a1.4e-2844.1Show/hide
Query:  ISTSHLLVFLTVAATFAPS-AFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGK
        +++S +++ L+++     S A AT++ VG   GW    DYT WA+D+ F V D L+FNY    HNVFKVN T FQ+CT P    ALSTG D I L   G+
Subjt:  ISTSHLLVFLTVAATFAPS-AFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGK

Query:  KWYICGKEGHC-ASNMKLAITVM---DMAPSSQPSGATRAVVSTRFGFVAVVVGILGMMMA
        KWY+CG   HC A  MKL ITV+     APS  PS    +VVS+ FG V  ++  + ++ A
Subjt:  KWYICGKEGHC-ASNMKLAITVM---DMAPSSQPSGATRAVVSTRFGFVAVVVGILGMMMA

O82081 Uclacyanin 13.1e-1734.15Show/hide
Query:  STSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKW
        S   L++   +A T      AT++T+GG +GW  G     WA  +TF V D L+F+YP   H+V +V   +F +C         + G+  + L   GK++
Subjt:  STSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKW

Query:  YICGKEGHCASNMKLAITVMDMA
        +ICG  GHC+  MKL + V+  A
Subjt:  YICGKEGHCASNMKLAITVMDMA

Q41001 Blue copper protein9.4e-2240.41Show/hide
Query:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG
        MA S + +L FL      A  + AT YTVG  +GW  G DY+ WA D+TF V D L+FNY  G H V +V ++ +++CT     +  STG+  I L K+G
Subjt:  MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSG

Query:  KKWYICGKEGHCASNMKLAITV-----MDMAPSSQPSGATRAVVST
        K ++ICG  GH    MKL+I V        APS+ PS + +   S+
Subjt:  KKWYICGKEGHCASNMKLAITV-----MDMAPSSQPSGATRAVVST

Arabidopsis top hitse value%identityAlignment
AT1G22480.1 Cupredoxin superfamily protein1.0e-1534.62Show/hide
Query:  SHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYI
        S LL  L +  +    A + + TV     W  G DYT     +TF V D ++FNY  G H V +V++  +++CT+     + S+G+  I L  +G +++I
Subjt:  SHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYI

Query:  CGKEGHCASNMKLAITVMDMAPSSQPSGAT
        CG  GHCA+ MKLA+TV   + +    G T
Subjt:  CGKEGHCASNMKLAITVMDMAPSSQPSGAT

AT2G32300.1 uclacyanin 12.2e-1834.15Show/hide
Query:  STSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKW
        S   L++   +A T      AT++T+GG +GW  G     WA  +TF V D L+F+YP   H+V +V   +F +C         + G+  + L   GK++
Subjt:  STSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKW

Query:  YICGKEGHCASNMKLAITVMDMA
        +ICG  GHC+  MKL + V+  A
Subjt:  YICGKEGHCASNMKLAITVMDMA

AT3G17675.1 Cupredoxin superfamily protein4.1e-2039.8Show/hide
Query:  TNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASNMKLAITV
        T + VG + GW+   +YT W +   F+V DVL+FNY   +HNV +VN T + +C +       + G+D+I+L++ GK W+ICG + HC +  KL+I V
Subjt:  TNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASNMKLAITV

AT3G27200.1 Cupredoxin superfamily protein9.1e-2034.48Show/hide
Query:  SAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASNMKLAI
        +A A  + +GG+ GW+  VD+ +W+ D++F V D ++F Y +    V   ++T +++C + T  N+LS+G+D + L+K+G +++ CG  GHC   MK+ +
Subjt:  SAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASNMKLAI

Query:  TVM--DMAPSSQPSGA
         V+  D   +S PSG+
Subjt:  TVM--DMAPSSQPSGA

AT5G07475.1 Cupredoxin superfamily protein1.2e-1633.33Show/hide
Query:  ATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASNMKLAITVM
        AT Y VG ++GWD   D  +W   + F   DVL+F Y    H+V++V    +QNC         + G+  + L+K G ++++CG   HC + M+L + V 
Subjt:  ATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEGHCASNMKLAITVM

Query:  DMAPSSQPSGATRAVVS
           PS  P G+ +A  S
Subjt:  DMAPSSQPSGATRAVVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATCTCTACCTCTCATCTCTTGGTCTTTCTGACCGTTGCGGCTACTTTTGCCCCTTCGGCTTTTGCCACCAATTACACCGTTGGAGGTGCTGCTGGTTGGGACAC
CGGTGTCGACTACACTGCGTGGGCTAAGGATGAAACATTCTACGTCAATGATGTTCTTATTTTCAACTACCCACAAGGTGAGCACAACGTATTCAAAGTTAACGACACTC
AATTTCAGAATTGCACAATTCCAACTGACCAAAATGCACTGAGCACTGGCAGCGACGCCATTGTACTTGCCAAGTCTGGAAAAAAATGGTACATCTGTGGAAAAGAGGGC
CACTGTGCCTCGAATATGAAGCTCGCCATTACAGTCATGGATATGGCTCCCTCCTCGCAGCCATCGGGTGCCACGAGAGCAGTTGTTTCCACCCGATTCGGGTTCGTCGC
GGTGGTCGTTGGCATCCTTGGGATGATGATGGCAGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCATCTCTACCTCTCATCTCTTGGTCTTTCTGACCGTTGCGGCTACTTTTGCCCCTTCGGCTTTTGCCACCAATTACACCGTTGGAGGTGCTGCTGGTTGGGACAC
CGGTGTCGACTACACTGCGTGGGCTAAGGATGAAACATTCTACGTCAATGATGTTCTTATTTTCAACTACCCACAAGGTGAGCACAACGTATTCAAAGTTAACGACACTC
AATTTCAGAATTGCACAATTCCAACTGACCAAAATGCACTGAGCACTGGCAGCGACGCCATTGTACTTGCCAAGTCTGGAAAAAAATGGTACATCTGTGGAAAAGAGGGC
CACTGTGCCTCGAATATGAAGCTCGCCATTACAGTCATGGATATGGCTCCCTCCTCGCAGCCATCGGGTGCCACGAGAGCAGTTGTTTCCACCCGATTCGGGTTCGTCGC
GGTGGTCGTTGGCATCCTTGGGATGATGATGGCAGCTTAGAAATGTGACTGAGGGGA
Protein sequenceShow/hide protein sequence
MAISTSHLLVFLTVAATFAPSAFATNYTVGGAAGWDTGVDYTAWAKDETFYVNDVLIFNYPQGEHNVFKVNDTQFQNCTIPTDQNALSTGSDAIVLAKSGKKWYICGKEG
HCASNMKLAITVMDMAPSSQPSGATRAVVSTRFGFVAVVVGILGMMMAA