CuGenDBv2

Sed0022313 (gene) of Chayote v1 genome

Gene ID: Sed0022313
Organism: Sechium edule (Chayote v1)
Description: protein CURVATURE THYLAKOID 1C, chloroplastic
Genome location: LG07:11249388..11252847
RNA-Seq Expression: Sed0022313
Synteny: Sed0022313
Gene Ontology terms:
  GO:0009535 - chloroplast thylakoid membrane (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
  IPR025564 - Cyanobacterial aminoacyl-tRNA synthetase, CAAD domain
  IPR033344 - Protein CURVATURE THYLAKOID 1


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6593469.1 Malonate--CoA ligase, partial [Cucurbita argyrosperma subsp. sororia] (e-value 8.3e-54, identity 80.95%)
Query:  MASIVATLPPPLLSPAKSFTLLNT-SHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDK
        MASI ATLPPPLL+P KSFT L T   KL+VFP IAKG SAN +VKAIGDSS SSTSI+IIKSV+NVWDQPEDR+ L GLG AAV G WTATNL+TA+DK
Subjt:  MASIVATLPPPLLSPAKSFTLLNT-SHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDK

Query:  LPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        LPLLPGVLE IGILVS WFVYRYLLFKPNR ELL+IINKSVADVLGQ
Subjt:  LPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

XP_022144822.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Momordica charantia] (e-value 7.0e-53, identity 78.77%)
Query:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL
        MASIVA+LPPPLL+P KSFTLLNTS KL+VFP IAK  SA V+VKA GDSS SS S++IIKSVQN+WDQPEDR+ L GLG AAVA +WTATN +TAIDKL
Subjt:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL

Query:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        PLLPG+LE IG LVS WFVYRYLLFKPNR ELL+IINKS+ADVLGQ
Subjt:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

XP_022964632.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Cucurbita moschata] (e-value 1.1e-53, identity 80.95%)
Query:  MASIVATLPPPLLSPAKSFTLLNT-SHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDK
        MASI ATLPPPLL+P KSFT L T   KL+VFP IAKG SAN +VKAIGDSS SSTSI+IIKSV+NVWDQPEDR+ L GLG AAV G WTATNL+TA+DK
Subjt:  MASIVATLPPPLLSPAKSFTLLNT-SHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDK

Query:  LPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        LPLLPGVLE IGILVS WFVYRYLLFKPNR ELL+IINKSVADVLGQ
Subjt:  LPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

XP_023000405.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Cucurbita maxima] (e-value 3.7e-54, identity 80.14%)
Query:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL
        MASI ATLPPPLL+P KSF  L T  K++VFP IAKG SAN +VKAIGDSS SSTSI+IIKSV+NVWDQPEDR+ L GLG AAV G WTATNL+TA+DKL
Subjt:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL

Query:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        PLLPGVLE IGILVS WFVYRYLLFKPNR ELL+IINKSVADVLGQ
Subjt:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

XP_038897597.1 protein CURVATURE THYLAKOID 1C, chloroplastic [Benincasa hispida] (e-value 6.3e-54, identity 79.45%)
Query:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL
        MASIVATLPPPLL+P KSFTLLNTS KL+ FP IA   S NV+VKA+G SS SSTS++IIKSV+NVWDQPEDR+AL GLG AAVA VWTATNL+TAIDKL
Subjt:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL

Query:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        PLLPGVLE IG LVS WFVYRYLLFKPNR ELL+IINKS+ DV GQ
Subjt:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KC66 CAAD domain-containing protein (e-value 1.3e-52, identity 76.71%)
Query:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL
        MASIVATLPPPLL+P KSFT+LN S KLSVF   A G S NV+VKA+G SS SSTS++I+KSV+NVWDQPEDR+AL GLG AAVA  WTATN++TAIDKL
Subjt:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL

Query:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        PLLPGVLE IG LVS WFVYRYLLFKPNR ELL+IINKS+ DV GQ
Subjt:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

A0A5D3BWY3 Protein CURVATURE THYLAKOID 1C (e-value 3.7e-52, identity 77.4%)
Query:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL
        MASIVATLPPPLL+P KSFT+LN S KLSV    AKG   NV+VKA+G SS SSTS++IIKSV+NVWDQPEDR AL GLG AAVA  WTATNL+TAIDKL
Subjt:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL

Query:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        PLLPGVLE IG LVS WFVYRYLLFKPNR ELL+IINKS+ DV GQ
Subjt:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

A0A6J1CUT1 protein CURVATURE THYLAKOID 1C, chloroplastic (e-value 3.4e-53, identity 78.77%)
Query:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL
        MASIVA+LPPPLL+P KSFTLLNTS KL+VFP IAK  SA V+VKA GDSS SS S++IIKSVQN+WDQPEDR+ L GLG AAVA +WTATN +TAIDKL
Subjt:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL

Query:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        PLLPG+LE IG LVS WFVYRYLLFKPNR ELL+IINKS+ADVLGQ
Subjt:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

A0A6J1HLG0 protein CURVATURE THYLAKOID 1C, chloroplastic (e-value 5.2e-54, identity 80.95%)
Query:  MASIVATLPPPLLSPAKSFTLLNT-SHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDK
        MASI ATLPPPLL+P KSFT L T   KL+VFP IAKG SAN +VKAIGDSS SSTSI+IIKSV+NVWDQPEDR+ L GLG AAV G WTATNL+TA+DK
Subjt:  MASIVATLPPPLLSPAKSFTLLNT-SHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDK

Query:  LPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        LPLLPGVLE IGILVS WFVYRYLLFKPNR ELL+IINKSVADVLGQ
Subjt:  LPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

A0A6J1KFS9 protein CURVATURE THYLAKOID 1C, chloroplastic (e-value 1.8e-54, identity 80.14%)
Query:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL
        MASI ATLPPPLL+P KSF  L T  K++VFP IAKG SAN +VKAIGDSS SSTSI+IIKSV+NVWDQPEDR+ L GLG AAV G WTATNL+TA+DKL
Subjt:  MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKL

Query:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        PLLPGVLE IGILVS WFVYRYLLFKPNR ELL+IINKSVADVLGQ
Subjt:  PLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

SwissProt top hits (e-value, % identity, alignment)
O04616 Protein CURVATURE THYLAKOID 1A, chloroplastic (e-value 2.8e-12, identity 35.64%)
Query:  KAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVL
        +A  + + S  +  +I  ++  WD  E++  ++  G  A+  VW ++ ++ AI+ +PLLP V+EL+G+  + WFVYRYLLFK +R EL E I      + 
Subjt:  KAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVL

Query:  G
        G
Subjt:  G

Q119Z5 Glutamate--tRNA ligase (e-value 1.3e-06, identity 35.29%)
Query:  VQNVWDQPEDRIALVG-LGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
        +  ++ Q + ++ L G L L  +   + A  +I A+D +P+L  + ELIG++   WFVYRYLL + NR ELL+ I     ++ G+
Subjt:  VQNVWDQPEDRIALVG-LGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

Q8LCA1 Protein CURVATURE THYLAKOID 1B, chloroplastic (e-value 3.8e-17, identity 37.72%)
Query:  NVLVKA---IGDSSGSSTSI------NIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGE
        NV+ +A   +G++  ++T         I+K+ Q  W++ +D+ A+  L  A V  +W +  +I+AID+LPL+PGVLEL+GI  + WF Y+ L+FKP+R  
Subjt:  NVLVKA---IGDSSGSSTSI------NIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGE

Query:  LLEIINKSVADVLG
        L E +  +  D+LG
Subjt:  LLEIINKSVADVLG

Q8LDD3 Protein CURVATURE THYLAKOID 1D, chloroplastic (e-value 5.1e-06, identity 33.33%)
Query:  GLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLG
        G  A+  ++  + ++++++ +PL P ++E++G+  + WF  RYLLFK NR EL   +++    VLG
Subjt:  GLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLG

Q9M812 Protein CURVATURE THYLAKOID 1C, chloroplastic (e-value 1.4e-35, identity 49.36%)
Query:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA
        MASI ATLP PLL   +           F+L   ++ LS   +     S +++VKA G+SS SST ++++ ++QNVWD+ EDR+ L+GLG A +  +W +
Subjt:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA

Query:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
         NLITAIDKLP++    EL+GIL S+WF YRYLLFKP+R EL +I+ KSVAD+LGQ
Subjt:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

Arabidopsis top hits (e-value, % identity, alignment)
AT1G52220.1 FUNCTIONS IN: molecular_function unknown (e-value 9.8e-37, identity 49.36%)
Query:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA
        MASI ATLP PLL   +           F+L   ++ LS   +     S +++VKA G+SS SST ++++ ++QNVWD+ EDR+ L+GLG A +  +W +
Subjt:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA

Query:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
         NLITAIDKLP++    EL+GIL S+WF YRYLLFKP+R EL +I+ KSVAD+LGQ
Subjt:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

AT1G52220.2 FUNCTIONS IN: molecular_function unknown (e-value 7.0e-35, identity 48.72%)
Query:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA
        MASI ATLP PLL   +           F+L   ++ LS   +     S +++VKA G+SS SST ++++ ++QN WD+ EDR+ L+GLG A +  +W +
Subjt:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA

Query:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
         NLITAIDKLP++    EL+GIL S+WF YRYLLFKP+R EL +I+ KSVAD+LGQ
Subjt:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

AT1G52220.3 FUNCTIONS IN: molecular_function unknown (e-value 4.4e-21, identity 39.74%)
Query:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA
        MASI ATLP PLL   +           F+L   ++ LS   +     S +++VKA G+SS SST ++++ ++QNV                        
Subjt:  MASIVATLPPPLLSPAKS----------FTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTA

Query:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
             AIDKLP++    EL+GIL S+WF YRYLLFKP+R EL +I+ KSVAD+LGQ
Subjt:  TNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ

AT2G46820.1 photosystem I P subunit (e-value 2.7e-18, identity 37.72%)
Query:  NVLVKA---IGDSSGSSTSI------NIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGE
        NV+ +A   +G++  ++T         I+K+ Q  W++ +D+ A+  L  A V  +W +  +I+AID+LPL+PGVLEL+GI  + WF Y+ L+FKP+R  
Subjt:  NVLVKA---IGDSSGSSTSI------NIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGE

Query:  LLEIINKSVADVLG
        L E +  +  D+LG
Subjt:  LLEIINKSVADVLG

AT2G46820.2 photosystem I P subunit (e-value 2.7e-18, identity 37.72%)
Query:  NVLVKA---IGDSSGSSTSI------NIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGE
        NV+ +A   +G++  ++T         I+K+ Q  W++ +D+ A+  L  A V  +W +  +I+AID+LPL+PGVLEL+GI  + WF Y+ L+FKP+R  
Subjt:  NVLVKA---IGDSSGSSTSI------NIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELIGILVSSWFVYRYLLFKPNRGE

Query:  LLEIINKSVADVLG
        L E +  +  D+LG
Subjt:  LLEIINKSVADVLG
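
Note on the % identity column: BLAST-style reports usually compute percent identity as the number of identical alignment columns divided by the total number of alignment columns, gaps included. The values listed above appear consistent with that convention (for instance, 80.95% would correspond to 119 identical columns in a 147-column alignment), but the record itself does not state the formula. The sketch below is a minimal Python illustration under that assumption; the function name `percent_identity` and the toy alignment are hypothetical and not part of the database.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity = identical columns / alignment columns * 100.

    Assumes a BLAST-style midline: a letter marks an identical residue,
    '+' marks a conservative substitution, and a space marks a mismatch
    or a gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / alignment_length, 2)


# Toy 7-column alignment (illustrative only, not taken from the record).
query = "MASIVAT"
midline = "MAS+ AT"
print(percent_identity(midline, len(query)))  # 71.43
```

The alignment length is passed in separately, taken from the query row, because trailing spaces in a copied midline are easily lost and would otherwise shorten the denominator.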


Sequences
CDS sequence
ATGGCTTCCATTGTTGCTACTCTTCCTCCGCCATTGTTGAGCCCCGCCAAAAGCTTCACCCTTCTCAATACTTCTCACAAGCTCTCTGTTTTTCCCATTATTGCAAAGGG
GCCTTCTGCCAATGTTCTTGTAAAGGCTATTGGGGACAGCTCTGGGTCTTCTACTTCCATCAATATTATTAAGTCTGTTCAAAATGTTTGGGATCAACCTGAAGATCGAA
TTGCACTTGTTGGTCTGGGATTAGCAGCTGTAGCTGGCGTATGGACAGCAACAAATCTTATTACGGCTATTGACAAGCTACCACTGCTTCCAGGTGTTTTAGAATTAATA
GGAATACTGGTTTCTTCGTGGTTCGTGTATCGTTACCTCTTGTTCAAACCAAACCGGGGAGAGCTTTTGGAGATAATCAACAAGTCAGTAGCTGATGTATTGGGACAGTA
A
mRNA sequence
GTTCGTACTTGAAAAAAAAGTACAGGTTCGTCAATCCGAAACGATGAATTCCTATTGATTTGATAATTTCGGCTTAATTTTTGATTTGGGATTATAAATGTGCACTCAAA
GTTCATTCATTTCCCATCCATTTCACTAATCATGGCTTCCATTGTTGCTACTCTTCCTCCGCCATTGTTGAGCCCCGCCAAAAGCTTCACCCTTCTCAATACTTCTCACA
AGCTCTCTGTTTTTCCCATTATTGCAAAGGGGCCTTCTGCCAATGTTCTTGTAAAGGCTATTGGGGACAGCTCTGGGTCTTCTACTTCCATCAATATTATTAAGTCTGTT
CAAAATGTTTGGGATCAACCTGAAGATCGAATTGCACTTGTTGGTCTGGGATTAGCAGCTGTAGCTGGCGTATGGACAGCAACAAATCTTATTACGGCTATTGACAAGCT
ACCACTGCTTCCAGGTGTTTTAGAATTAATAGGAATACTGGTTTCTTCGTGGTTCGTGTATCGTTACCTCTTGTTCAAACCAAACCGGGGAGAGCTTTTGGAGATAATCA
ACAAGTCAGTAGCTGATGTATTGGGACAGTAAGTTGTTCTTGTTGTTAAAGGCCCTATTTGGTCCAAACTGGCTTCTCTATCATTCACTGCTGGTTGGTTTCTGAAATTT
ATTCAACAGTCCAGTGTGAATTGTAACCAAGATTGGTGTTTGATTCATCAATGCATTCCAATGTGAAATCTTCAATCCTTTAGTTTATGCATATATCGGTTTATGTTCAA
AATTTTCTGTTATATTGGCTGTCTGAATCAGCTTACACAAACTTCAACTATTCTCTCAAAACAACCTGTCTGAACGTGCAACATTTGGATGTCAAAGAATTATGTAAGAC
ATTAATTCTTACATATGTCATGACCATGAATTGATCTCATAACCTTTTGATTTCTCAA
Protein sequence
MASIVATLPPPLLSPAKSFTLLNTSHKLSVFPIIAKGPSANVLVKAIGDSSGSSTSINIIKSVQNVWDQPEDRIALVGLGLAAVAGVWTATNLITAIDKLPLLPGVLELI
GILVSSWFVYRYLLFKPNRGELLEIINKSVADVLGQ
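
As a consistency check, the CDS sequence can be translated and compared with the protein sequence. The following is a minimal Python sketch, assuming the standard nuclear genetic code; the function name `translate` is illustrative, and the two fragments are the first 36 and last 21 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_36_nt = "ATGGCTTCCATTGTTGCTACTCTTCCTCCGCCATTG"
last_21_nt = "GCTGATGTATTGGGACAGTAA"
print(translate(first_36_nt))  # MASIVATLPPPL
print(translate(last_21_nt))   # ADVLGQ*
```

Both fragments agree with the start (MASIVATLPPPL) and end (ADVLGQ) of the protein sequence, and the CDS ends in the stop codon TAA.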