; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G13080 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G13080
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic
Genome locationClcChr02:24966476..24971449
RNA-Seq ExpressionClc02G13080
SyntenyClc02G13080
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045770.1 uncharacterized protein E6C27_scaffold243G002910 [Cucumis melo var. makuwa]4.0e-16778.27Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSST++HVSFIKD+AATEPPQHLFHLLKMLKTR               +  S+I             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSGEQNDELFLAAADAGQKLYGRGDFSES+ITNLDGYLLKKVGLFPDVIERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDN VERIAQCYEEAGLHEIA F
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

XP_004149691.1 protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic isoform X1 [Cucumis sativus]3.1e-16777.78Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSSTS+HVSFIKD+AATEPPQHLFHLLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSGEQNDELFLAAADAGQKLYGRGDFSES+ITNLDGYLLKKVGLFPD+IERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDN VERIAQCYEEAGL EIA F
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

XP_008457752.1 PREDICTED: uncharacterized protein LOC103497369 [Cucumis melo]3.7e-16878.02Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSST++HVSFIKD+AATEPPQHLFHLLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSGEQNDELFLAAADAGQKLYGRGDFSES+ITNLDGYLLKKVGLFPDVIERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDN VERIAQCYEEAGLHEIA F
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

XP_023533903.1 uncharacterized protein LOC111795609 [Cucurbita pepo subsp. pepo]3.7e-16074.57Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAA LPSLL  RRGVT+RCS+SSSTS+HVSFIKD+AATEPPQHL +LLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        +GSGEQNDELFLAAADAGQKLY RGDF+ES+I N+DGYLLKKVG+FPD+IERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEY KEKVTEEGK +DLKKGKAPAQVALDQAAFLLDLASVDGTWD SVERIAQCYEEAGL EIARF
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

XP_038900520.1 protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic-like [Benincasa hispida]5.6e-16977.78Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSSTS+HVSF+KDIAATEPPQHLFHLLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSG+QNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGR T
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEK+TEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDNSV+RIAQCYEEAGLHEIARF
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        +LYRD
Subjt:  VLYRD

TrEMBL top hitse value%identityAlignment
A0A0A0LJE6 Uncharacterized protein1.5e-16777.78Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSSTS+HVSFIKD+AATEPPQHLFHLLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSGEQNDELFLAAADAGQKLYGRGDFSES+ITNLDGYLLKKVGLFPD+IERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDN VERIAQCYEEAGL EIA F
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

A0A1S3C5T9 uncharacterized protein LOC1034973691.8e-16878.02Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSST++HVSFIKD+AATEPPQHLFHLLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSGEQNDELFLAAADAGQKLYGRGDFSES+ITNLDGYLLKKVGLFPDVIERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDN VERIAQCYEEAGLHEIA F
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

A0A5A7TRJ7 Uncharacterized protein2.0e-16778.27Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSST++HVSFIKD+AATEPPQHLFHLLKMLKTR               +  S+I             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSGEQNDELFLAAADAGQKLYGRGDFSES+ITNLDGYLLKKVGLFPDVIERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDN VERIAQCYEEAGLHEIA F
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

A0A5D3BJN5 Uncharacterized protein1.8e-16878.02Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAAALPSLL RRRGVTVRCSTSSST++HVSFIKD+AATEPPQHLFHLLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        RGSGEQNDELFLAAADAGQKLYGRGDFSES+ITNLDGYLLKKVGLFPDVIERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQ+DLKKGKAPAQVALDQAAFLLDLASVDGTWDN VERIAQCYEEAGLHEIA F
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VLYRD
Subjt:  VLYRD

A0A6J1G585 uncharacterized protein LOC111451004 isoform X29.7e-15973.58Show/hide
Query:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM
        MK GGGVVCGSPRAA LPSLL  RRGVT+RCS+SSST +HVSFIKD+AATEPPQHL +LLKMLKTRG  ++                             
Subjt:  MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAM

Query:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
                         S G KQ    L   L  +S                GT TALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA
Subjt:  CIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDA

Query:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
        +GSGEQNDELFLAAADAGQKLY RGD +ES+I N+DGYLLKKVG+FPD+IERKILRHF+EGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT
Subjt:  RGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKT

Query:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF
        EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEY KEKVTEEGK +DLKKGKAPAQVALDQAAFLLDLASVDGTWD SVERIAQCYEEAGL E+ARF
Subjt:  EAKDAARGALKSPWWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARF

Query:  VLYRD
        VL+RD
Subjt:  VLYRD

SwissProt top hitse value%identityAlignment
Q94JY0 Protein IN CHLOROPLAST ATPASE BIOGENESIS, chloroplastic1.5e-10854.34Show/hide
Query:  AAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDY
        + ALP   FR R        S  +S HVSFIKD+AATEPP HL HLLK+L+TRG  ++                                          
Subjt:  AAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDY

Query:  SSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLA
            S G KQ    L   L  +S                G+ TALLRWPTAP GM+MPVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A
Subjt:  SSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLA

Query:  AADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSP
        + +AG+KLY +G F+ES I NLD Y+LKKVGLFPD++ERK+LRHFDEGD VSA+VTGEFYTKK+ FPGF RP+V+ A +L KVGR  EAKDAAR AL+SP
Subjt:  AADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSP

Query:  WWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD
        WWTLGC YEEVA+IAQWEDEQIE+ +EKV++EG+ +DL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYEEAGLH I+ FVLY D
Subjt:  WWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD

Arabidopsis top hitse value%identityAlignment
AT2G23370.1 unknown protein1.0e-10451.81Show/hide
Query:  LLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDYSSRHST
        LL   R       +SSS SEH  FIKDIA  +PP+HL  LL +   RG                KS++                              S 
Subjt:  LLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDYSSRHST

Query:  GKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAADAGQ
        G KQ    L   L               + +  G++ ALLRWPTAP+ MEMPVV+V ++GVW LA NVDQFIHR+LVEED     E + E+F AA +AG+
Subjt:  GKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAADAGQ

Query:  KLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGC
        KLY +GDF+ SR+ +LD YLL+KVGLFPD +ERK++RH + GD VSALV  EFYTK+ +FPGFARP+ FNA+VLLK+GR  EAKDAARGALKS WWTLGC
Subjt:  KLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGC

Query:  KYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD
        +YEE+A IA+W +EQI  +KE+VT EGKQ D+ +GK  AQ +LD+AAFLL+LAS++GTWD S+ER+AQCY+EAGL++IA+FVLYRD
Subjt:  KYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD

AT4G34090.1 unknown protein1.1e-10954.34Show/hide
Query:  AAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDY
        + ALP   FR R        S  +S HVSFIKD+AATEPP HL HLLK+L+TRG  ++                                          
Subjt:  AAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDY

Query:  SSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLA
            S G KQ    L   L  +S                G+ TALLRWPTAP GM+MPVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A
Subjt:  SSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLA

Query:  AADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSP
        + +AG+KLY +G F+ES I NLD Y+LKKVGLFPD++ERK+LRHFDEGD VSA+VTGEFYTKK+ FPGF RP+V+ A +L KVGR  EAKDAAR AL+SP
Subjt:  AADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSP

Query:  WWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD
        WWTLGC YEEVA+IAQWEDEQIE+ +EKV++EG+ +DL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYEEAGLH I+ FVLY D
Subjt:  WWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD

AT4G34090.2 unknown protein3.1e-10954.34Show/hide
Query:  AAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDY
        + ALP   FR R        S  +S HVSFIKD+AATEPP HL HLLK+L+TRG  ++                                          
Subjt:  AAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDY

Query:  SSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLA
            S G KQ    L   L  +S                G+ TALLRWPTAP GM+MPVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A
Subjt:  SSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLA

Query:  AADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSP
        + +AG+KLY +G F+ES I NLD Y+LKKVGLFPD++ERK+LRHFDEGD VSA+VTGEFYTKK+ FPGF RP+V+ A +L KVGR  EAKDAAR AL+SP
Subjt:  AADAGQKLYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSP

Query:  WWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD
        WWTLGC YEEVA+IAQWEDEQIE+ +EKV++EG+ +DL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYEEAGLH I+ FVLY D
Subjt:  WWTLGCKYEEVANIAQWEDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD

AT4G34090.3 unknown protein1.5e-10655.11Show/hide
Query:  HVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFR
        HVSFIKD+AATEPP HL HLLK+L+TRG  ++                                              S G KQ    L   L  +S   
Subjt:  HVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQARDYSSRHSTGKKQLRYMLKFSLQIDSLFR

Query:  CMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYL
                     G+ TALLRWPTAP GM+MPVV+V R+GV L+A+NVD++IHR+LVEEDA    ++  EL+ A+ +AG+KLY +G F+ES I NLD Y+
Subjt:  CMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAADAGQKLYGRGDFSESRITNLDGYL

Query:  LKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLK------VGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDE
        LKKVGLFPD++ERK+LRHFDEGD VSA+VTGEFYTKK+ FPGF RP+V+ A +L K      VGR  EAKDAAR AL+SPWWTLGC YEEVA+IAQWEDE
Subjt:  LKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLK------VGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQWEDE

Query:  QIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD
        QIE+ +EKV++EG+ +DL KGKAP QVALD AAFLLDLAS++GTW  S+  IA+CYEEAGLH I+ FVLY D
Subjt:  QIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACGGGTGGTGGAGTGGTCTGTGGAAGTCCACGCGCCGCCGCTCTCCCCTCACTGCTTTTCCGCCGCCGTGGAGTCACCGTACGCTGCTCTACGTCTTCCTCAAC
TTCCGAGCATGTATCATTCATCAAAGATATTGCTGCTACTGAGCCTCCTCAGCATCTGTTTCATTTGCTGAAAATGCTGAAGACTAGAGGTACTCTAGTTCTTTGTCTTT
TGAAGTTGTTGACTGCAAAATTTATGAGCAAGAGTGTTATTGTACGCTTGATATGTAGTTCTGTATCCGTTTCTGCAATGTGCATCCATCATTTCTCCTGGAGCCAAGCA
AGGGATTATTCCTCTCGTCATTCCACTGGCAAAAAACAGCTCAGGTACATGTTAAAATTTTCTCTTCAAATTGACTCGCTTTTTAGGTGTATGATATCTCATCAAACTAT
TCTATTCTTGTTCCAAGGTACTACAACTGCACTGTTGCGCTGGCCTACAGCACCCGCTGGGATGGAGATGCCAGTAGTGGACGTCAATAGGAATGGAGTGTGGCTTCTAG
CCAAGAACGTTGATCAATTTATTCACAGACTTCTAGTTGAAGAAGATGCCAGAGGAAGTGGAGAGCAAAATGATGAGCTATTTCTTGCAGCAGCTGATGCTGGGCAGAAA
CTTTATGGAAGGGGTGATTTTTCTGAATCTCGGATCACAAACTTAGATGGATATCTGCTGAAGAAGGTTGGGTTGTTTCCAGATGTCATAGAACGTAAAATATTGCGCCA
TTTTGACGAAGGCGACCTTGTTTCAGCTCTGGTAACCGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGATTTGCACGACCATATGTATTCAATGCAGAGGTTCTGC
TGAAGGTGGGACGAAAAACTGAAGCTAAGGATGCTGCCAGGGGAGCCTTAAAATCACCATGGTGGACCCTTGGCTGTAAGTATGAGGAAGTTGCTAATATCGCACAATGG
GAAGATGAACAAATTGAATATTTCAAAGAAAAGGTCACAGAAGAAGGAAAGCAAGACGATCTCAAGAAGGGAAAGGCTCCTGCGCAGGTTGCCTTGGATCAAGCTGCCTT
TCTGTTGGATCTAGCTTCCGTTGATGGGACTTGGGACAACTCTGTGGAGCGTATTGCTCAATGTTATGAAGAGGCAGGCCTCCATGAGATTGCAAGATTCGTACTTTATA
GAGACTGA
mRNA sequenceShow/hide mRNA sequence
CTTCATATTTTGCTCTAAGAAAAAGGAAGAGATAAAAGCTGTTGTCGACGAAGCACACGTAAACTGACGGCTGCATTTTAGCAATCGTAAGCGAATCGTTAGCAAAAGAA
AATGAAAACGGGTGGTGGAGTGGTCTGTGGAAGTCCACGCGCCGCCGCTCTCCCCTCACTGCTTTTCCGCCGCCGTGGAGTCACCGTACGCTGCTCTACGTCTTCCTCAA
CTTCCGAGCATGTATCATTCATCAAAGATATTGCTGCTACTGAGCCTCCTCAGCATCTGTTTCATTTGCTGAAAATGCTGAAGACTAGAGGTACTCTAGTTCTTTGTCTT
TTGAAGTTGTTGACTGCAAAATTTATGAGCAAGAGTGTTATTGTACGCTTGATATGTAGTTCTGTATCCGTTTCTGCAATGTGCATCCATCATTTCTCCTGGAGCCAAGC
AAGGGATTATTCCTCTCGTCATTCCACTGGCAAAAAACAGCTCAGGTACATGTTAAAATTTTCTCTTCAAATTGACTCGCTTTTTAGGTGTATGATATCTCATCAAACTA
TTCTATTCTTGTTCCAAGGTACTACAACTGCACTGTTGCGCTGGCCTACAGCACCCGCTGGGATGGAGATGCCAGTAGTGGACGTCAATAGGAATGGAGTGTGGCTTCTA
GCCAAGAACGTTGATCAATTTATTCACAGACTTCTAGTTGAAGAAGATGCCAGAGGAAGTGGAGAGCAAAATGATGAGCTATTTCTTGCAGCAGCTGATGCTGGGCAGAA
ACTTTATGGAAGGGGTGATTTTTCTGAATCTCGGATCACAAACTTAGATGGATATCTGCTGAAGAAGGTTGGGTTGTTTCCAGATGTCATAGAACGTAAAATATTGCGCC
ATTTTGACGAAGGCGACCTTGTTTCAGCTCTGGTAACCGGAGAATTCTATACTAAAAAGGAGCACTTCCCAGGATTTGCACGACCATATGTATTCAATGCAGAGGTTCTG
CTGAAGGTGGGACGAAAAACTGAAGCTAAGGATGCTGCCAGGGGAGCCTTAAAATCACCATGGTGGACCCTTGGCTGTAAGTATGAGGAAGTTGCTAATATCGCACAATG
GGAAGATGAACAAATTGAATATTTCAAAGAAAAGGTCACAGAAGAAGGAAAGCAAGACGATCTCAAGAAGGGAAAGGCTCCTGCGCAGGTTGCCTTGGATCAAGCTGCCT
TTCTGTTGGATCTAGCTTCCGTTGATGGGACTTGGGACAACTCTGTGGAGCGTATTGCTCAATGTTATGAAGAGGCAGGCCTCCATGAGATTGCAAGATTCGTACTTTAT
AGAGACTGAATACACATATAAAAAGGCATTTTTCTACTCTCTCCTCTTCGTTTCCATTGTCCTGTTGTTCTCATTCCTTTTTTTACATGTAATTATATATATATATATAT
ATTTCTTCGTTATCTAAACATCATCCTTTTCTCTCCCCAATGGATGAGATTTTGATCATTTCACTACAAAAATGGGTCAATCACGAGACATAAAGTACTCGAAAGTTTCT
CTTGATAAATTGTAATGGTTCTTTCTATGCAA
Protein sequenceShow/hide protein sequence
MKTGGGVVCGSPRAAALPSLLFRRRGVTVRCSTSSSTSEHVSFIKDIAATEPPQHLFHLLKMLKTRGTLVLCLLKLLTAKFMSKSVIVRLICSSVSVSAMCIHHFSWSQA
RDYSSRHSTGKKQLRYMLKFSLQIDSLFRCMISHQTILFLFQGTTTALLRWPTAPAGMEMPVVDVNRNGVWLLAKNVDQFIHRLLVEEDARGSGEQNDELFLAAADAGQK
LYGRGDFSESRITNLDGYLLKKVGLFPDVIERKILRHFDEGDLVSALVTGEFYTKKEHFPGFARPYVFNAEVLLKVGRKTEAKDAARGALKSPWWTLGCKYEEVANIAQW
EDEQIEYFKEKVTEEGKQDDLKKGKAPAQVALDQAAFLLDLASVDGTWDNSVERIAQCYEEAGLHEIARFVLYRD