; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G01840 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G01840
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTLDc domain-containing protein
Genome locationChr3:1284356..1286535
RNA-Seq ExpressionCSPI03G01840
SyntenyCSPI03G01840
Gene Ontology termsGO:0015743 - malate transport (biological process)
GO:0034220 - ion transmembrane transport (biological process)
GO:0009705 - plant-type vacuole membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006571 - TLDc domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148731.1 uncharacterized protein LOC101203266 [Cucumis sativus]1.3e-136100Show/hide
Query:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
        MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
Subjt:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF

Query:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYA
        KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYA
Subjt:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYA

Query:  KRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        KRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
Subjt:  KRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY

XP_008448709.1 PREDICTED: uncharacterized protein LOC103490798 [Cucumis melo]2.9e-13397.44Show/hide
Query:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
        MASCIFNSVFYRLKTTPSCSFGKWNWNFG+GNKKQDKPQIKYHDIVLPFP+SLL+KTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
Subjt:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF

Query:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDD-DPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY
        KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNED+ DPI+LPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY
Subjt:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDD-DPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY

Query:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
Subjt:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY

XP_022145275.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111014765 [Momordica charantia]2.4e-11989.32Show/hide
Query:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
        MASCIFNS+FYRLKT P+ SFG WNW+FGNG+KK+DKP IKYH I LPFP SL++KTFLK +ELKCCYKATSDGFSATDFH CCDFKGPCVIIGYTDKSF
Subjt:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF

Query:  KFGAFNPEGYRSTDDYYDTFDAFLFYWK-DNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY
        KFGAFNPEGYRSTDDYYDTFDAFLFYW+ DNE  DPIILPKVGGSGAALFDYARGGPQFGADGLLI PPLAPVMGGFAGPDTNSGVGDLRQA+SRLGLSY
Subjt:  KFGAFNPEGYRSTDDYYDTFDAFLFYWK-DNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY

Query:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        AKRKDGK+SIFGDE RAVVAEVQVFCSPQIASLY
Subjt:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY

XP_023005957.1 uncharacterized protein LOC111498814 [Cucurbita maxima]1.5e-11889.61Show/hide
Query:  CIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFG
        C+      RLKTTPSCSFG WNWNFGN N K +KPQIKYHDI LPFPLSL++ TFLKRKELKCCYKATSDGFSATDFH CCDFKGPCVIIGYTDKSFKFG
Subjt:  CIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFG

Query:  AFNPEGYRSTDDYYDTFDAFLFYWKDNED-DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKR
        AFNPEGYRSTDDYYDTFDAFLFYW+ NE  DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSG+GDLRQA+SRLGLSYAKR
Subjt:  AFNPEGYRSTDDYYDTFDAFLFYWKDNED-DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKR

Query:  KDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        KDGKDSIFGD NRAVVAEVQVFCSPQIASLY
Subjt:  KDGKDSIFGDENRAVVAEVQVFCSPQIASLY

XP_038905422.1 uncharacterized protein LOC120091462 [Benincasa hispida]7.9e-12395.5Show/hide
Query:  RLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFGAFNPEGYR
        RLKTTPS SFG WNWNFGNGNKK++KPQIKYHDIVLPFPLSL++KTFLKRKELKCCYKATSDGFSATDFH CCDFKGPCVIIGYTDKSFKFGAFNPEGYR
Subjt:  RLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFGAFNPEGYR

Query:  STDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKRKDGKDSIFG
        STDDYYDTFDAFLFYW+DNED DPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKRKDGK+SIFG
Subjt:  STDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKRKDGKDSIFG

Query:  DENRAVVAEVQVFCSPQIASLY
        DENRAVVAEVQVFCSPQIASLY
Subjt:  DENRAVVAEVQVFCSPQIASLY

TrEMBL top hitse value%identityAlignment
A0A0A0L1B1 TLDc domain-containing protein6.1e-137100Show/hide
Query:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
        MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
Subjt:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF

Query:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYA
        KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYA
Subjt:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYA

Query:  KRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        KRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
Subjt:  KRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY

A0A1S3BKB8 uncharacterized protein LOC1034907981.4e-13397.44Show/hide
Query:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
        MASCIFNSVFYRLKTTPSCSFGKWNWNFG+GNKKQDKPQIKYHDIVLPFP+SLL+KTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
Subjt:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF

Query:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDD-DPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY
        KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNED+ DPI+LPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY
Subjt:  KFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDD-DPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY

Query:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
Subjt:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY

A0A6J1CU04 LOW QUALITY PROTEIN: uncharacterized protein LOC1110147651.1e-11989.32Show/hide
Query:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF
        MASCIFNS+FYRLKT P+ SFG WNW+FGNG+KK+DKP IKYH I LPFP SL++KTFLK +ELKCCYKATSDGFSATDFH CCDFKGPCVIIGYTDKSF
Subjt:  MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSF

Query:  KFGAFNPEGYRSTDDYYDTFDAFLFYWK-DNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY
        KFGAFNPEGYRSTDDYYDTFDAFLFYW+ DNE  DPIILPKVGGSGAALFDYARGGPQFGADGLLI PPLAPVMGGFAGPDTNSGVGDLRQA+SRLGLSY
Subjt:  KFGAFNPEGYRSTDDYYDTFDAFLFYWK-DNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSY

Query:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        AKRKDGK+SIFGDE RAVVAEVQVFCSPQIASLY
Subjt:  AKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY

A0A6J1G5M9 uncharacterized protein LOC1114510122.2e-11889.18Show/hide
Query:  CIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFG
        C+      RLKTTPSCSFG WNWNFGN N K +KPQIKYHDI LPFPLSL++ TFLKRKELKCCYKATSDGFSATDFH CCDFKGPCVIIGYTDKSFKFG
Subjt:  CIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFG

Query:  AFNPEGYRSTDDYYDTFDAFLFYWKDNED-DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKR
        AFNPEGYRSTDDYYDTFDAFLFYW+ NE  DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSG+GDLRQA+SRLGLSYAKR
Subjt:  AFNPEGYRSTDDYYDTFDAFLFYWKDNED-DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKR

Query:  KDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        KDGKDSIFGD NRAVVAEVQVFCSP+IASLY
Subjt:  KDGKDSIFGDENRAVVAEVQVFCSPQIASLY

A0A6J1KUL2 uncharacterized protein LOC1114988147.4e-11989.61Show/hide
Query:  CIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFG
        C+      RLKTTPSCSFG WNWNFGN N K +KPQIKYHDI LPFPLSL++ TFLKRKELKCCYKATSDGFSATDFH CCDFKGPCVIIGYTDKSFKFG
Subjt:  CIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFG

Query:  AFNPEGYRSTDDYYDTFDAFLFYWKDNED-DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKR
        AFNPEGYRSTDDYYDTFDAFLFYW+ NE  DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSG+GDLRQA+SRLGLSYAKR
Subjt:  AFNPEGYRSTDDYYDTFDAFLFYWKDNED-DDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKR

Query:  KDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        KDGKDSIFGD NRAVVAEVQVFCSPQIASLY
Subjt:  KDGKDSIFGDENRAVVAEVQVFCSPQIASLY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32520.1 unknown protein1.0e-9170.42Show/hide
Query:  MASCIFNSVFYRLKTTPSCSFGKWNWNFG-NGNKKQDK----PQIKYH-DIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIG
        MASC  ++ F+ +          +N  FG N  KK D      Q  YH D+ +PF LS++ KTFLK +ELKCCYKA+ DGF AT FH  CDFKGPCVII 
Subjt:  MASCIFNSVFYRLKTTPSCSFGKWNWNFG-NGNKKQDK----PQIKYH-DIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIG

Query:  YT-DKSFKFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARS
        YT DKSFKFG F+PEGYRSTDDYYDTFDAFLFYW + + DDPI+LPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSG+GDLR A+S
Subjt:  YT-DKSFKFGAFNPEGYRSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARS

Query:  RLGLSYAKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY
        RLGLSYAKRKDGK+SIFGDEN+  + +V VFCSP IASLY
Subjt:  RLGLSYAKRKDGKDSIFGDENRAVVAEVQVFCSPQIASLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTGCATCTTCAACAGTGTATTTTACAGGTTGAAGACGACCCCCAGCTGTTCATTTGGCAAATGGAATTGGAACTTTGGCAATGGCAACAAGAAACAAGACAA
ACCCCAAATCAAATACCATGATATTGTTCTTCCATTCCCTCTTTCTCTCCTTGAAAAAACATTCTTGAAACGTAAAGAACTCAAATGCTGCTATAAGGCCACATCAGATG
GATTCAGCGCTACCGATTTCCACGCGTGTTGCGACTTCAAGGGACCATGTGTCATAATTGGCTACACAGACAAATCCTTCAAGTTTGGTGCATTCAATCCTGAGGGCTAC
AGAAGCACAGATGACTACTACGACACTTTTGATGCATTCCTCTTCTATTGGAAAGACAATGAGGATGATGATCCCATCATCTTGCCCAAGGTTGGGGGTAGTGGTGCAGC
GCTATTCGATTACGCGAGGGGCGGGCCACAGTTTGGGGCTGATGGATTGCTCATTGGCCCTCCTTTGGCACCTGTAATGGGTGGCTTTGCTGGGCCAGACACGAACTCGG
GCGTCGGGGACCTAAGGCAAGCGAGGTCTCGGCTTGGACTATCTTATGCCAAGAGAAAAGATGGGAAGGACTCCATTTTTGGAGATGAAAATAGAGCTGTTGTTGCTGAA
GTGCAGGTCTTTTGTAGCCCTCAAATAGCAAGCTTGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATTGGATTGAATTGGCGCCGAGAGAAAAAAATGGGTCAGAAAAATTTTAGAGATAAGGTGATGTTTTTTTCTTCAAAATCCCACTTTGGAATTGAAATGAATGTGAATCA
ATGGCTTCTTGCATCTTCAACAGTGTATTTTACAGGTTGAAGACGACCCCCAGCTGTTCATTTGGCAAATGGAATTGGAACTTTGGCAATGGCAACAAGAAACAAGACAA
ACCCCAAATCAAATACCATGATATTGTTCTTCCATTCCCTCTTTCTCTCCTTGAAAAAACATTCTTGAAACGTAAAGAACTCAAATGCTGCTATAAGGCCACATCAGATG
GATTCAGCGCTACCGATTTCCACGCGTGTTGCGACTTCAAGGGACCATGTGTCATAATTGGCTACACAGACAAATCCTTCAAGTTTGGTGCATTCAATCCTGAGGGCTAC
AGAAGCACAGATGACTACTACGACACTTTTGATGCATTCCTCTTCTATTGGAAAGACAATGAGGATGATGATCCCATCATCTTGCCCAAGGTTGGGGGTAGTGGTGCAGC
GCTATTCGATTACGCGAGGGGCGGGCCACAGTTTGGGGCTGATGGATTGCTCATTGGCCCTCCTTTGGCACCTGTAATGGGTGGCTTTGCTGGGCCAGACACGAACTCGG
GCGTCGGGGACCTAAGGCAAGCGAGGTCTCGGCTTGGACTATCTTATGCCAAGAGAAAAGATGGGAAGGACTCCATTTTTGGAGATGAAAATAGAGCTGTTGTTGCTGAA
GTGCAGGTCTTTTGTAGCCCTCAAATAGCAAGCTTGTATTGAAAGTGGAATGAGCAAGCAGCCTCTAAAGGTGTTTTGCATTTGATGGTTATGTATATTCCTATTGCCAT
GGAACTTTGTTCATTGATTCTTTGTTCTTACAGAAATAATAGTGAATCGTTCAAATTATGAGGACAAAGCCCTAGTTCGTAAACATAGTAAGATAATATAATATAACGTA
CTAGTTTTATTTTATTATAAAAATTTGCCCG
Protein sequenceShow/hide protein sequence
MASCIFNSVFYRLKTTPSCSFGKWNWNFGNGNKKQDKPQIKYHDIVLPFPLSLLEKTFLKRKELKCCYKATSDGFSATDFHACCDFKGPCVIIGYTDKSFKFGAFNPEGY
RSTDDYYDTFDAFLFYWKDNEDDDPIILPKVGGSGAALFDYARGGPQFGADGLLIGPPLAPVMGGFAGPDTNSGVGDLRQARSRLGLSYAKRKDGKDSIFGDENRAVVAE
VQVFCSPQIASLY