CuGenDBv2

CSPI03G43330 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G43330
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: protein SPA, chloroplastic
Genome location: Chr3:37252677..37256061
RNA-Seq Expression: CSPI03G43330
Synteny: CSPI03G43330
Gene Ontology terms:
  GO:0010206 - photosystem II repair (biological process)
  GO:0009507 - chloroplast (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003756 - protein disulfide isomerase activity (molecular function)
InterPro domains:
  IPR035272 - Protein of unknown function DUF5351
  IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004136388.1 protein disulfide-isomerase LQY1, chloroplastic [Cucumis sativus] (e value: 5.2e-86; % identity: 100)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_008466492.1 PREDICTED: protein SPA, chloroplastic [Cucumis melo] (e value: 1.7e-84; % identity: 98.09)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPSLSRLHSPFLYCPLKPT STSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVA+GIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_022142253.1 protein SPA, chloroplastic [Momordica charantia] (e value: 6.1e-79; % identity: 92.36)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MT APSLSRLHSPFLY PLKPT + SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGLVSVA+GIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQ+
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_022936404.1 protein SPA, chloroplastic [Cucurbita moschata] (e value: 1.5e-80; % identity: 93.63)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPSLSRLHSPFLYCPLKPT S SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGL S+A+GIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

XP_038899789.1 protein SPA, chloroplastic [Benincasa hispida] (e value: 1.7e-81; % identity: 94.9)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPS+SRLHSPFLYCPLKPT STSLSVTFS NQRSP +YPRIRAIDLDQNTVVALSVGLVSVA+GIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINCDG GTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LJ55 Uncharacterized protein (e value: 2.5e-86; % identity: 100)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A1S3CRE4 protein SPA, chloroplastic (e value: 8.1e-85; % identity: 98.09)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPSLSRLHSPFLYCPLKPT STSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVA+GIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A6J1CMU6 protein SPA, chloroplastic (e value: 3.0e-79; % identity: 92.36)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MT APSLSRLHSPFLY PLKPT + SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGLVSVA+GIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQ+
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINC+GVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A6J1F877 protein SPA, chloroplastic (e value: 7.1e-81; % identity: 93.63)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPSLSRLHSPFLYCPLKPT S SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGL S+A+GIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

A0A6J1IFD3 protein SPA, chloroplastic (e value: 7.1e-81; % identity: 93.63)
Query:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR
        MTVAPSLSRLHSPFLYCPLKPT S SLS+TFS NQRSP SYPRIRAIDLDQNTVVALSVGL S+A+GIGIPVFYETQIDNAAKRENTQPCFPC+GSGAQR
Subjt:  MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQR

Query:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        CRFCMGTGNVTVELGGD+KEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  CRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

SwissProt top hits (columns: e value, % identity, alignment)
K4BVL1 Protein SPA, chloroplastic (e value: 1.3e-63; % identity: 74.84)
Query:  MTVAPSLSRLHSPFLYCPLK-PTPSTS-LSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGA
        M  APSLSR  SPF+  PLK PT S+S  +  F    R   SYP I+A+DLDQNTV+A++VG++SVAIG+GIPVFYETQIDNAAKRENTQPCFPC+G+GA
Subjt:  MTVAPSLSRLHSPFLYCPLK-PTPSTS-LSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGA

Query:  QRCRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        Q+CRFCMGTG+VTVELGG + EVSRCINCDG G LTCTTCQGSGIQPRYLDRREFKDDD
Subjt:  QRCRFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

Q8GSJ6 Protein disulfide-isomerase LQY1, chloroplastic (e value: 8.4e-63; % identity: 75.64)
Query:  APSLSRLHSPFLYCPLKPTPSTSLSVTFSG-NQRSP-PSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQRC
        APS  RLHSPF++CP+  TPS     +FS  N RSP  SYPRI+A +LD NTVVA+SVG+ SVA+GIGIPVFYETQIDNAAKRENTQPCFPC+G+GAQ+C
Subjt:  APSLSRLHSPFLYCPLKPTPSTSLSVTFSG-NQRSP-PSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQRC

Query:  RFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        R C+G+GNVTVELGG +KEVS CINCDG G+LTCTTCQGSG+QPRYLDRREFKDDD
Subjt:  RFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G75690.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein (e value: 6.0e-64; % identity: 75.64)
Query:  APSLSRLHSPFLYCPLKPTPSTSLSVTFSG-NQRSP-PSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQRC
        APS  RLHSPF++CP+  TPS     +FS  N RSP  SYPRI+A +LD NTVVA+SVG+ SVA+GIGIPVFYETQIDNAAKRENTQPCFPC+G+GAQ+C
Subjt:  APSLSRLHSPFLYCPLKPTPSTSLSVTFSG-NQRSP-PSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQRC

Query:  RFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
        R C+G+GNVTVELGG +KEVS CINCDG G+LTCTTCQGSG+QPRYLDRREFKDDD
Subjt:  RFCMGTGNVTVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD

AT4G13670.1 plastid transcriptionally active 5 (e value: 9.0e-04; % identity: 30.48)
Query:  RAIDLDQNTVVALSVGL---VSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQRCRFCMGTGNVTVE------LGGDDKEVSRCINCDGVGTLTC
        R +D+ QN V  L        S  IG   PV      D +        C  C G G   C  C GTG   +E      +G D K    C  C+G+G   C
Subjt:  RAIDLDQNTVVALSVGL---VSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQRCRFCMGTGNVTVE------LGGDDKEVSRCINCDGVGTLTC

Query:  TTCQG
          C G
Subjt:  TTCQG
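
The % identity values listed for the hits can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST-style convention, which this page does not state explicitly: a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function names `midline_stats` and `percent_identity` are hypothetical helpers introduced for illustration. For the Cucumis melo hit, the two midline segments contain three non-identical columns out of 157, which is consistent with the listed 98.09.

```python
def midline_stats(midline: str) -> dict:
    """Count identical and positive columns in a BLAST-style midline.

    Assumption: letters mark identities, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return {"identical": identical, "positive": positive, "columns": len(midline)}


def percent_identity(identical: int, aligned_length: int) -> float:
    """Percent identity as identical columns over alignment length."""
    return round(100.0 * identical / aligned_length, 2)


# Second midline segment of the Cucumis melo alignment (57 columns, one '+').
segment = "CRFCMGTGNVTVELGGD+KEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD"
stats = midline_stats(segment)
print(stats)

# Whole Cucumis melo alignment: 154 identical columns out of 157.
print(percent_identity(154, 157))
```

Note that `columns` is only reliable when trailing spaces in the midline have not been trimmed, so the aligned length is safer taken from the query line.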


Sequences
CDS sequence
ATGACAGTAGCACCTTCGCTTTCCCGCCTCCATTCTCCATTTCTGTATTGTCCTCTCAAGCCAACTCCATCTACCTCTTTATCAGTCACATTCTCTGGAAATCAACGATC
GCCACCATCATATCCACGCATCAGAGCAATAGATCTTGACCAAAACACGGTGGTGGCACTTTCAGTTGGGCTGGTGAGTGTTGCAATTGGAATAGGTATTCCCGTCTTCT
ACGAAACCCAAATTGATAATGCTGCAAAGCGTGAAAATACTCAGCCCTGCTTTCCCTGCAGTGGTTCAGGAGCACAGAGATGCAGATTTTGCATGGGAACTGGCAATGTG
ACCGTAGAACTTGGTGGAGATGACAAAGAAGTCTCCCGGTGTATTAACTGTGATGGTGTTGGCACATTGACATGTACTACATGTCAGGGCTCTGGAATTCAACCTCGATA
CCTAGATCGCAGAGAATTTAAAGATGATGATTGA
mRNA sequence
TGGGTCTGTAGTAGAGCAGAGGCAGCCAGTAACGGATATCCAAAATCCTGGAATGCGTGGTGAAGCTTTGTAGCTTTGAAACAAACCACAGCCTCAAGTGTTTTCTTTAG
GAATTAGGACTGAGACTGACTTTCTTTTTCTTTCTGATTCGGAGCTCTCAGAAGCAAAGGAATTGAGTCCAAGGAATTCAGAAATGACAGTAGCACCTTCGCTTTCCCGC
CTCCATTCTCCATTTCTGTATTGTCCTCTCAAGCCAACTCCATCTACCTCTTTATCAGTCACATTCTCTGGAAATCAACGATCGCCACCATCATATCCACGCATCAGAGC
AATAGATCTTGACCAAAACACGGTGGTGGCACTTTCAGTTGGGCTGGTGAGTGTTGCAATTGGAATAGGTATTCCCGTCTTCTACGAAACCCAAATTGATAATGCTGCAA
AGCGTGAAAATACTCAGCCCTGCTTTCCCTGCAGTGGTTCAGGAGCACAGAGATGCAGATTTTGCATGGGAACTGGCAATGTGACCGTAGAACTTGGTGGAGATGACAAA
GAAGTCTCCCGGTGTATTAACTGTGATGGTGTTGGCACATTGACATGTACTACATGTCAGGGCTCTGGAATTCAACCTCGATACCTAGATCGCAGAGAATTTAAAGATGA
TGATTGAACCCCTGCAGAACCAAACCCGGCAACATCACCAATTCCGATTTTGTGTTTAATTTGATGGCTTGAACCTTGTTAGTTCCATCGAATTGAAGAGCTGGGTAACT
TCTACCTAGCACTTCTTTTCGGTAACATGCATTCTTCTAATGTATATCTGTGGTTAATAATACAAACATTGTAAAAGGAGAGAAGAATTAACTGTACTATCCCAAGATTA
TGAATTCATCTCTATTCACTGTCAGTGAAATCAGAAATGGGGGATCAAACGCAATCTGATAATCACAAGCACTTCTTTTTGGTACTTTTTCCGTGCATTCTTCTAAAGTA
TAAGAATATCTGTGGCTAATAGAGGAGAGAAGAATTTAGGTGTACTAAGCCAAGATTATGAATTCATCTGTATTCACTGTCAGTAAAATGGGGGGATCAAACTCTATCTG
ATATCACTACCTAGTAATAGTAGCCGGTGCAGGTGGATCTCATCTTTTAGGCACAAGCAGAGAACAGCTGAATTACCAGATCATACATGGTTAAAAATTTTGCAACCATA
TCAGGAATTTGAAACTGGAAGGGTCACTATGAAAAATCCATATAATTAATCGTGAAGTCCGTGAATTCAGATGCTAGCCAAACATGCAGAGGCAGCTAAATCACAGGCAA
GGAGGAGATCATTCCCGTAATGAGCCTATCAAATAATCGGCAACATTTGTTATAAGACATTTCAAGTAGACGAATTTCAAGGCACTGCAATGGGCATCTAAAAAAACAAT
GAATTTGGAAGCATATTTCCCGGGTAACCATGCCGGTGACAGAACCATGAAGAATCCTAATGCGCTGCTACCAAAACAATGCATGCTTGATTCTGAGCAAAAACCAGTTC
ATTGGAGTTAGCAGAGGCAAATAAGCGTATATTTACACTTTGAATATCGCAGCACCAGCACAAAAGCCATCCAGGTTAACATGAGGTAGCTTTACATTTACGACAGTTTT
CTTCTAACCACCTCAAACGTATCTTGTTCCGTTTAAGTAAAATAAAAGTCTCCCTAAAATTTGGGGTATCAAAGAGATCCAAAGCAACAAAATACAGCCGGTTATCATTG
CTTTCAATCTCACCCAGGGCACCTATGCAATTAGTTATACTGAGTCTAGGTCTTCGATCAGTAGGCATTCTTCAGCTGTTCTTGAAAGATTCTACTTCTAACAAGTCTAA
CAATGCTTATGCCATTATCAGATGGGTTTCTGTCATGCTTTGACCTATGGAAGTATGTGCACCTAACTGTCGCTTTTTCTTGCTCTACCACGGCCTTCGTCATTTTTCCA
GAAAATAACTAATTTCCATGCTTAGTTTTGGTTAGGGTTTTCAGGAGACATGGTTCCACGTCCACCCATCAGGACATGAATTTATGACTAATCTATTTGTTCCAGCAGAT
CTATCCAGCTATTCTTAGTGGCTGGATTGTGCATACTTCTTACCAGCAGCTGATTCGGTGAACATTGTGCACAACTGCTCATATATGATGGGACAACAATCTTCGAATTT
GAGTATCTCAAACCGATTCTGGACCTATGAAAAGAATCTACCAGAAAATTAAGGTAGACCTGTAAGGCAAAACTAGGTAAGCTTACGAAGGAAAGCAATGTAGCCCAAGA
TCCTCCCCCTAATCAAACCCAGTGCTACAAGAATCATCAATTGTGAGGTCAATTGTTATTGGAAGTTGATATCAGTTCGCTTATTGAGTTCATCACGAATGTGAATTCAT
GTTTTCTTGTCAAGCACGTTGTTTTGTCTCTCTGTTGAATGTGCGTGTAAACATTGTAAAGTATATCACCAAAGTATAATATAAATCAACAGATTCGGAAGG
Protein sequence
MTVAPSLSRLHSPFLYCPLKPTPSTSLSVTFSGNQRSPPSYPRIRAIDLDQNTVVALSVGLVSVAIGIGIPVFYETQIDNAAKRENTQPCFPCSGSGAQRCRFCMGTGNV
TVELGGDDKEVSRCINCDGVGTLTCTTCQGSGIQPRYLDRREFKDDD
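
The protein sequence corresponds to the CDS read codon by codon with the standard genetic code. The sketch below is a minimal illustration of that relationship, not the pipeline used to build this record. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 15 bases of the CDS are included to keep the example short; the full CDS (with line breaks removed) can be passed in the same way.

```python
# Standard genetic code, built from the conventional TCAG ordering:
# first base varies slowest, third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGACAGTAGCACCTTCGCTTTCCCGCCTC"  # first 30 bases of the CDS
cds_end = "AAAGATGATGATTGA"                  # last 15 bases, including the stop codon

print(translate(cds_start))  # MTVAPSLSRL
print(translate(cds_end))    # KDDD
```

Both fragments agree with the start (`MTVAPSLSRL`) and end (`KDDD`) of the protein sequence above.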