; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G011000 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G011000
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionBEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein .
Genome locationchr03:21035284..21042107
RNA-Seq ExpressionLsi03G011000
SyntenyLsi03G011000
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia]1.2e-10891.56Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+TPSTGTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSE
        +T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSMELPYCSMPEPGP+IEAEER CS +KSLVDERVYQL ECSSMGVSE
Subjt:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSE

Query:  PEYNEEKSCKDLNRDMKDMNSESGG
        PEYNE+KSCKDLNR+MKD  SESGG
Subjt:  PEYNEEKSCKDLNRDMKDMNSESGG

XP_004152357.1 uncharacterized protein LOC101212915 isoform X1 [Cucumis sativus]2.5e-11192.98Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  S-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--G
        S TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QPPGPSSMELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  G
Subjt:  S-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--G

Query:  VSEPEYNEEKSCKDLNRDMKDMNSESGG
        VSE EYNE+KSCKDLNRDMKD  S SGG
Subjt:  VSEPEYNEEKSCKDLNRDMKDMNSESGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]1.3e-11091.63Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PS GTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GV
        STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSMELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GV
Subjt:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GV

Query:  SEPEYNEEKSCKDLNRDMKDMNSESGG
        SE EYNE+KSCKDLNRDMKD  S+SGG
Subjt:  SEPEYNEEKSCKDLNRDMKDMNSESGG

XP_031739536.1 uncharacterized protein LOC101212915 isoform X3 [Cucumis sativus]1.7e-11093.78Show/hide
Query:  IMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCS-T
        IMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGCS T
Subjt:  IMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCS-T

Query:  SPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSE
        SPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QPPGPSSMELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GVSE
Subjt:  SPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GVSE

Query:  PEYNEEKSCKDLNRDMKDMNSESGG
         EYNE+KSCKDLNRDMKD  S SGG
Subjt:  PEYNEEKSCKDLNRDMKDMNSESGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]9.4e-11494.22Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVGDPLSENLMDSPARSESMLYQR+EMSWQYSPMSEDSDDCRFCET+TNLFP+QSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSE
        STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSMELPYCSMPEPGP+IEAEERPCSC+KSLVDER +QLEECSSMGVSE
Subjt:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSE

Query:  PEYNEEKSCKDLNRDMKDMNSESGG
        PEYNEEKSCKDLNRDMKD  SESGG
Subjt:  PEYNEEKSCKDLNRDMKDMNSESGG

TrEMBL top hitse value%identityAlignment
A0A0A0KWN0 Uncharacterized protein1.2e-11192.98Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSGV PSTGTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  S-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--G
        S TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QPPGPSSMELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  G
Subjt:  S-TSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--G

Query:  VSEPEYNEEKSCKDLNRDMKDMNSESGG
        VSE EYNE+KSCKDLNRDMKD  S SGG
Subjt:  VSEPEYNEEKSCKDLNRDMKDMNSESGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X16.1e-11191.63Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PS GTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GV
        STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSMELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GV
Subjt:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GV

Query:  SEPEYNEEKSCKDLNRDMKDMNSESGG
        SE EYNE+KSCKDLNRDMKD  S+SGG
Subjt:  SEPEYNEEKSCKDLNRDMKDMNSESGG

A0A5A7TRC2 Uncharacterized protein6.1e-11191.63Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVG+PLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+ PS GTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GV
        STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSMELPYCSMPEPGP+IEAE+RPCSC+KSLVDERVYQLEECSSM  GV
Subjt:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSM--GV

Query:  SEPEYNEEKSCKDLNRDMKDMNSESGG
        SE EYNE+KSCKDLNRDMKD  S+SGG
Subjt:  SEPEYNEEKSCKDLNRDMKDMNSESGG

A0A6J1F182 uncharacterized protein LOC111441454 isoform X21.1e-10792.31Show/hide
Query:  MASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSP
        MASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+TPSTGTNTSLGC+T P
Subjt:  MASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGCSTSP

Query:  VTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYN
        VTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSMELPYCSMPEPGP+IEAEER CS +KSLVDERVYQL ECSSMGVSEPEYN
Subjt:  VTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSEPEYN

Query:  EEKSCKDLNRDMKDMNSESGG
        E+KSCKDLNR+MKD  SESGG
Subjt:  EEKSCKDLNRDMKDMNSESGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X15.7e-10991.56Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC
        L++IMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRFCET+TNLFPSQSDSSVPTSPVSPYRYQRPFSG+TPSTGTNTSLGC
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNTSLGC

Query:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSE
        +T PVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMR QP GPSSMELPYCSMPEPGP+IEAEER CS +KSLVDERVYQL ECSSMGVSE
Subjt:  STSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECSSMGVSE

Query:  PEYNEEKSCKDLNRDMKDMNSESGG
        PEYNE+KSCKDLNR+MKD  SESGG
Subjt:  PEYNEEKSCKDLNRDMKDMNSESGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2)6.2e-4753.74Show/hide
Query:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNT--
        L++IMA SLNGLTVGD L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE  TAT    S    S PTSPVSPYRYQRP +       + T  
Subjt:  LAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCE--TATNLFPSQSDSSVPTSPVSPYRYQRPFSGVTPSTGTNT--

Query:  --------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPS-IEAEERPCSCLKSLVDERV
                S+  + +  T+ Q  QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQMRTQP G SS           GPS I+ EER CS  KS+ ++R 
Subjt:  --------SLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPS-IEAEERPCSCLKSLVDERV

Query:  YQL-EECSSMGVSEPEYNEEKSCKDLN
        Y   E+     VS    ++ KSCK L+
Subjt:  YQL-EECSSMGVSEPEYNEEKSCKDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGTGCATTGCCTATTTGCCGTTCTTGAATGTAAAATTTCCTCCTAACCTACATCTCGTCTTTGTTAATAATCTTGCACAGATAATGGCTTCGAGTTTGAATGG
ATTGACGGTTGGAGACCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGTCAGAGTCCATGCTTTATCAAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCAG
AAGATTCAGATGACTGCCGGTTTTGTGAGACAGCTACGAACTTATTTCCCTCGCAGTCTGATAGTAGTGTACCTACCAGCCCAGTTTCTCCATACCGATATCAGAGGCCA
TTTAGTGGGGTGACTCCTTCAACAGGTACCAACACTTCACTTGGATGTTCTACTAGTCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGATTCTGAGGGCCGTTT
CCCATCATCTCCAAGCGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGCTCCTGCGTTCGGTACAAATGAGAACACAACCTCCCGGTCCATCATCTATGGAGTTGC
CATATTGCTCCATGCCTGAGCCTGGGCCTAGTATAGAAGCTGAAGAGCGGCCATGTTCTTGCTTAAAATCGTTGGTTGATGAAAGAGTTTATCAACTCGAGGAATGCTCG
TCAATGGGAGTGTCCGAGCCTGAATATAATGAAGAAAAATCATGCAAGGACTTGAATAGGGACATGAAGGACATGAACAGTGAGTCCGGAGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTGTGCATTGCCTATTTGCCGTTCTTGAATGTAAAATTTCCTCCTAACCTACATCTCGTCTTTGTTAATAATCTTGCACAGATAATGGCTTCGAGTTTGAATGG
ATTGACGGTTGGAGACCCCCTTTCAGAGAATCTCATGGATTCCCCTGCAAGGTCAGAGTCCATGCTTTATCAAAGGGATGAAATGTCCTGGCAATATTCTCCTATGTCAG
AAGATTCAGATGACTGCCGGTTTTGTGAGACAGCTACGAACTTATTTCCCTCGCAGTCTGATAGTAGTGTACCTACCAGCCCAGTTTCTCCATACCGATATCAGAGGCCA
TTTAGTGGGGTGACTCCTTCAACAGGTACCAACACTTCACTTGGATGTTCTACTAGTCCCGTCACTAGCTTGCAGCCCCATCAACGTGGATCAGATTCTGAGGGCCGTTT
CCCATCATCTCCAAGCGATATATGTCACTCAGCAGATCTGAGAAGGGCTGCGCTCCTGCGTTCGGTACAAATGAGAACACAACCTCCCGGTCCATCATCTATGGAGTTGC
CATATTGCTCCATGCCTGAGCCTGGGCCTAGTATAGAAGCTGAAGAGCGGCCATGTTCTTGCTTAAAATCGTTGGTTGATGAAAGAGTTTATCAACTCGAGGAATGCTCG
TCAATGGGAGTGTCCGAGCCTGAATATAATGAAGAAAAATCATGCAAGGACTTGAATAGGGACATGAAGGACATGAACAGTGAGTCCGGAGGGTAGTAAATGCTAAAAAA
TGTACAGAGGAAAATCTCTTCTAACATAACACATGGAACAAATTGCTGCTGTGGGACTGGAAAGTGTTTGCCTTTTGGTTCAACCTCCATGGAAATTCTCTTTTTTGCAT
ATGCTGCCTAATTTACTTGCTTTCGCTTGATTGAGGATGGCTTGGTCTTCTGCTAGCTCCATTCCCAAGTGCGTTGTGCGTTGTGCCTTGTGCTATCTTCATTGTCAAGT
GTGTGTGGAAGTTATGTTTATCTATTTGACTCTTCTGAAGCCCGAGTTGAGAAGATGTTCATGGTTGATCATGTGTGAAAGCTGTGTCTATCTGTTTGATTCTATTGGAG
CTGGAGTCGAGAGAAAATGTTTCTCAGAGTTAAGTTTCTAGATAAAAAGCTGCGGCTTTATGCGTCATAAGGTTGAAGTTGCAAATGTTCCATTTGTTTCATATTTGTGA
TAACGCTGTTATTATATAATTTTTGTTCTAATTGCCTTCTGAACATGCAGGCAAAATTGATGGGAAAACCAAAAGGTGTGTAGAGATGAAATTATATATTTTGATAGGGC
TGAAATAAGGAAGAGATAAAAA
Protein sequenceShow/hide protein sequence
MALCIAYLPFLNVKFPPNLHLVFVNNLAQIMASSLNGLTVGDPLSENLMDSPARSESMLYQRDEMSWQYSPMSEDSDDCRFCETATNLFPSQSDSSVPTSPVSPYRYQRP
FSGVTPSTGTNTSLGCSTSPVTSLQPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRTQPPGPSSMELPYCSMPEPGPSIEAEERPCSCLKSLVDERVYQLEECS
SMGVSEPEYNEEKSCKDLNRDMKDMNSESGG