; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011123 (gene) of Chayote v1 genome

Gene IDSed0011123
OrganismSechium edule (Chayote v1)
DescriptionVitamin K-dependent protein S-like
Genome locationLG09:12967318..12970325
RNA-Seq ExpressionSed0011123
SyntenySed0011123
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000742 - EGF-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593876.1 hypothetical protein SDJN03_13352, partial [Cucurbita argyrosperma subsp. sororia]5.5e-9275.68Show/hide
Query:  MASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTHSC
        MA+IAVLALLL+  SA AN+FNDLL+PLLSPIF++VCK+V+CGKG C+PS N  SFSFECDCDSGWK+SL DDDD DD+DD HFKFLPC+IP C LTHSC
Subjt:  MASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTHSC

Query:  SSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSPTNNNNAASGLFLK
        SSAPPPG+QTKPRT  SI DPCSWV+CGGGSCNKTSPLTYKCDCL  YYNLLNIT+F C+KDCSIG+DCK+LGIPV++S   T++S   NNNAASGL LK
Subjt:  SSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSPTNNNNAASGLFLK

Query:  RGFISTVSSVVTCVTTLLMLIE
        R  +ST+SSVVTCV TLL+LI+
Subjt:  RGFISTVSSVVTCVTTLLMLIE

TYK30143.1 vitamin K-dependent protein S-like [Cucumis melo var. makuwa]5.5e-10077.45Show/hide
Query:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL
        M ISL C+RWRVMASI V LALLLVF SAKA+DFNDLL+PLLSPIF+NVCKEV+CGKG C+ SGNG SFSFECDC+SGWK++LFDD   DD+D +HFKFL
Subjt:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL

Query:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP
        PC+IPKCNLTHSCSSAPPPGVQTKPR N +ILDPCSWVDCGGG CNKTSPLTYKC+CLEGYYNLLNIT+F C+KDCSIG+DCK+LGIPV+NS ++T+++ 
Subjt:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP

Query:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE
        T NNNAAS LFLKRG +ST+SSVV  + TLL+LI+
Subjt:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE

XP_004145389.1 neurogenic locus notch homolog protein 1 [Cucumis sativus]2.6e-10280.43Show/hide
Query:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL
        M ISL C+RWRVMASI + L LLLVF SAKA+D NDLL+PLLSPIF+NVCKEV+CGKG C+ SGNG SFSFECDCDSGWK++LFDDDD DD D +HFKFL
Subjt:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL

Query:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNS-TSTTSNS
        PC+IPKCNLTHSCSSAPPPGVQTKPRTN++ILDPCSWVDCGGG CNKTSPLTYKC+CLEGYYNLLNIT+F C+KDCSIG+DCK+LGIPV+NS  STTS S
Subjt:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNS-TSTTSNS

Query:  PTNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLI
         TNNNNAASGLFLK+G +ST+SSVV  V TLL+LI
Subjt:  PTNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLI

XP_008449322.1 PREDICTED: uncharacterized protein LOC103491235 [Cucumis melo]5.5e-10077.45Show/hide
Query:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL
        M ISL C+RWRVMASI V LALLLVF SAKA+DFNDLL+PLLSPIF+NVCKEV+CGKG C+ SGNG SFSFECDC+SGWK++LFDD   DD+D +HFKFL
Subjt:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL

Query:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP
        PC+IPKCNLTHSCSSAPPPGVQTKPR N +ILDPCSWVDCGGG CNKTSPLTYKC+CLEGYYNLLNIT+F C+KDCSIG+DCK+LGIPV+NS ++T+++ 
Subjt:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP

Query:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE
        T NNNAAS LFLKRG +ST+SSVV  + TLL+LI+
Subjt:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE

XP_038906891.1 uncharacterized protein LOC120092768 [Benincasa hispida]1.2e-9978.97Show/hide
Query:  MPISLICERWRVMASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLP
        M ISL  +RWR MASIA L+LLL+F SAKANDFNDLL+PLLSPIF+NVCKEV+CGKG C+ SGNGT FSFECDCDSGWK+SL DDDD    D SHFKFLP
Subjt:  MPISLICERWRVMASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLP

Query:  CVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNST-STTSNSP
        C+IPKCNLTHSCSSAP PGVQTKPRTN+SI DPCSWVDCGGG CNKTSPLTYKCDCLEGYYNLL+IT+F C+K+CSIG+DC++LGIPV+NST ST S S 
Subjt:  CVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNST-STTSNSP

Query:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLML
        TNNNNAASGLFLKRG +ST+SSVV   +TLL+L
Subjt:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLML

TrEMBL top hitse value%identityAlignment
A0A0A0LKT4 Uncharacterized protein1.3e-10280.43Show/hide
Query:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL
        M ISL C+RWRVMASI + L LLLVF SAKA+D NDLL+PLLSPIF+NVCKEV+CGKG C+ SGNG SFSFECDCDSGWK++LFDDDD DD D +HFKFL
Subjt:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL

Query:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNS-TSTTSNS
        PC+IPKCNLTHSCSSAPPPGVQTKPRTN++ILDPCSWVDCGGG CNKTSPLTYKC+CLEGYYNLLNIT+F C+KDCSIG+DCK+LGIPV+NS  STTS S
Subjt:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNS-TSTTSNS

Query:  PTNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLI
         TNNNNAASGLFLK+G +ST+SSVV  V TLL+LI
Subjt:  PTNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLI

A0A1S3BLS9 uncharacterized protein LOC1034912352.7e-10077.45Show/hide
Query:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL
        M ISL C+RWRVMASI V LALLLVF SAKA+DFNDLL+PLLSPIF+NVCKEV+CGKG C+ SGNG SFSFECDC+SGWK++LFDD   DD+D +HFKFL
Subjt:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL

Query:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP
        PC+IPKCNLTHSCSSAPPPGVQTKPR N +ILDPCSWVDCGGG CNKTSPLTYKC+CLEGYYNLLNIT+F C+KDCSIG+DCK+LGIPV+NS ++T+++ 
Subjt:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP

Query:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE
        T NNNAAS LFLKRG +ST+SSVV  + TLL+LI+
Subjt:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE

A0A5D3E219 Vitamin K-dependent protein S-like2.7e-10077.45Show/hide
Query:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL
        M ISL C+RWRVMASI V LALLLVF SAKA+DFNDLL+PLLSPIF+NVCKEV+CGKG C+ SGNG SFSFECDC+SGWK++LFDD   DD+D +HFKFL
Subjt:  MPISLICERWRVMASIAV-LALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFL

Query:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP
        PC+IPKCNLTHSCSSAPPPGVQTKPR N +ILDPCSWVDCGGG CNKTSPLTYKC+CLEGYYNLLNIT+F C+KDCSIG+DCK+LGIPV+NS ++T+++ 
Subjt:  PCVIPKCNLTHSCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSP

Query:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE
        T NNNAAS LFLKRG +ST+SSVV  + TLL+LI+
Subjt:  TNNNNAASGLFLKRGFISTVSSVVTCVTTLLMLIE

A0A6J1EQD2 uncharacterized protein LOC1114367485.6e-9074.32Show/hide
Query:  MASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTHSC
        MA+IAVLALLL+  SA AN+FNDLL+PLLSPIF++VCK+V+CGKG C+PS N  SFSFECDCDSGWK+SL DDDD    DD HFKFLPC+IP C LTHSC
Subjt:  MASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTHSC

Query:  SSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSPTNNNNAASGLFLK
        S APPPG+QTKPRT  SI DPCSWV+CGGGSCNKTSPLTYKCDCL  YYNLLNIT+F C+KDCSIG+DCK+LGIPV++S   T++S   NNNAASGL LK
Subjt:  SSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSPTNNNNAASGLFLK

Query:  RGFISTVSSVVTCVTTLLMLIE
        R  +ST+SSVVTCV TLL+LI+
Subjt:  RGFISTVSSVVTCVTTLLMLIE

A0A6J1KDY4 uncharacterized protein LOC1114947875.0e-9174.77Show/hide
Query:  MASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTHSC
        MA+IAVLALLL+  SA AN+ NDLL+PLLSPIF+NVCK+V+CGKG C+PS N T FSFECDCDSGWK+SL DD   DD+DD HFKFLPC+IP C LTHSC
Subjt:  MASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTHSC

Query:  SSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSPTNNNNAASGLFLK
        SSAPPPG+QTKPRTN  I DPCSWVDCGGGSCNKTSP TYKCDCL  YYNLLNIT+F C+KDCSIG+DCK+LGIPV++S   T+ S   NNNAASGL LK
Subjt:  SSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSPTNNNNAASGLFLK

Query:  RGFISTVSSVVTCVTTLLMLIE
        RG +S +SSVVTCV TLL+LI+
Subjt:  RGFISTVSSVVTCVTTLLMLIE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G14746.1 CONTAINS InterPro DOMAIN/s: EGF-like (InterPro:IPR006210); Has 259 Blast hits to 234 proteins in 55 species: Archae - 0; Bacteria - 0; Metazoa - 184; Fungi - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink).6.4e-4643.35Show/hide
Query:  RVMASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTH
        R+  ++ ++A LL+ +   A   +D L+PL +P++DN+CKEV+CGKGKC+   N T+F +EC+C+ GWK           + D H KFLPC+ P C    
Subjt:  RVMASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTH

Query:  SCSSAPPPGVQTKPRTNQSI---LDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNS-PTNNNNAA
        +C  A  P  Q KP    +I    DPC W+DCGGG CN + P  Y C+C EGY NL+NIT+F C K C++G+DC  LGIP+SNS+S++  + P ++ N  
Subjt:  SCSSAPPPGVQTKPRTNQSI---LDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNS-PTNNNNAA

Query:  SGL
          L
Subjt:  SGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCATTTCTCTGATTTGTGAACGTTGGAGGGTTATGGCTTCCATTGCTGTTCTTGCTCTGTTGCTCGTATTTCACTCTGCCAAGGCTAATGACTTCAACGATCTATT
GGCTCCTCTTCTCTCGCCCATTTTCGATAATGTGTGCAAAGAAGTGGACTGTGGGAAAGGAAAGTGTGAACCTTCTGGTAATGGGACTTCATTTTCGTTTGAATGTGATT
GTGATTCTGGGTGGAAGAGGTCTCTTTTTGATGATGATGATGAAGATGATGAAGATGATAGCCACTTCAAGTTTCTCCCTTGCGTTATTCCCAAATGTAATTTGACTCAT
TCGTGCTCATCGGCTCCTCCACCAGGCGTACAGACGAAACCAAGAACTAATCAATCGATACTCGATCCTTGCAGTTGGGTAGACTGTGGAGGAGGTTCGTGCAACAAGAC
ATCGCCATTGACATACAAATGTGATTGCTTGGAGGGTTACTATAATCTTCTCAACATCACTTCCTTTGGATGTTTCAAAGATTGCTCCATAGGATTGGATTGCAAGAAAT
TGGGAATTCCAGTGTCAAACTCTACTTCTACAACTTCTAATTCACCAACAAATAACAACAATGCTGCTAGCGGATTATTTCTCAAGCGAGGCTTCATCTCAACGGTTAGC
TCGGTGGTAACGTGTGTCACCACACTGCTGATGCTGATCGAATAA
mRNA sequenceShow/hide mRNA sequence
ATCAGTTTATGGCTTAATTCAAAAAAACCCCAACTTCCAACCACCCAAAGCCGTCCCATAAAACCCCAATTGCGTAGAGCTTTTAAGCATTTCAAATCCCAAATGCCCAT
TTCTCTGATTTGTGAACGTTGGAGGGTTATGGCTTCCATTGCTGTTCTTGCTCTGTTGCTCGTATTTCACTCTGCCAAGGCTAATGACTTCAACGATCTATTGGCTCCTC
TTCTCTCGCCCATTTTCGATAATGTGTGCAAAGAAGTGGACTGTGGGAAAGGAAAGTGTGAACCTTCTGGTAATGGGACTTCATTTTCGTTTGAATGTGATTGTGATTCT
GGGTGGAAGAGGTCTCTTTTTGATGATGATGATGAAGATGATGAAGATGATAGCCACTTCAAGTTTCTCCCTTGCGTTATTCCCAAATGTAATTTGACTCATTCGTGCTC
ATCGGCTCCTCCACCAGGCGTACAGACGAAACCAAGAACTAATCAATCGATACTCGATCCTTGCAGTTGGGTAGACTGTGGAGGAGGTTCGTGCAACAAGACATCGCCAT
TGACATACAAATGTGATTGCTTGGAGGGTTACTATAATCTTCTCAACATCACTTCCTTTGGATGTTTCAAAGATTGCTCCATAGGATTGGATTGCAAGAAATTGGGAATT
CCAGTGTCAAACTCTACTTCTACAACTTCTAATTCACCAACAAATAACAACAATGCTGCTAGCGGATTATTTCTCAAGCGAGGCTTCATCTCAACGGTTAGCTCGGTGGT
AACGTGTGTCACCACACTGCTGATGCTGATCGAATAAACTGATATACAAGTGCCAGTTGACTGTAGTTTAATTATGTGTATGATATTTGCTGTGTATGGCCTCTATTTAT
GTCTTTTTTGCTATGAAATAAGTTTCCCATTTGAGTTGTGAGATGATGGGGCATTGTATATAAGTCTACAGACAGTTACATCGTTTAGTAAATTTTTTTTTTTTTGGCAT
ATAACCCAACCCTTTGTTAATGATATATAAGAAGAGCCTCATGGATGTTAATGGAGAAAAGTGAAAGAATGATTAAATAACTGACAGATTGTATAATAAATTTAATACAT
GGAGAAACCGGA
Protein sequenceShow/hide protein sequence
MPISLICERWRVMASIAVLALLLVFHSAKANDFNDLLAPLLSPIFDNVCKEVDCGKGKCEPSGNGTSFSFECDCDSGWKRSLFDDDDEDDEDDSHFKFLPCVIPKCNLTH
SCSSAPPPGVQTKPRTNQSILDPCSWVDCGGGSCNKTSPLTYKCDCLEGYYNLLNITSFGCFKDCSIGLDCKKLGIPVSNSTSTTSNSPTNNNNAASGLFLKRGFISTVS
SVVTCVTTLLMLIE