CuGenDBv2

CSPI07G20220 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G20220
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Unknown protein
Genome location: Chr7:17626755..17630702
RNA-Seq Expression: CSPI07G20220
Synteny: CSPI07G20220
Gene Ontology terms: NA
InterPro domains: NA
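The genome location field above can be split into its parts programmatically. The following is a minimal sketch; the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record does not confirm this convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("Chr7:17626755..17630702"))
```

Under the inclusive-coordinate assumption, the gene region spans 3948 bp.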


Homology
GenBank top hits (e value, % identity, alignment)
KAA0058721.1 uncharacterized protein E6C27_scaffold339G001910 [Cucumis melo var. makuwa] | e value: 2.4e-91 | % identity: 82.28
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR
        MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD                                  GGGGGGGFVDASDAARSKIAGKSEVVATMKR
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR

Query:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT
        PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKK+KLDLESQQA +MVVTSAV GEANSNQ QQLQTP RS+CSTTPIGADASYQLT
Subjt:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT

Query:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALY
        MPNVSCKLQEI TLGTVRLLPDLNLPFQEDSSTEALY
Subjt:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALY

XP_004135872.3 uncharacterized protein LOC101212535 [Cucumis sativus] | e value: 2.7e-98 | % identity: 85.89
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD---------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMK
        MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD                                  GGGGGGGGFVDASDAARSKIAGKSEVVATMK
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD---------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQL
        RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQL
Subjt:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQL

Query:  TMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
        TMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
Subjt:  TMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

XP_008461167.1 PREDICTED: uncharacterized protein LOC103499830 [Cucumis melo] | e value: 5.8e-93 | % identity: 82.5
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR
        MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD                                  GGGGGGGFVDASDAARSKIAGKSEVVATMKR
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR

Query:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT
        PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKK+KLDLESQQA +MVVTSAV GEANSNQ QQLQTP RS+CSTTPIGADASYQLT
Subjt:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT

Query:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
        MPNVSCKLQEI TLGTVRLLPDLNLPFQEDSSTEALYRMS
Subjt:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

XP_038896159.1 uncharacterized protein LOC120084450 isoform X1 [Benincasa hispida] | e value: 3.7e-79 | % identity: 73.09
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDCGG---------------------------------GGGGGGFVDASDAARSKIAGKSEVVATMK
        MSD EWVDVALSDDSLVVDLLLRLNRPPSPPLPL+                                        GGFVDASDAARSKIAGKSEVVAT+K
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDCGG---------------------------------GGGGGGFVDASDAARSKIAGKSEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIG-------
        RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQA + +VTSAVL EANS+Q  QLQ PPRS+C+TT IG       
Subjt:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIG-------

Query:  -ADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
          DASYQLT+PNVSCKLQEI TLGTVRLLPDLNLPFQEDS TEALYRMS
Subjt:  -ADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

XP_038896160.1 uncharacterized protein LOC120084450 isoform X2 [Benincasa hispida] | e value: 7.1e-67 | % identity: 66.27
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDCGG---------------------------------GGGGGGFVDASDAARSKIAGKSEVVATMK
        MSD EWVDVALSDDSLVVDLLLRLNRPPSPPLPL+                                        GGFVDASDAARS             
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDCGG---------------------------------GGGGGGFVDASDAARSKIAGKSEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIG-------
             KTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQA + +VTSAVL EANS+Q  QLQ PPRS+C+TT IG       
Subjt:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIG-------

Query:  -ADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
          DASYQLT+PNVSCKLQEI TLGTVRLLPDLNLPFQEDS TEALYRMS
Subjt:  -ADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K8M7 Uncharacterized protein | e value: 1.3e-98 | % identity: 85.89
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD---------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMK
        MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD                                  GGGGGGGGFVDASDAARSKIAGKSEVVATMK
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD---------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMK

Query:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQL
        RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQL
Subjt:  RPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQL

Query:  TMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
        TMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
Subjt:  TMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

A0A1S3CDM0 uncharacterized protein LOC103499830 | e value: 2.8e-93 | % identity: 82.5
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR
        MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD                                  GGGGGGGFVDASDAARSKIAGKSEVVATMKR
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR

Query:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT
        PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKK+KLDLESQQA +MVVTSAV GEANSNQ QQLQTP RS+CSTTPIGADASYQLT
Subjt:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT

Query:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
        MPNVSCKLQEI TLGTVRLLPDLNLPFQEDSSTEALYRMS
Subjt:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

A0A5D3CK05 Uncharacterized protein | e value: 1.2e-91 | % identity: 82.28
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR
        MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD                                  GGGGGGGFVDASDAARSKIAGKSEVVATMKR
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLD--------------------------------CGGGGGGGGFVDASDAARSKIAGKSEVVATMKR

Query:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT
        PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKK+KLDLESQQA +MVVTSAV GEANSNQ QQLQTP RS+CSTTPIGADASYQLT
Subjt:  PRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLT

Query:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALY
        MPNVSCKLQEI TLGTVRLLPDLNLPFQEDSSTEALY
Subjt:  MPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALY

A0A6J1GPA3 uncharacterized protein LOC111456207 isoform X1 | e value: 2.5e-65 | % identity: 62.83
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLD---------------------------------------CGGGGGG--------GGFVDA
        MSD EWV+VALSDDSLVVDLLLRLNRPP      PPL LD                                        GGGGGG        GGFVDA
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLD---------------------------------------CGGGGGG--------GGFVDA

Query:  SDAARSKIAGKSEVVATMKRPRKKK-TLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQ
        SDAARSKI  KSEVV TMKRPRKKK TLGELKEEE LLLKERRSLKDALATLR++VEKQR +NGSLKKMKL+LE QQ       S V  E NS+QS QLQ
Subjt:  SDAARSKIAGKSEVVATMKRPRKKK-TLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQ

Query:  TPPRSMCSTTPI--------GADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
          PRS+C+TTPI        G DASYQLT+PNVSCKLQE+ TLGTVRLLPDLNLPFQ+DS  EALYRMS
Subjt:  TPPRSMCSTTPI--------GADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

A0A6J1GQQ9 uncharacterized protein LOC111456207 isoform X2 | e value: 1.0e-66 | % identity: 63.06
Query:  MSDLEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLD---------------------------------------CGGGGGG--------GGFVDA
        MSD EWV+VALSDDSLVVDLLLRLNRPP      PPL LD                                        GGGGGG        GGFVDA
Subjt:  MSDLEWVDVALSDDSLVVDLLLRLNRPPS-----PPLPLD---------------------------------------CGGGGGG--------GGFVDA

Query:  SDAARSKIAGKSEVVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQT
        SDAARSKI  KSEVV TMKRPRKKKTLGELKEEE LLLKERRSLKDALATLR++VEKQR +NGSLKKMKL+LE QQ       S V  E NS+QS QLQ 
Subjt:  SDAARSKIAGKSEVVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQT

Query:  PPRSMCSTTPI--------GADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
         PRS+C+TTPI        G DASYQLT+PNVSCKLQE+ TLGTVRLLPDLNLPFQ+DS  EALYRMS
Subjt:  PPRSMCSTTPI--------GADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15800.1 unknown protein | e value: 2.0e-11 | % identity: 37.78
Query:  SPPLPLDCGG-----GGGGG------------GFVDASDAARSKIAGKSEVVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNG
        SP  PL   G     GGGGG            G V  S+A RSKI   S   +  KR RKKKTL +LKEEE +LLKER  L++ LAT++  +++QRA N 
Subjt:  SPPLPLDCGG-----GGGGG------------GFVDASDAARSKIAGKSEVVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNG

Query:  SLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQL
        SLKK++ + +    +  ++    +   N++  + L
Subjt:  SLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQL

AT1G80610.1 unknown protein | e value: 8.8e-15 | % identity: 36.96
Query:  MSDLEWVDVALSDDSLVVDLLLRL------NRPPSPPLPL-----------------------------DCGGGGGGGG----------------FVDAS
        MS   W+ VA+SDDS+V + LLRL      NR  + PL L                                GGGG GG                 V  S
Subjt:  MSDLEWVDVALSDDSLVVDLLLRL------NRPPSPPLPL-----------------------------DCGGGGGGGG----------------FVDAS

Query:  DAARSKIAGKSEVVAT-----MKRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKM
        +  RSKI+  S +  T      KR RKKKTL ELKEEE++LLKE   LK+ LA +R  +E+QRA N +LKKMK + +S  + K+
Subjt:  DAARSKIAGKSEVVAT-----MKRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNGSLKKMKLDLESQQATKM

AT4G32030.1 unknown protein | e value: 1.8e-07 | % identity: 30.46
Query:  PSPPLPLDCGGGGGGG-------GFVD--------ASDAARSKIAGKSEVVATM-KRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNG
        P  PL    G G GGG       GF D         S  + SK+   +E+ +   KR +K+K+  ELK EE L LKER  L+  +A+LR + ++Q   N 
Subjt:  PSPPLPLDCGGGGGGG-------GFVD--------ASDAARSKIAGKSEVVATM-KRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMNG

Query:  SLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
         LK++KLDL S + T       +      +Q ++LQ                       + SCK  +    G+  +LPDLN+   E+   E LY  S
Subjt:  SLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS


Sequences
CDS sequence
ATGTCTGACCTTGAATGGGTCGACGTTGCTCTCTCCGATGACTCTCTCGTCGTCGACTTACTTCTTCGCCTCAACCGTCCTCCCTCTCCCCCACTCCCTCTCGACTGCGG
CGGCGGCGGTGGCGGCGGCGGATTCGTCGACGCATCCGACGCCGCAAGATCTAAGATTGCTGGTAAGAGTGAAGTAGTTGCAACAATGAAGAGGCCAAGAAAGAAAAAAA
CATTAGGAGAACTTAAAGAGGAGGAAGTTTTGCTATTAAAGGAAAGGAGAAGCTTGAAAGATGCCTTGGCTACCTTGAGGCTTTCCGTGGAAAAACAAAGGGCTATGAAT
GGAAGCTTGAAGAAAATGAAGCTTGATCTCGAATCACAGCAAGCGACCAAAATGGTTGTAACATCTGCTGTTCTGGGGGAAGCAAATTCCAACCAATCACAACAACTGCA
GACGCCACCCAGATCAATGTGCAGCACAACGCCCATTGGAGCCGACGCTTCTTACCAATTAACAATGCCAAATGTTTCTTGCAAACTACAAGAGATTGCAACTTTAGGGA
CCGTTCGTTTATTACCCGATCTTAATTTGCCTTTTCAGGAGGATTCTAGCACCGAGGCCCTGTACCGAATGAGCTAG
mRNA sequence
ATGTCTGACCTTGAATGGGTCGACGTTGCTCTCTCCGATGACTCTCTCGTCGTCGACTTACTTCTTCGCCTCAACCGTCCTCCCTCTCCCCCACTCCCTCTCGACTGCGG
CGGCGGCGGTGGCGGCGGCGGATTCGTCGACGCATCCGACGCCGCAAGATCTAAGATTGCTGGTAAGAGTGAAGTAGTTGCAACAATGAAGAGGCCAAGAAAGAAAAAAA
CATTAGGAGAACTTAAAGAGGAGGAAGTTTTGCTATTAAAGGAAAGGAGAAGCTTGAAAGATGCCTTGGCTACCTTGAGGCTTTCCGTGGAAAAACAAAGGGCTATGAAT
GGAAGCTTGAAGAAAATGAAGCTTGATCTCGAATCACAGCAAGCGACCAAAATGGTTGTAACATCTGCTGTTCTGGGGGAAGCAAATTCCAACCAATCACAACAACTGCA
GACGCCACCCAGATCAATGTGCAGCACAACGCCCATTGGAGCCGACGCTTCTTACCAATTAACAATGCCAAATGTTTCTTGCAAACTACAAGAGATTGCAACTTTAGGGA
CCGTTCGTTTATTACCCGATCTTAATTTGCCTTTTCAGGAGGATTCTAGCACCGAGGCCCTGTACCGAATGAGCTAGGCAGATTCAACAGAATGAGCATATATTAAGCTT
CCTTAACCGAGAGACAAAAATTGACAACGTGGACGAGGAGAGGGCATTTGCTTCTTAATAAGATGTTCCTAAATCCTAACCCTGCCATAGGCTTCAGTTACTTGTAACTG
TAATTAACAAAACCAATTTTCTCAGCTGCATATTCATATAACATCCTTTTTTTCCCTTTCCATAGTACCAAAAAAAAAAAAATTTGAGAAGAGAACCACGTCGAGACCCC
TGAAAAACCCTTTAGCTCTTTAGTTATATTATTCATTCCACAGGTTTCTGCATATGCTTATATTCTTTGGAGCCAGACCCTCTCCCTCTCCGTCATCTCAGACTCAAAAG
AGATTGATTGCAAACACTGTAATTATTATTATTAGATAAAAAGCAAACAAGATTCTGTGTTCTCTACATCTAATTTTTAATGATAGAATTGCTATGTCATTTCTTCAATC
CCTCTTTTAATAATGTGATCAGTGAGCATTCTTTGCACAATTTACAGGCCAACAGAAGCTTCAAACTTGGAAAATTCAATAAACAGTAATGGCC
Protein sequence
MSDLEWVDVALSDDSLVVDLLLRLNRPPSPPLPLDCGGGGGGGGFVDASDAARSKIAGKSEVVATMKRPRKKKTLGELKEEEVLLLKERRSLKDALATLRLSVEKQRAMN
GSLKKMKLDLESQQATKMVVTSAVLGEANSNQSQQLQTPPRSMCSTTPIGADASYQLTMPNVSCKLQEIATLGTVRLLPDLNLPFQEDSSTEALYRMS
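
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS. The sketch below is illustrative only: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string to protein, stopping at the first stop codon.

    Whitespace (such as the line wrapping used in the record) is removed
    before translation. Trailing bases that do not form a full codon are
    ignored.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of the CDS listed in this record.
    print(translate("ATGTCTGACCTTGAATGG"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.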