; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G009740 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G009740
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDUF4050 family protein
Genome locationchr06:19723462..19724878
RNA-Seq ExpressionLsi06G009740
SyntenyLsi06G009740
Gene Ontology termsNA
InterPro domainsIPR025124 - Domain of unknown function DUF4050


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050173.1 DUF4050 family protein [Cucumis melo var. makuwa]8.9e-5365.22Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLS+SNV G    SS      +    ++ C    +
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
         + ++   SG          KL  + +Q+           K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

XP_004150590.1 uncharacterized protein LOC101203426 isoform X2 [Cucumis sativus]2.6e-5265.59Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNV G    SS        E   H      Q
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q

Query:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
             I  G +           K   + +Q+           K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

XP_008443992.1 PREDICTED: uncharacterized protein LOC103487445 [Cucumis melo]2.2e-5165.05Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSIST NLTLS+SNV G    SS        E   H      Q
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q

Query:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
             I  G +           KL  + +Q+           K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

XP_022144929.1 uncharacterized protein LOC111014486 [Momordica charantia]1.3e-5164.13Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCCTKPTPIIAVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTL+ SNVGG    S+  P +  +            
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                    ++L  Q +        K+   TQ     K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

XP_038879580.1 uncharacterized protein LOC120071390 [Benincasa hispida]1.2e-5266.3Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGG    SS        E   H     + 
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
        ++  RL   G          K   + +Q+           K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWE EGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

TrEMBL top hitse value%identityAlignment
A0A1S3B9E5 uncharacterized protein LOC1034874451.1e-5165.05Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSIST NLTLS+SNV G    SS        E   H      Q
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q

Query:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
             I  G +           KL  + +Q+           K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

A0A5A7U2N6 DUF4050 family protein4.3e-5365.22Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLS+SNV G    SS      +    ++ C    +
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
         + ++   SG          KL  + +Q+           K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

A0A5D3C7W3 DUF4050 family protein1.1e-5165.05Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSIST NLTLS+SNV G    SS        E   H      Q
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCF--Q

Query:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
             I  G +           KL  + +Q+           K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

A0A6J1CTQ5 uncharacterized protein LOC1110144866.2e-5264.13Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCCTKPTPIIAVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTL+ SNVGG    S+  P +  +            
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                    ++L  Q +        K+   TQ     K    WRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

A0A6J1H9M2 uncharacterized protein LOC1114618003.4e-5063.04Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT S SNVGG    S   P +  +            
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                     +LL  + +L       +    ++    K    WRATYDSLLGTRQPFPH IPLSEMVNFLVEVWEQEGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15350.1 unknown protein2.4e-1936.9Show/hide
Query:  GCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQ
        GC GC    + T     D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S+SS   SN T    +       S+  P               
Subjt:  GCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQ

Query:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                   GL +    + + + KD+        Q   L      W  ATYDSLLG+ + FP PIPL+EMV+FLV++WEQEGLYD
Subjt:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

AT1G15350.2 unknown protein2.4e-1936.9Show/hide
Query:  GCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQ
        GC GC    + T     D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S+SS   SN T    +       S+  P               
Subjt:  GCFGCCT--KPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQ

Query:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                   GL +    + + + KD+        Q   L      W  ATYDSLLG+ + FP PIPL+EMV+FLV++WEQEGLYD
Subjt:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWR-ATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

AT4G32342.1 unknown protein4.0e-2742.47Show/hide
Query:  CFGCCTKPTP-IIAVDEPSKGLRIQGRVVKKPSI-SDGFWSTSTCDLD-NSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQ
        CFGCC +    ++ VDEPSKGL+IQG++VKK S  SD FWSTSTCD+D N TIQSQ S                            QC    ST F    
Subjt:  CFGCCTKPTP-IIAVDEPSKGLRIQGRVVKKPSI-SDGFWSTSTCDLD-NSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQ

Query:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLA-PWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLY
            G+ L                N  R+Q     T+  CL  + A  W +TYDSLL T + FP PIPL EMV+FLV+VWE+EGLY
Subjt:  VSFSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLA-PWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLY

AT5G25360.1 unknown protein4.3e-3746.74Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCC KP  I+AVDEPSKGLRIQGR+VKKPS+S+ FWSTSTC++DNST+QSQRS+SSIS +N T + ++                    T F      
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                 GL +    +++ L     QK  +      + +    W ATY+SLLG  + F  PIPL EMV+FLV+VWEQEGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD

AT5G25360.2 unknown protein4.3e-3746.74Show/hide
Query:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS
        GCFGCC KP  I+AVDEPSKGLRIQGR+VKKPS+S+ FWSTSTC++DNST+QSQRS+SSIS +N T + ++                    T F      
Subjt:  GCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVS

Query:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD
                 GL +    +++ L     QK  +      + +    W ATY+SLLG  + F  PIPL EMV+FLV+VWEQEGLYD
Subjt:  FSGIRLGCSGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGCAGGGGTTGTTTTGGATGCTGCACTAAACCCACGCCTATTATTGCTGTGGATGAGCCATCTAAAGGATTAAGAATTCAAGGACGAGTTGTTAAGAAACCTAGCATATC
TGATGGTTTTTGGAGCACAAGCACGTGCGATTTGGATAATAGCACCATTCAGTCTCAACGAAGCATTTCGTCTATCAGTACATCAAACCTCACTCTCAGTCATAGCAATG
TTGGTGGCAGACCCATCATCTCTTCATTAGAACCCGGTCAATGTAGGGACGAGCGATCTACTCATTTTTGTTGCTTTCAGGTCTCCTTCTCTGGAATCAGACTAGGCTGC
AGTGGATTGGAAGTAGTACTACTAAGGCAACGGAAGAAACTCAACAAAGACAGAAAGCAAAAATCAGGTCAGTTCACTCAATCTTTCTGCCTTTATAAACAGTTGGCACC
TTGGCGTGCAACATATGACAGTTTACTGGGTACAAGACAGCCTTTTCCCCATCCAATTCCTTTGTCTGAAATGGTGAACTTTCTTGTGGAAGTATGGGAACAGGAGGGCC
TATATGATTGA
mRNA sequenceShow/hide mRNA sequence
TGCAGGGGTTGTTTTGGATGCTGCACTAAACCCACGCCTATTATTGCTGTGGATGAGCCATCTAAAGGATTAAGAATTCAAGGACGAGTTGTTAAGAAACCTAGCATATC
TGATGGTTTTTGGAGCACAAGCACGTGCGATTTGGATAATAGCACCATTCAGTCTCAACGAAGCATTTCGTCTATCAGTACATCAAACCTCACTCTCAGTCATAGCAATG
TTGGTGGCAGACCCATCATCTCTTCATTAGAACCCGGTCAATGTAGGGACGAGCGATCTACTCATTTTTGTTGCTTTCAGGTCTCCTTCTCTGGAATCAGACTAGGCTGC
AGTGGATTGGAAGTAGTACTACTAAGGCAACGGAAGAAACTCAACAAAGACAGAAAGCAAAAATCAGGTCAGTTCACTCAATCTTTCTGCCTTTATAAACAGTTGGCACC
TTGGCGTGCAACATATGACAGTTTACTGGGTACAAGACAGCCTTTTCCCCATCCAATTCCTTTGTCTGAAATGGTGAACTTTCTTGTGGAAGTATGGGAACAGGAGGGCC
TATATGATTGAAAATGCTTTTGTTTTGGATACATTCCTTGAATCTTTAGGAAGCTTCTCTGTACGGATTGCAAAAGGGAAAAAAAAAGGGTGTTTTTGTTTTTCTTCATC
AATGATCCTACAATTTTCAGCTGTACAAATGTATTTAACACTATTGTCATTTTAAGACCAGAGACAGCAGAAGAAGCGACTGCCTTTAATTGTGTTAAGTCAATATATGG
GCATTCTTGAAAATATGGTATCATATAGCCACCATTTTCAATGTATTCTCCTGTCTTTTTCTCCACTGTTCATTCTATGAATAAGAGCTTGAAAATATCCATCATTCACT
GTTTCTGTCTGCTGCTCAATCATCAATGAAAAAATTTGATGTATCTGAGAACAACTGCAAGTTTGAGTTGTATTATGAATTTATATCTATATTTTGATGGTTTTAATTGG
ATTTTTTGCATGGAATTTTTAAAGATATTATTCATGGTATGGTTTTTCATTGGATTGATCTTTTGGAAGAAATGATATTAGAAGGTGGCCCAAGTGAAAGATTG
Protein sequenceShow/hide protein sequence
CRGCFGCCTKPTPIIAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGGRPIISSLEPGQCRDERSTHFCCFQVSFSGIRLGC
SGLEVVLLRQRKKLNKDRKQKSGQFTQSFCLYKQLAPWRATYDSLLGTRQPFPHPIPLSEMVNFLVEVWEQEGLYD