; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G025783 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G025783
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionProtein of unknown function (DUF1068)
Genome locationGy14Chr2:33253825..33257711
RNA-Seq ExpressionCsGy2G025783
SyntenyCsGy2G025783
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652279.1 hypothetical protein Csa_022101 [Cucumis sativus]1.55e-167100Show/hide
Query:  MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLS
        MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLS
Subjt:  MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLS

Query:  TCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARER
        TCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARER
Subjt:  TCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARER

Query:  AEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        AEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
Subjt:  AEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_004151898.1 uncharacterized protein LOC101219040 [Cucumis sativus]2.79e-129100Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_008455917.1 PREDICTED: uncharacterized protein LOC103495987 isoform X1 [Cucumis melo]3.18e-11993.65Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTEEL+NTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_008455918.1 PREDICTED: uncharacterized protein LOC103495987 isoform X2 [Cucumis melo]1.15e-11089.95Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTE        DCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_023534557.1 uncharacterized protein LOC111796098 isoform X1 [Cucurbita pepo subsp. pepo]3.42e-11090.11Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKP GSCSPGLTKVGL  +ALCIAAYILGPPLYWHFMEGL   SSSS STCPPCFCDCSS TDFAFT+E ENTTFRDCVKHDSGMNEETE++FAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        EELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE RARQRGWRD+IVTS
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

TrEMBL top hitse value%identityAlignment
A0A0A0LQC2 Uncharacterized protein1.35e-129100Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

A0A1S3C1Z9 uncharacterized protein LOC103495987 isoform X11.54e-11993.65Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTEEL+NTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

A0A1S3C2Q4 uncharacterized protein LOC103495987 isoform X25.58e-11189.95Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTE        DCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

A0A6J1G5W1 uncharacterized protein LOC111451120 isoform X15.51e-10989.01Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKP GSCSPGLTKVGL  +ALCIAAYILGPPLYWHF+EGL   SSSS STCPPCFCDCSS TDFAFT+E ENTTFRDCVKHDSGMNEETE+ FAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        EELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE RARQRGWRD+IV S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

A0A6J1I131 uncharacterized protein LOC111469880 isoform X19.10e-10887.91Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKP GSCSPGLTKVGL  +ALCIAAYILGPPLYWHFMEGL   SSS  STCPPCFCDCSS TDFAFT+E ENTTFRDCVKHDSGMNEETE++FAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        E+LKLREA+A+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE RARQRGWRD+IV S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)7.6e-5565.03Show/hide
Query:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH
        K+GL L+ L +A YILGPPLYWH  E L A S+SS   CP C C+CS+ +     +EL N +F DC KHD  +NE+TEKN+AELL+EELKLREAE+LE H
Subjt:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRD
        +RAD+ LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  LA QK+LT+ WE RARQ+GWR+
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRD

AT2G32580.1 Protein of unknown function (DUF1068)6.3e-4958.93Show/hide
Query:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH
        KVGL L+AL +  YILGPPLYWH  E L    + S ++C  C CDCSSL        L N +F DC K D  +NE+TEKN+AELL+EELK REA ++E H
Subjt:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        +R D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QK+LT++WE RARQ+G++D    S
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

AT2G32580.2 Protein of unknown function (DUF1068)5.3e-3264.42Show/hide
Query:  DCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDN
        +C K D  +NE+TEKN+AELL+EELK REA ++E H+R D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QK+LT++WE RARQ+G++D 
Subjt:  DCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDN

Query:  IVTS
           S
Subjt:  IVTS

AT4G04360.1 Protein of unknown function (DUF1068)7.9e-4455.36Show/hide
Query:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH
        KV   +M LCI AYI GP LYWH  E     + S  S+CPPC CDCSS    +  + L N +F DC++H+ G +EE+E +F E+++EELKLREA+A E+ 
Subjt:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
         RAD  LL+AKK  SQYQKEADKC+ GMETCE ARE+AEA L  Q+RL+ +WE RARQ GW++  V S
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

AT4G30996.1 Protein of unknown function (DUF1068)3.7e-3347.8Show/hide
Query:  LFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDC-SSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRR
        L + A+  A  + GP LYW F +G    S+ + S CPPC CDC   L+       L N +  DC   D  + +E EK F +LL+EELKL+EA A E+ R 
Subjt:  LFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDC-SSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRR

Query:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGW
         +++L EAK++ SQYQKEA+KCN+  E CE+ARERAEA L  ++++T+LWE RARQ GW
Subjt:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTGAAGGTAAATAAATCAAACATAGTATTTAATCCAAAGTTTAATATTTATTCCACTTTCCTCTCTACTAAGTATAAACACATCTTTCCCCAACGGGTTGGTTC
CCATTTTCCATTTCTGGACTACACCAAGGTCTTGAAAATGGCAGTGAAGCCGGTGGGTTCCTGCTCTCCGGGCCTGACGAAGGTGGGATTGTTTTTAATGGCTCTTTGTA
TAGCAGCCTACATTCTGGGTCCGCCTCTATACTGGCATTTCATGGAGGGCTTGCCCGCTTTCTCTTCTTCCTCTCTCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCT
TCTCTCACTGACTTTGCCTTCACTGAAGAGCTCGAAAATACCACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGAGGAAACAGAAAAGAATTTTGCAGAGTT
ATTGTCTGAGGAACTGAAACTGAGGGAAGCTGAAGCTTTGGAGAATCATCGGCGTGCCGACATATCTCTACTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAG
CAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGAGCTGAAGCTACATTAGCTTCACAAAAGAGGCTAACAGCATTATGGGAAACTAGGGCTCGC
CAAAGAGGATGGAGAGACAACATTGTTACATCCCGTGGTACCATCCAAGGCTCATAA
mRNA sequenceShow/hide mRNA sequence
AATTTATTTTACAATAAGATGAAGCTGAAGGTAAATAAATCAAACATAGTATTTAATCCAAAGTTTAATATTTATTCCACTTTCCTCTCTACTAAGTATAAACACATCTT
TCCCCAACGGGTTGGTTCCCATTTTCCATTTCTGGACTACACCAAGGTCTTGAAAATGGCAGTGAAGCCGGTGGGTTCCTGCTCTCCGGGCCTGACGAAGGTGGGATTGT
TTTTAATGGCTCTTTGTATAGCAGCCTACATTCTGGGTCCGCCTCTATACTGGCATTTCATGGAGGGCTTGCCCGCTTTCTCTTCTTCCTCTCTCTCAACTTGCCCACCT
TGCTTTTGTGACTGTTCTTCTCTCACTGACTTTGCCTTCACTGAAGAGCTCGAAAATACCACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGAGGAAACAGA
AAAGAATTTTGCAGAGTTATTGTCTGAGGAACTGAAACTGAGGGAAGCTGAAGCTTTGGAGAATCATCGGCGTGCCGACATATCTCTACTAGAAGCAAAGAAGATGACAT
CTCAATATCAGAAAGAAGCAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGAGCTGAAGCTACATTAGCTTCACAAAAGAGGCTAACAGCATTA
TGGGAAACTAGGGCTCGCCAAAGAGGATGGAGAGACAACATTGTTACATCCCGTGGTACCATCCAAGGCTCATAAAAGATCTGATGCTTATTGAACGAGTTGCTTTGACA
GCAGCTTGGAATTCATTGAGCCAGAAGCTTTTCCTTTATATATTCCACTCGCATTTCTTCAAACTTCCTCCTGTATATGAAAAAGATACAAACCAATCCCAAATGTCACT
CTCCCTTCCCTCCAAGATATCATGACCCAAAACCTTTAGCAAGGTTATGCAACAAGCAAACTGGTCTGTTTCAAATTTTGGTACAACAATTAGTTAGATTACTCGTGTAA
CTTGTTTGTCACTAGAATGTTTAAGAGATTGTTGTTGTCCATCAATACTGTATAAGAGAAAGAAATTTCATGACTAATTTGAGCATTTATGTTAATAGCCTTTTATTGAG
CCTTTCTAAATAATTAGGCTTCAACAGATAAAAAAAGAATAACAAGCATAAACAAGAAGTGAAGTAGGAAGTGGAATGGAAGCAAGC
Protein sequenceShow/hide protein sequence
MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCS
SLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRAR
QRGWRDNIVTSRGTIQGS