; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G14000 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G14000
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposase
Genome locationChr5:14163390..14164313
RNA-Seq ExpressionCSPI05G14000
SyntenyCSPI05G14000
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR004242 - Transposon, En/Spm-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031737548.1 uncharacterized protein LOC116402438 [Cucumis sativus]3.8e-14585.71Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVDMKW DFGSEP NLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPW CMKR                                   +GVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNH YRRQKKSFNGKKELDTIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

XP_031741731.1 uncharacterized protein LOC116403926 [Cucumis sativus]7.7e-14685.71Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVDMKWPDFGSEPRNLRLALS DGVNPHGDMSSKYSCWPVVMVIYNLPPW CMKR                                   +GVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNH YRRQKKSFNGKKELDTIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKD EFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

XP_031742172.1 uncharacterized protein LOC116404095 [Cucumis sativus]6.1e-14383.67Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        +SPAWKLVDMKWPDF SEPRNL LALS DGVNPHGDMSSKYSCWPVV+VIYNLPPW CMKR                                   +GVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREEPFNLRS+LLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNH YRRQKKSFNGKKELDTIP+PLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKD+EFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

XP_031742313.1 uncharacterized protein LOC116404153 [Cucumis sativus]3.6e-14384.69Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVDMKWPDFGSE RNLRLALS D VNPHGDMSSKYSCW VVM+IYNLPPW CMKR                                   +GVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNH YRRQKKSFNGKKELDTIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

XP_031744385.1 uncharacterized protein LOC116405016 [Cucumis sativus]1.6e-15486.64Show/hide
Query:  MVSYDIQPDSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-------------------------------
        MVSYDIQPDSPAWKLVDMKW DFGSEP NLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPW CMKR                               
Subjt:  MVSYDIQPDSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-------------------------------

Query:  ----NGVECYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEP
            +GVECYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEP
Subjt:  ----NGVECYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEP

Query:  LSGEDVYLKLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELA
        L GEDVYLKLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL TFLDIPGKSKDGLNARRDLVDLKLRPELA
Subjt:  LSGEDVYLKLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELA

Query:  PIRICYS
        PIRICYS
Subjt:  PIRICYS

TrEMBL top hitse value%identityAlignment
A0A5A7UY50 Transposase7.5e-13981.63Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVD KWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWP+VMVIYNLPPW CMKR                                   NGVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREE FNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRL++GKK+AYLGHRRFLAR+H YRRQKKSFNGKKEL TIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKDLEF +GKK HK   MNRS+KICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

A0A5D3CA82 Transposase7.5e-13981.63Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVD KWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWP+VMVIYNLPPW CMKR                                   NGVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREE FNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRL++GKK+AYLGHRRFLAR+H YRRQKKSFNGKKEL TIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKDLEF +GKK HK   MNRS+KICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

A0A5D3DLB9 Transposase7.5e-13981.63Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVD KWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWP+VMVIYNLPPW CMKR                                   NGVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREE FNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRL++GKK+AYLGHRRFLAR+H YRRQKKSFNGKKEL TIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKDLEF +GKK HK   MNRS+KICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

A0A5D3DN97 Transposase7.5e-13981.63Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVD KWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWP+VMVIYNLPPW CMKR                                   NGVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREE FNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRL++GKK+AYLGHRRFLAR+H YRRQKKSFNGKKEL TIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKDLEF +GKK HK   MNRS+KICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

A0A5D3E310 Transposase7.5e-13981.63Show/hide
Query:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE
        DSPAWKLVD KWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWP+VMVIYNLPPW CMKR                                   NGVE
Subjt:  DSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKR-----------------------------------NGVE

Query:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL
        CYDAYREE FNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRL++GKK+AYLGHRRFLAR+H YRRQKKSFNGKKEL TIPEPLSGEDVYL
Subjt:  CYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGYKACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYL

Query:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI
        KLKDLEF +GKK HK   MNRS+KICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNIL T LDIPGKSKDGLNARRDLVDLKLRPELAPI
Subjt:  KLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHCLDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAAGTTACGACATCCAGCCAGACTCTCCAGCATGGAAGTTAGTAGACATGAAATGGCCAGACTTCGGTTCTGAACCCAGAAATCTTCGTTTAGCATTGTCAGCCGA
TGGAGTAAATCCTCATGGTGACATGAGTTCTAAATACAGTTGTTGGCCGGTAGTGATGGTTATTTACAATCTTCCACCATGGTCGTGTATGAAAAGAAATGGTGTTGAAT
GTTATGATGCTTATCGAGAAGAACCATTCAACTTAAGGTCAGTTTTGTTGTGGACAATCAATGATTTTCCTGCATATGGTAACCTTAGTGGATGTTGTGTGAAAGGGTAT
AAAGCATGCCCAATTTGTGGAGATAATACAAATTCTATAAGGTTAAAGTATGGGAAGAAAATGGCATACCTTGGACATCGTAGATTTTTGGCACGAAATCATCTGTATCG
ACGACAAAAGAAGTCATTCAATGGTAAAAAAGAACTTGATACAATTCCAGAGCCACTTTCTGGGGAGGATGTGTATTTAAAATTGAAAGATCTTGAATTTTCTAGAGGGA
AGAAGAACCATAAGAAACGGTTGATGAACAGAAGTGACAAAATTTGTTGGAATAGATTATCTTCTTTTTTTGAGTTGCCATACTGGAAGGATCTTCATGTTAGACATTGT
TTAGATGTGATGCACATTGAAAAAAATGTTTGCATGAATATCTTAGATACATTTCTTGATATTCCTGGAAAAAGTAAGGATGGATTGAATGCTAGACGCGATTTAGTTGA
TCTAAAACTTCGACCAGAGCTTGCCCCTATCAGAATATGTTACTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAAGTTACGACATCCAGCCAGACTCTCCAGCATGGAAGTTAGTAGACATGAAATGGCCAGACTTCGGTTCTGAACCCAGAAATCTTCGTTTAGCATTGTCAGCCGA
TGGAGTAAATCCTCATGGTGACATGAGTTCTAAATACAGTTGTTGGCCGGTAGTGATGGTTATTTACAATCTTCCACCATGGTCGTGTATGAAAAGAAATGGTGTTGAAT
GTTATGATGCTTATCGAGAAGAACCATTCAACTTAAGGTCAGTTTTGTTGTGGACAATCAATGATTTTCCTGCATATGGTAACCTTAGTGGATGTTGTGTGAAAGGGTAT
AAAGCATGCCCAATTTGTGGAGATAATACAAATTCTATAAGGTTAAAGTATGGGAAGAAAATGGCATACCTTGGACATCGTAGATTTTTGGCACGAAATCATCTGTATCG
ACGACAAAAGAAGTCATTCAATGGTAAAAAAGAACTTGATACAATTCCAGAGCCACTTTCTGGGGAGGATGTGTATTTAAAATTGAAAGATCTTGAATTTTCTAGAGGGA
AGAAGAACCATAAGAAACGGTTGATGAACAGAAGTGACAAAATTTGTTGGAATAGATTATCTTCTTTTTTTGAGTTGCCATACTGGAAGGATCTTCATGTTAGACATTGT
TTAGATGTGATGCACATTGAAAAAAATGTTTGCATGAATATCTTAGATACATTTCTTGATATTCCTGGAAAAAGTAAGGATGGATTGAATGCTAGACGCGATTTAGTTGA
TCTAAAACTTCGACCAGAGCTTGCCCCTATCAGAATATGTTACTCCTAA
Protein sequenceShow/hide protein sequence
MVSYDIQPDSPAWKLVDMKWPDFGSEPRNLRLALSADGVNPHGDMSSKYSCWPVVMVIYNLPPWSCMKRNGVECYDAYREEPFNLRSVLLWTINDFPAYGNLSGCCVKGY
KACPICGDNTNSIRLKYGKKMAYLGHRRFLARNHLYRRQKKSFNGKKELDTIPEPLSGEDVYLKLKDLEFSRGKKNHKKRLMNRSDKICWNRLSSFFELPYWKDLHVRHC
LDVMHIEKNVCMNILDTFLDIPGKSKDGLNARRDLVDLKLRPELAPIRICYS