; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G01510 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G01510
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposase
Genome locationChr5:2113042..2117213
RNA-Seq ExpressionCSPI05G01510
SyntenyCSPI05G01510
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR004242 - Transposon, En/Spm-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK08445.1 transposase [Cucumis melo var. makuwa]9.5e-12880.36Show/hide
Query:  LRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF----------------
        L +RK VRDHLYVNGIDESYK WFWHGE LPNSSFY E SKFDTHTCE+ DVGSVKEMIEVAHEEYSKDP GFEKLLIDAEKP                 
Subjt:  LRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF----------------

Query:  ----------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSKVIWY
                  D SFSELL+TLKEILP TNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANA EC ECGQSRWKNVKD +E RKQIPSKVIWY
Subjt:  ----------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSKVIWY

Query:  FPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        FP IP FKRLFRSIECAENLTWH++ERI +GKLRHPADSPAWKLVD KWP FGSEP NLRLALSADGVNPHGDMSSKY+C
Subjt:  FPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

XP_031739753.1 uncharacterized protein LOC116403284 [Cucumis sativus]1.2e-12780.35Show/hide
Query:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------
        G  +  +RKGVRDHLYVNGIDESYK WFWHGEELPNSSFYDESSKFD HTCEDQ VGSVKEMIEV HEEYSK+PTGFEKLLIDAEKP             
Subjt:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------

Query:  ---------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPS
                        TSF ELLETLKEILP TNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEF+NAIEC ECGQSRWKNVKDT+ERRKQIPS
Subjt:  ---------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPS

Query:  KVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        KVIWYFPIIP FKRLFRSIECAENLTWHSTERIN+ KLRH    PAWKLVDMKWP FG EP NLRLALSADGVNPHGDMSSKY+C
Subjt:  KVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

XP_031739988.1 uncharacterized protein LOC116403337 [Cucumis sativus]2.1e-13583.86Show/hide
Query:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCED-QDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF-----------
        G  +  +RKGVRDHLYVNGIDESYK WFWHGEELPNSSFYDESSKF  HTCED QDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKP            
Subjt:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCED-QDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF-----------

Query:  ---------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPS
                       DTSFSELLETLKEILP TNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIEC ECGQSRWKN+KDT+ERRKQIPS
Subjt:  ---------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPS

Query:  KVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        KVIWYFPIIP FKRLFRSIECAENLTWHSTERIN+GKLRHPADSPAWKLVDMKWP FGSEP NLRLALSADGVNPHGDMSSKY+C
Subjt:  KVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

XP_031742172.1 uncharacterized protein LOC116404095 [Cucumis sativus]5.7e-13382.39Show/hide
Query:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------
        G  +  NR+GVRDHLYVNGIDESYK WFWHGEELPNSSFYDESSKFD HTCED DVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKP             
Subjt:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------

Query:  --------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSK
                      DTSFSELLETLKEILP TNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIEC ECGQSRWKNVKDT+ERRKQI SK
Subjt:  --------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSK

Query:  VIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        VIWYFPIIP FKRLFRSIEC ENLTWHSTERIN+GKLRHPA+SPAWKLVDMKWP F SEP NL LALS DGVNPHGDMSSKY+C
Subjt:  VIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

XP_031742381.1 uncharacterized protein LOC116404332 [Cucumis sativus]3.7e-14085.56Show/hide
Query:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------
        G  +  +RKGVRDHLYVNGIDESYK WFWHGEELPNSSFYDESSKFD HTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKP             
Subjt:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------

Query:  --------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSK
                      DTSFSELLETLKEI+PNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDT+ERRKQIPSK
Subjt:  --------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSK

Query:  VIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        VIWYFPIIP FKRLFRSIECAENLTWHSTERIN+GKLRHPADSPAWKLVDMKWP FGSEPINLRLALSADGVNPHGDMSSKY+C
Subjt:  VIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

TrEMBL top hitse value%identityAlignment
A0A5A7TUX7 Transposase1.4e-10566.32Show/hide
Query:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGE-ELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF-----------
        G  + ++R  +RDHLYVNGIDESYK WFWHGE +LP SS Y+ESSKFDTH  E  DVG + EMIEVAHEEYSKDP  FEKLL DAEK             
Subjt:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGE-ELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF-----------

Query:  ---------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPS
                       D SFSELL+TLKEILP  NE+P S+YEAKKTLGALGM YEKIHACPN+CCLYRKE ANA EC ECG+SRWK   + +E +KQIP 
Subjt:  ---------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPS

Query:  KVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        KV+WYFP IP FKRLFRSI  A+NL WHS ER+  GKLRHPADSPAWKL+D+KWP FGSEP N+RLALSAD +NPH +MSSKY+C
Subjt:  KVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

A0A5A7U2S8 Transposase2.1e-11775.72Show/hide
Query:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------
        G  +  +RK VRDHLYVNGIDESYK WFWHG+     SFY ESSKFDTHTCE+ DVGSVKE+IEVAHEEYSKDP GFEKLLIDAEKP             
Subjt:  GTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF------------

Query:  --------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSK
                      D SFSELL+TLKEILP TNELPNSLYEAKKTLGALGMEYE+IHACPNNCCLYRKEFANA EC ECGQSRWKNVKD +E RKQ PSK
Subjt:  --------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSK

Query:  VIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHG
        VIWYFP IP FKRLFRSIECAENLTWH++ERI +GKLRHPADSPAWKLVD KW  FGSEP NLRLALS DGVNPHG
Subjt:  VIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHG

A0A5A7UKU6 Transposase7.6e-10767.03Show/hide
Query:  RNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF-----------------
        ++R  +RDHLYVNGIDESYK WFWHGE+LP SS Y+ESSKFDTH  E+ DVGS+ E IEVAHEEYSKDP  FEKLL DAEK                   
Subjt:  RNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF-----------------

Query:  ---------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSKVIWYF
                 D  FSELL+TLKEILP +NE+P S+YEAKKTLGALGM YEKIHAC N+CCLYRKE ANA EC ECG+SRWK   + +  +KQIP KV+WYF
Subjt:  ---------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSKVIWYF

Query:  PIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        P IP FKRLFRSI+ A+NL WHS ER+ +GKLRHPADSPAWKL+D+KWP FGSEP N+RLALSADG+NPHG+MSSKY+C
Subjt:  PIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

A0A5D3CA82 Transposase4.6e-12880.36Show/hide
Query:  LRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF----------------
        L +RK VRDHLYVNGIDESYK WFWHGE LPNSSFY E SKFDTHTCE+ DVGSVKEMIEVAHEEYSKDP GFEKLLIDAEKP                 
Subjt:  LRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF----------------

Query:  ----------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSKVIWY
                  D SFSELL+TLKEILP TNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANA EC ECGQSRWKNVKD +E RKQIPSKVIWY
Subjt:  ----------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSKVIWY

Query:  FPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        FP IP FKRLFRSIECAENLTWH++ERI +GKLRHPADSPAWKLVD KWP FGSEP NLRLALSADGVNPHGDMSSKY+C
Subjt:  FPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

A0A5D3CRI9 Transposase3.1e-10865.51Show/hide
Query:  IEEGTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF---------
        ++ G  + ++R  +RDHLYVNGIDESYK WFWHGE+LP SS Y+ESSKFDTH  E+ DVGS+ EMIEVAHEEYSKDP  FEKLL DA+KP          
Subjt:  IEEGTLQLRNRKGVRDHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPF---------

Query:  -----------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQI
                         D SFSELL+TLKEI P +NE+P S+YEAKKTLGALGM YEKIHACPN+CCLYRKE ANA EC ECG+SRWK   + +  +KQI
Subjt:  -----------------DTSFSELLETLKEILPNTNELPNSLYEAKKTLGALGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQI

Query:  PSKVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC
        P KV+WYFP I  FKRLFRSI+ A+NL W S ER+ +GKLRHPADSPAWKL+D+KWP FGSEP N+RLALSADG+NPHG+MSSKY+C
Subjt:  PSKVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGSEPINLRLALSADGVNPHGDMSSKYNC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATTTGGCTTGGTTTAGCTTCTAGCTTGCGGTTTGGCTCTGGTTCACTATCTTGCAGGCGCGGGCACGACTCGACTTGGCTTCGAACGGGTTTACACATGGTCGG
CTATGGTTCAGAGCGGTTGGCTTCGATTCTATGTGGTTCAACTTTTGGCATGCAGTCAACTCCAGTTTGGCGTAGATTATGTGTTTTGGCGCCGACTCAGCTTGTTTTGG
GGCGGTTCTTCCATTTTTGGATCGGTTCAAGGGAAGCTCCTGTTATTGCTTACACTGAGAAGATAATCGAAGAAGGGACTCTTCAATTGAGAAATAGAAAGGGTGTTAGA
GATCACTTGTATGTTAATGGTATTGATGAAAGTTATAAATTTTGGTTTTGGCATGGGGAAGAACTTCCTAACTCATCCTTCTATGATGAATCTTCAAAGTTTGACACCCA
TACATGTGAAGATCAGGATGTTGGAAGTGTAAAAGAAATGATTGAAGTTGCTCATGAGGAGTATTCAAAAGACCCAACTGGATTTGAGAAGTTGCTTATTGATGCTGAAA
AACCATTTGATACTAGTTTTTCAGAATTACTTGAAACTTTGAAGGAAATTCTGCCTAATACCAATGAGCTCCCGAATTCATTGTATGAAGCAAAGAAAACATTAGGTGCA
TTAGGAATGGAATACGAAAAGATTCATGCATGCCCGAATAATTGTTGTCTCTATAGAAAAGAATTTGCTAATGCAATTGAATGTCATGAATGTGGTCAATCAAGGTGGAA
AAACGTCAAGGATACCAGTGAAAGGAGAAAGCAAATTCCCTCTAAAGTGATATGGTACTTTCCAATCATTCCACTATTTAAAAGGCTATTCAGAAGCATTGAATGTGCTG
AAAACTTGACTTGGCATTCTACTGAAAGAATTAATAATGGTAAGTTACGACATCCAGCAGACTCTCCCGCATGGAAGTTAGTAGACATGAAATGGCCAAAGTTTGGTTCT
GAACCCATAAATCTTCGTTTAGCATTGTCAGCCGATGGAGTAAATCCTCATGGTGACATGAGTTCTAAATACAATTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAATTTGGCTTGGTTTAGCTTCTAGCTTGCGGTTTGGCTCTGGTTCACTATCTTGCAGGCGCGGGCACGACTCGACTTGGCTTCGAACGGGTTTACACATGGTCGG
CTATGGTTCAGAGCGGTTGGCTTCGATTCTATGTGGTTCAACTTTTGGCATGCAGTCAACTCCAGTTTGGCGTAGATTATGTGTTTTGGCGCCGACTCAGCTTGTTTTGG
GGCGGTTCTTCCATTTTTGGATCGGTTCAAGGGAAGCTCCTGTTATTGCTTACACTGAGAAGATAATCGAAGAAGGGACTCTTCAATTGAGAAATAGAAAGGGTGTTAGA
GATCACTTGTATGTTAATGGTATTGATGAAAGTTATAAATTTTGGTTTTGGCATGGGGAAGAACTTCCTAACTCATCCTTCTATGATGAATCTTCAAAGTTTGACACCCA
TACATGTGAAGATCAGGATGTTGGAAGTGTAAAAGAAATGATTGAAGTTGCTCATGAGGAGTATTCAAAAGACCCAACTGGATTTGAGAAGTTGCTTATTGATGCTGAAA
AACCATTTGATACTAGTTTTTCAGAATTACTTGAAACTTTGAAGGAAATTCTGCCTAATACCAATGAGCTCCCGAATTCATTGTATGAAGCAAAGAAAACATTAGGTGCA
TTAGGAATGGAATACGAAAAGATTCATGCATGCCCGAATAATTGTTGTCTCTATAGAAAAGAATTTGCTAATGCAATTGAATGTCATGAATGTGGTCAATCAAGGTGGAA
AAACGTCAAGGATACCAGTGAAAGGAGAAAGCAAATTCCCTCTAAAGTGATATGGTACTTTCCAATCATTCCACTATTTAAAAGGCTATTCAGAAGCATTGAATGTGCTG
AAAACTTGACTTGGCATTCTACTGAAAGAATTAATAATGGTAAGTTACGACATCCAGCAGACTCTCCCGCATGGAAGTTAGTAGACATGAAATGGCCAAAGTTTGGTTCT
GAACCCATAAATCTTCGTTTAGCATTGTCAGCCGATGGAGTAAATCCTCATGGTGACATGAGTTCTAAATACAATTGTTGA
Protein sequenceShow/hide protein sequence
MPIWLGLASSLRFGSGSLSCRRGHDSTWLRTGLHMVGYGSERLASILCGSTFGMQSTPVWRRLCVLAPTQLVLGRFFHFWIGSREAPVIAYTEKIIEEGTLQLRNRKGVR
DHLYVNGIDESYKFWFWHGEELPNSSFYDESSKFDTHTCEDQDVGSVKEMIEVAHEEYSKDPTGFEKLLIDAEKPFDTSFSELLETLKEILPNTNELPNSLYEAKKTLGA
LGMEYEKIHACPNNCCLYRKEFANAIECHECGQSRWKNVKDTSERRKQIPSKVIWYFPIIPLFKRLFRSIECAENLTWHSTERINNGKLRHPADSPAWKLVDMKWPKFGS
EPINLRLALSADGVNPHGDMSSKYNC