CuGenDBv2

Cucsat.G15518 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G15518
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 6 growth stages;
Genome location: ctg2009:135717..141373
RNA-Seq Expression: Cucsat.G15518
Synteny: Cucsat.G15518
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_008439251.1 PREDICTED: uncharacterized protein LOC103484091 isoform X1 [Cucumis melo] (e value: 0.0; % identity: 93.69)
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
        MASTSPTCSP+SLQLRLALNCNNCGKFPS+ VRARVRKLDPRLR++C PIVHNG KFDRGNG RGTGVCFAGSEST DGFSGWSESDSQGE LDLRRKKW
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW

Query:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSINKQNSSRQK QMEALSTQQELLLDSETGTDRLGEDEKED SVDADDET AGKAGNQEDSSS TENEETLNKNRVG
Subjt:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG

Query:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
        DGVDVEELAEN+VESSSSNNDVHN ASLQEDFQSDSSL VT+VAPGSLSS ISPESEFDSN+ASCLKDVNN HPGLEVSTSEPEMN+LKDEPDN PNSN 
Subjt:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
        NSLNLKTDIRDERPDTGEN+DL SKKLPVYD+SSSNYISGNQDETL  V+EITDSSLQGFS +S DTAKES LFDG TVAKS EGV SPS+IEQFSSED 
Subjt:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN

Query:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
        APSIEQQLES LSEAALVSI+DYPLADDQE NHETIMNGTAAKRELQEI FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIE DVE
Subjt:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITPQDPDFASIQGLAEAG+ISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER R+ LALMMERAS+ESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQFTVEETT+RAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME

Query:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        KLKRMAAEVRG+SRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKN AISRA+RSA ELQQSTAELSLAMKEGAKRVVGDCREGVEK TQKFRTSYG
Subjt:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

XP_008439252.1 PREDICTED: uncharacterized protein LOC103484091 isoform X2 [Cucumis melo] (e value: 0.0; % identity: 93.67)
Query:  KWFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNR
        ++  G VGIGITGFILVSGITFAAWSINKQNSSRQK QMEALSTQQELLLDSETGTDRLGEDEKED SVDADDET AGKAGNQEDSSS TENEETLNKNR
Subjt:  KWFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNR

Query:  VGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNS
        VGDGVDVEELAEN+VESSSSNNDVHN ASLQEDFQSDSSL VT+VAPGSLSS ISPESEFDSN+ASCLKDVNN HPGLEVSTSEPEMN+LKDEPDN PNS
Subjt:  VGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNS

Query:  NTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSE
        N NSLNLKTDIRDERPDTGEN+DL SKKLPVYD+SSSNYISGNQDETL  V+EITDSSLQGFS +S DTAKES LFDG TVAKS EGV SPS+IEQFSSE
Subjt:  NTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSE

Query:  DNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVD
        D APSIEQQLES LSEAALVSI+DYPLADDQE NHETIMNGTAAKRELQEI FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIE D
Subjt:  DNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVD

Query:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL
        VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITPQDPDFASIQGLAEAG+ISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL
Subjt:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL

Query:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE
        VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE
Subjt:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE

Query:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE
        NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER R+ LALMMERAS+ESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE
Subjt:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE

Query:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENL
        RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQFTVEETT+RAENL
Subjt:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENL

Query:  MEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        MEKLKRMAAEVRG+SRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKN AISRA+RSA ELQQSTAELSLAMKEGAKRVVGDCREGVEK TQKFRTSYG
Subjt:  MEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

XP_011651163.1 uncharacterized protein LOC101215442 isoform X1 [Cucumis sativus] (e value: 0.0; % identity: 99.5)
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
        MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW

Query:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
        FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
Subjt:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG

Query:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
        DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFD+N+ASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
Subjt:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
        NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFS IS DTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
Subjt:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN

Query:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
        APSIEQ LESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
Subjt:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME

Query:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
Subjt:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

XP_011651165.1 uncharacterized protein LOC101215442 isoform X2 [Cucumis sativus] (e value: 0.0; % identity: 99)
Query:  KWFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNR
        ++  GFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNR
Subjt:  KWFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNR

Query:  VGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNS
        VGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFD+N+ASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNS
Subjt:  VGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNS

Query:  NTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSE
        NTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFS IS DTAKESGLFDGETVAKSSEGVSSPSRIEQFSSE
Subjt:  NTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSE

Query:  DNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVD
        DNAPSIEQ LESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVD
Subjt:  DNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVD

Query:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL
        VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL
Subjt:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL

Query:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE
        VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE
Subjt:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE

Query:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE
        NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE
Subjt:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE

Query:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENL
        RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENL
Subjt:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENL

Query:  MEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        MEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
Subjt:  MEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

XP_038892464.1 uncharacterized protein LOC120081550 isoform X1 [Benincasa hispida] (e value: 0.0; % identity: 85.87)
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
        MAST  TCSP+SLQLRLALNC NCGKFPS+LVRAR+RKLDPRLRVICHPIV+NG + +RGNG   TGVCFAGSEST DGFSGWSESDSQ + LDL RKKW
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW

Query:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
         GGFVGIGITGFIL+SGITFAAWSINKQNSSRQKPQMEALSTQQELLL S+TG D+LGED KE+  ++ADDET  GK GNQEDSSS TENEETLNKNRVG
Subjt:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG

Query:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
        D VDV+ELAEN VESSSSNNDV++V SLQEDFQSDSSL VTSVAPGSLSSLISPESEFDSNIASC KDVNN H G EVSTSE EMNILKDEPDNLPNSNT
Subjt:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
        NSLNLKTDI DERPDTGENYD  SKKLP+YDDSSSNY SGNQD+TL PV+EITDSSLQ  S                               EQF SE  
Subjt:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN

Query:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
        A +IEQ++E  LSEAALVS++DYP ADDQEKNHE++MNGTAAK ELQEI FSSAGVPAP+VSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIE DV 
Subjt:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAG+ISSKLSRHDISSSLDEDQGPL FSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
        WKMALEKRQLP ADRKMLHQVSGFID DKIHPDACPALVADLSVGEQGI+ALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
        VA HSALVAQVEKDINASFEKELSIEREKV+AVEKMAEEAKQELERLRS+RER+ + L+ ERASIESEME+LSRLRSELEEQL+GLMSNKVE+S+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQESA DTWLDSSKQF VEETT+RAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME

Query:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        KLKRMA EVRG+SRDVIEKIIQKIALLVSNLRQWISKTGEQAE+LKN AISRA+RSA+ELQQSTAELSLA+KEGAKRVVGDCREGVEK TQKF+TSYG
Subjt:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L9T9 Uncharacterized protein (e value: 0.0; % identity: 99.5)
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
        MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW

Query:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
        FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
Subjt:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG

Query:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
        DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFD+N+ASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
Subjt:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
        NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFS IS DTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
Subjt:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN

Query:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
        APSIEQ LESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
Subjt:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME

Query:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
Subjt:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

A0A1S3AYZ8 uncharacterized protein LOC103484091 isoform X1 (e value: 0.0; % identity: 93.69)
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
        MASTSPTCSP+SLQLRLALNCNNCGKFPS+ VRARVRKLDPRLR++C PIVHNG KFDRGNG RGTGVCFAGSEST DGFSGWSESDSQGE LDLRRKKW
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW

Query:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSINKQNSSRQK QMEALSTQQELLLDSETGTDRLGEDEKED SVDADDET AGKAGNQEDSSS TENEETLNKNRVG
Subjt:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG

Query:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
        DGVDVEELAEN+VESSSSNNDVHN ASLQEDFQSDSSL VT+VAPGSLSS ISPESEFDSN+ASCLKDVNN HPGLEVSTSEPEMN+LKDEPDN PNSN 
Subjt:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
        NSLNLKTDIRDERPDTGEN+DL SKKLPVYD+SSSNYISGNQDETL  V+EITDSSLQGFS +S DTAKES LFDG TVAKS EGV SPS+IEQFSSED 
Subjt:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN

Query:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
        APSIEQQLES LSEAALVSI+DYPLADDQE NHETIMNGTAAKRELQEI FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIE DVE
Subjt:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITPQDPDFASIQGLAEAG+ISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER R+ LALMMERAS+ESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQFTVEETT+RAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENLME

Query:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        KLKRMAAEVRG+SRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKN AISRA+RSA ELQQSTAELSLAMKEGAKRVVGDCREGVEK TQKFRTSYG
Subjt:  KLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

A0A1S3AZ05 uncharacterized protein LOC103484091 isoform X2 (e value: 0.0; % identity: 93.67)
Query:  KWFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNR
        ++  G VGIGITGFILVSGITFAAWSINKQNSSRQK QMEALSTQQELLLDSETGTDRLGEDEKED SVDADDET AGKAGNQEDSSS TENEETLNKNR
Subjt:  KWFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNR

Query:  VGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNS
        VGDGVDVEELAEN+VESSSSNNDVHN ASLQEDFQSDSSL VT+VAPGSLSS ISPESEFDSN+ASCLKDVNN HPGLEVSTSEPEMN+LKDEPDN PNS
Subjt:  VGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNS

Query:  NTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSE
        N NSLNLKTDIRDERPDTGEN+DL SKKLPVYD+SSSNYISGNQDETL  V+EITDSSLQGFS +S DTAKES LFDG TVAKS EGV SPS+IEQFSSE
Subjt:  NTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSE

Query:  DNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVD
        D APSIEQQLES LSEAALVSI+DYPLADDQE NHETIMNGTAAKRELQEI FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIE D
Subjt:  DNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVD

Query:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL
        VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITPQDPDFASIQGLAEAG+ISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL
Subjt:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDL

Query:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE
        VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE
Subjt:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAE

Query:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE
        NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER R+ LALMMERAS+ESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE
Subjt:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKE

Query:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENL
        RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQFTVEETT+RAENL
Subjt:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEETTERAENL

Query:  MEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG
        MEKLKRMAAEVRG+SRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKN AISRA+RSA ELQQSTAELSLAMKEGAKRVVGDCREGVEK TQKFRTSYG
Subjt:  MEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRTSYG

A0A5A7SWX6 Putative oxidoreductase/transition metal ion-binding protein (e value: 0.0; % identity: 93.63)
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
        MASTSPTCSP+SLQLRLALNCNNCGKFPS+ VRARVRKLDPRLR++C PIVHNG KFDRGNG RGTGVCFAGSEST DGFSGWSESDSQGE LDLRRKKW
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW

Query:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSINKQNSSRQK QMEALSTQQELLLDSETGTDRLGEDEKED SVDADDETLAGKAGNQEDSSS TENEETLNKNRVG
Subjt:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG

Query:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT
        DGVDVEELAEN+VESSSSNNDVHN ASLQEDFQSDSSL VT+VAPGSLSS ISPESEFDSN+ASCLKDVNN HPGLEVSTSEPEMN+LKDEPDN PNSN 
Subjt:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN
        NSLNLKTDIRDERPDTGENYDL SKKLPVYDDSSSNYISGNQDETL  V+EITDSSLQGFS +S DTAKES LFDG TVAKS EGV SPS+IEQFSSED 
Subjt:  NSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDN

Query:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE
        APSIEQQLES LSEAALVSI+DYPLADDQE NHETIMNGTAAKRELQEI FSSAGVPAPLVSAAVKT PGKVL+ AVVDQVQGQALAALQVLKVIE DVE
Subjt:  APSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITPQDPDFASIQGLAEAG+ISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER R+ LALMMERAS+ESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMAR
        NKLRKEAEIENQEISRLQYELEVERKALSMAR
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMAR

A0A6J1CGI7 uncharacterized protein LOC111011467 isoform X1 (e value: 0.0; % identity: 80.73)
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW
        MAST  TCSP SLQLRLALNC NC KFPS+LVRARVRKLDPR+R+ C+PIV+NG   +R NG+R +GVCFA S+ST DGFSGWSESDS  E LDLRRK W
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKW

Query:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSI+KQNSSRQKPQMEALSTQQELLLDS+TG DRLGE+EKED SV+ADD TLAGK GN E+SSSYTENE+ L+KN VG
Subjt:  FGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVG

Query:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHP-GLEVSTSEPEMNILKDEPDNLPNSN
        DGVDVE+L+ N VESSSSNNDV+NVAS QEDFQSDS   VTSVA GSLSSL+    E DS++AS  KD N+ H  G EV  SEPEMNILKD PDN  NSN
Subjt:  DGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHP-GLEVSTSEPEMNILKDEPDNLPNSN

Query:  TNSLNLKTDIRDERPDTGENYDLGSK--------KLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSR
        TNSLN KTDI+DE PDT ENYD  S+        KLP+YDDS+SN+ SGNQ E   P++EI+DSSL   S +S DTAKESG  D ETV +SS+ V +P++
Subjt:  TNSLNLKTDIRDERPDTGENYDLGSK--------KLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSR

Query:  IEQFSSEDNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQV
         E+  SE    ++EQQ+E  LSEAA VS++ YPL D QEK+HETIMN TAAK ELQ I FSSAGVPAPL SAA+KT PGKVL+PAVVDQVQGQAL+ALQV
Subjt:  IEQFSSEDNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQV

Query:  LKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPES
        LKVIE +VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSRHDI SS DEDQGP YFSPES
Subjt:  LKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPES

Query:  LLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI
         LSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI
Subjt:  LLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI

Query:  EAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKV
        EAESMAENAVAAH ALVAQVEKDINASFEK+LSIEREKV+AVEKMAEEAKQELERLRSERERE LALM E A+IESEMEV SRLR+ELEEQLQGLMSNKV
Subjt:  EAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKV

Query:  EVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEET
        EVSYEKERINKLRKEAEIENQEI+RLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWE+RGIKVVVDSDLREQESAGDTWLDSSKQF+V+ET
Subjt:  EVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFTVEET

Query:  TERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQ
         +RAENLM+KLK MAAE+RG+S+++++KII+KIALL+SNLRQW+S  G+QAE+LK  AISRA RS  ELQQSTAEL LA+KEGAKRVVGDCREGVEK TQ
Subjt:  TERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQ

Query:  KFRTSYG
        KF+TSYG
Subjt:  KFRTSYG

SwissProt top hits | e value | %identity | Alignment
No hits found
Arabidopsis top hits | e value | %identity | Alignment
AT3G25680.1 FUNCTIONS IN: molecular_function unknown | 2.2e-55 | 33.56
Query:  LVSISDYPLADDQEKNHETIM---NGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVEPSDLCTRREYARW
        L+   +  L  +  +  ET+    N + ++   + + + S  V +   ++  KTH  +V  P  VD  Q +A+A L+ LK+ E D+   +LCT+REYARW
Subjt:  LVSISDYPLADDQEKNHETIM---NGTAAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVEPSDLCTRREYARW

Query:  LVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVSWKMALEKRQLPEA
        LV ++S L RN    + PA+ +   +  AFDDI   DPDF  IQ LAEAG+ SSKLS  D  +    D G   F+PES +SR DLV+WK  LE    PE 
Subjt:  LVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVSWKMALEKRQLPEA

Query:  DRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEK
          ++      +IDT  I+PD       D  +G++  I   FG  + FQP++PVTKAQAA+AL +G+    ++ EL+R+EAES+++ A           + 
Subjt:  DRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEK

Query:  DINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQE
        +I   +++++  ER +   +E++      E+E  ++ +E+     + E+A+I+ + ++L+ L  E++E  Q L+S+K     E  ++ ++  + + + + 
Subjt:  DINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQE

Query:  ISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWE
        +   +  LE E +AL + R+W EDE K ++ +AK LEEA  RW+
Subjt:  ISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWE

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope | 9.5e-200 | 46.88
Query:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQG-EGLDLRRKK
        MAS + T +PTSLQLRLAL+     K P++ +R           ++C   V    + D   G          S+S+ D  +GW +SD+   +   +++K 
Subjt:  MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQG-EGLDLRRKK

Query:  WFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDS--ETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKN
           G VG G+ G IL  G+++AA S +K+    +K +M +L++QQE ++ S  E  +D +     E++++  +D+++      Q+      E++    + 
Subjt:  WFGGFVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDS--ETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKN

Query:  RVGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESE--FDSNIASCLKDVNNYH-PGLEVSTSEPEMNILKDEPDN
           DGV  +E   +  ES   N             ++D  +N  +    + S  I  ES+   DS+    L D  + +  G+E + SE   ++L  EP  
Subjt:  RVGDGVDVEELAENHVESSSSNNDVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESE--FDSNIASCLKDVNNYH-PGLEVSTSEPEMNILKDEPDN

Query:  LPNSNTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQ
             TN  +L+  +  ++ D+          L       +   SG   E L  V    DS+ +   +  +DT         ET   ++E +S  +   +
Subjt:  LPNSNTNSLNLKTDIRDERPDTGENYDLGSKKLPVYDDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQ

Query:  FSSEDNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAA--KRELQEI-----SFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALA
        +    +  SI    + + ++    S S  P + D  K+   I +       R L EI     +FSSAG+PAP +S  V  +PGK+L+P   DQ+Q QA A
Subjt:  FSSEDNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAA--KRELQEI-----SFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALA

Query:  ALQVLKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYF
        ALQVLKVIE D +PSDLCTRREYARWL+SASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDF+SIQGLAEAG+I+SKLS  D+   LD+ +G   F
Subjt:  ALQVLKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYF

Query:  SPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEE
        SPESLLSRQDL+SWKMALEKRQLPEAD+KML+++SGFID DKI+PDA P+++ADLS GEQGI ALAFG TRLFQP KPVTK QAAIAL++GEASDIVSEE
Subjt:  SPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEE

Query:  LARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLM
        LARIEAESMAE AV+AH+ALVA+VEKD+NASFEKELS+EREK+EAVEKMAE AK ELE+LR +RE E LAL+ ERA++ESEMEVLSRLR + EE+L+ LM
Subjt:  LARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLM

Query:  SNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGDTWLDSSK
        SNK E+++EKER+  LRKEAE E+Q IS+LQYELEVERKALSMAR+WAE+EAKKAREQ +ALEEAR RWE  G++VVVD DL+E   +E+     L+  +
Subjt:  SNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGDTWLDSSK

Query:  QFTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCRE
        + +VEET  RA+ LM+KLK MA  V G+SR+VI  +++KI L ++ L+++    G++A E+++ AI RA  +A +++Q T ++S    +  K++  +CR+
Subjt:  QFTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCRE

Query:  GVEKFTQKFRT
        GV K +Q+F+T
Subjt:  GVEKFTQKFRT

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119) | 1.0e-156 | 62.94
Query:  AALQVLKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLY
        AALQ LKVIE D  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITP+DPDF  IQGLAEAG+ISSKLS +++ SS   +   + 
Subjt:  AALQVLKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLY

Query:  FSPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE
        FSPES L+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V E
Subjt:  FSPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE

Query:  ELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGL
        ELARIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E V+AVEK+AEEAK EL RLR E+E E LAL  ER SIE+EME L+R+R+ELEEQLQ L
Subjt:  ELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGL

Query:  MSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQ
         SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA++AREQAK LEEAR RWEK G+KV+VDSDL EQ +  + TWL++ KQ
Subjt:  MSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQ

Query:  FTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREG
          VE T +RA NL+ KLK+MA +V  +SR+VI  II+KI+LL+S L+Q +     +A++LK    S+A+   ++      E+       AK  V + ++ 
Subjt:  FTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREG

Query:  VEKFTQKFRT
        V K  +KF++
Subjt:  VEKFTQKFRT

AT5G52410.2 INVOLVED IN: biological_process unknown | 1.8e-161 | 54.6
Query:  TLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKR
        T   +D+I+    +G  + S D  +    F    V+  S+ V SP  I+  S      S   + ++E +E A VS ++   +  +    +T    T+   
Subjt:  TLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGTAAKR

Query:  ELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD
          ++ +    G+PAP  S   +    K + P VVD VQ Q  AALQ LKVIE D  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFD
Subjt:  ELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD

Query:  DITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSV
        DITP+DPDF  IQGLAEAG+ISSKLS +++ SS   +   + FSPES L+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS 
Subjt:  DITPQDPDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSV

Query:  GEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQEL
        GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V EELARIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E V+AVEK+AEEAK EL
Subjt:  GEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQEL

Query:  ERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKARE
         RLR E+E E LAL  ER SIE+EME L+R+R+ELEEQLQ L SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA++ARE
Subjt:  ERLRSEREREGLALMMERASIESEMEVLSRLRSELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKARE

Query:  QAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAE
        QAK LEEAR RWEK G+KV+VDSDL EQ +  + TWL++ KQ  VE T +RA NL+ KLK+MA +V  +SR+VI  II+KI+LL+S L+Q +     +A+
Subjt:  QAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAE

Query:  ELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRT
        +LK    S+A+   ++      E+       AK  V + ++ V K  +KF++
Subjt:  ELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFTQKFRT


Sequences
CDS sequence
ATGGCTTCCACTTCTCCCACGTGTTCACCCACCTCTCTTCAGCTTCGTCTCGCTCTCAATTGCAACAACTGCGGAAAATTCCCTTCACTTCTTGTTCGGGCGAGAGTTAG
GAAATTGGATCCTCGACTCCGCGTCATTTGTCATCCTATTGTTCATAATGGCGTCAAATTTGATAGAGGAAATGGACGGCGTGGCACTGGAGTTTGTTTTGCTGGATCGG
AGTCTACGCCTGATGGGTTCTCTGGGTGGTCGGAGTCGGATTCCCAAGGGGAGGGTTTGGACTTGCGAAGAAAGAAGTGGTTTGGAGGATTCGTGGGGATTGGAATTACT
GGATTCATCCTCGTCTCCGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTTCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATTATT
GTTGGACTCCGAGACTGGAACTGATAGGCTTGGTGAAGATGAAAAGGAAGATACTAGTGTGGATGCAGATGATGAAACTCTCGCTGGTAAAGCAGGTAATCAAGAGGACT
CTTCTTCATATACAGAAAATGAAGAAACTCTCAACAAAAATAGAGTTGGTGACGGTGTTGATGTTGAGGAGTTGGCAGAAAATCATGTTGAATCTTCATCCAGCAATAAC
GATGTCCATAATGTTGCTTCCTTGCAAGAAGATTTCCAATCTGATTCCTCACTAAATGTTACATCAGTTGCTCCTGGAAGTTTGAGTTCACTTATCTCACCTGAATCTGA
GTTTGATTCTAACATAGCTTCTTGTTTAAAAGATGTAAACAACTACCATCCTGGTTTAGAAGTCTCAACTTCTGAACCAGAAATGAATATATTGAAAGATGAACCAGACA
ACTTGCCTAACTCTAATACTAATTCATTAAACCTTAAAACTGATATTCGGGATGAAAGACCTGACACTGGTGAAAATTATGATCTCGGCTCTAAGAAGTTACCAGTTTAT
GATGATAGTTCATCAAACTATATTTCTGGCAACCAGGATGAGACACTTGATCCAGTAGATGAAATTACAGATTCTTCATTGCAAGGATTTTCTATCATATCTCACGACAC
AGCCAAGGAATCAGGATTATTTGACGGAGAGACTGTGGCTAAATCATCTGAAGGAGTTTCGAGCCCCAGCAGAATTGAACAATTCTCATCTGAGGATAATGCACCAAGCA
TAGAACAGCAACTAGAAAGTGAATTATCTGAAGCAGCATTAGTCTCTATCTCAGATTATCCACTTGCAGATGATCAAGAGAAAAATCATGAAACTATCATGAATGGTACT
GCTGCCAAACGAGAACTACAAGAGATTTCATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCTGCTGTTAAAACACATCCTGGCAAGGTCCTAATTCCTGCAGT
AGTGGATCAGGTTCAGGGGCAAGCCTTGGCAGCACTGCAAGTTTTAAAGGTGATTGAAGTTGATGTTGAGCCTAGTGATCTATGTACTCGTCGGGAATATGCTCGTTGGC
TGGTGTCTGCAAGCAGTGCTCTCTCTAGGAACACAACATCTAAAGTATACCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCTTTTGATGATATTACACCCCAAGAC
CCTGATTTTGCATCTATTCAAGGTTTAGCTGAAGCTGGAATGATATCAAGCAAGCTTTCTAGACATGATATTTCTTCTTCATTGGACGAAGACCAGGGTCCCTTATATTT
CTCTCCTGAAAGCCTCCTGTCACGTCAAGATCTTGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTGCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGAT
TTATTGATACTGATAAGATCCATCCAGATGCTTGTCCTGCACTTGTTGCTGATCTTTCTGTAGGAGAACAGGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTTC
CAGCCAGATAAGCCTGTAACAAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCCGATATAGTAAGTGAGGAGCTTGCAAGGATTGAAGCTGAATCAATGGC
AGAAAATGCTGTTGCTGCGCATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATTAATGCCAGCTTTGAGAAAGAACTTTCCATTGAAAGAGAGAAGGTTGAGGCTGTGG
AGAAAATGGCAGAAGAGGCAAAGCAAGAATTGGAAAGATTAAGATCAGAAAGAGAGAGAGAAGGTCTTGCGTTGATGATGGAACGTGCTTCTATTGAATCTGAAATGGAA
GTTCTTTCAAGGTTAAGGAGTGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGCAACAAAGTAGAGGTATCCTATGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGC
TGAAATTGAAAATCAAGAGATTTCCCGCCTGCAGTATGAGCTTGAAGTTGAGAGAAAGGCGCTGTCCATGGCCAGAGCATGGGCCGAGGATGAAGCAAAAAAAGCAAGAG
AACAAGCAAAAGCCCTTGAAGAGGCTAGGGATCGCTGGGAAAAGCGTGGCATCAAAGTAGTCGTCGACAGTGATCTCCGTGAACAAGAATCTGCTGGCGATACCTGGCTT
GATTCTAGCAAACAGTTCACAGTCGAAGAAACTACAGAACGAGCCGAGAACTTAATGGAGAAGCTGAAAAGAATGGCTGCAGAAGTAAGAGGGCAATCGAGAGATGTAAT
CGAGAAGATCATCCAGAAGATAGCTTTACTGGTATCAAACTTGAGACAATGGATTTCGAAAACTGGAGAACAGGCTGAAGAACTAAAGAATGGGGCGATTTCAAGAGCAG
ATAGATCAGCAAAAGAGCTGCAACAGAGCACTGCAGAGTTGAGCTTGGCTATGAAGGAGGGAGCAAAGAGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAATTTACC
CAAAAGTTTAGAACATCTTATGGTTAA
mRNA sequence
ATGGCTTCCACTTCTCCCACGTGTTCACCCACCTCTCTTCAGCTTCGTCTCGCTCTCAATTGCAACAACTGCGGAAAATTCCCTTCACTTCTTGTTCGGGCGAGAGTTAG
GAAATTGGATCCTCGACTCCGCGTCATTTGTCATCCTATTGTTCATAATGGCGTCAAATTTGATAGAGGAAATGGACGGCGTGGCACTGGAGTTTGTTTTGCTGGATCGG
AGTCTACGCCTGATGGGTTCTCTGGGTGGTCGGAGTCGGATTCCCAAGGGGAGGGTTTGGACTTGCGAAGAAAGAAGTGGTTTGGAGGATTCGTGGGGATTGGAATTACT
GGATTCATCCTCGTCTCCGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTTCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATTATT
GTTGGACTCCGAGACTGGAACTGATAGGCTTGGTGAAGATGAAAAGGAAGATACTAGTGTGGATGCAGATGATGAAACTCTCGCTGGTAAAGCAGGTAATCAAGAGGACT
CTTCTTCATATACAGAAAATGAAGAAACTCTCAACAAAAATAGAGTTGGTGACGGTGTTGATGTTGAGGAGTTGGCAGAAAATCATGTTGAATCTTCATCCAGCAATAAC
GATGTCCATAATGTTGCTTCCTTGCAAGAAGATTTCCAATCTGATTCCTCACTAAATGTTACATCAGTTGCTCCTGGAAGTTTGAGTTCACTTATCTCACCTGAATCTGA
GTTTGATTCTAACATAGCTTCTTGTTTAAAAGATGTAAACAACTACCATCCTGGTTTAGAAGTCTCAACTTCTGAACCAGAAATGAATATATTGAAAGATGAACCAGACA
ACTTGCCTAACTCTAATACTAATTCATTAAACCTTAAAACTGATATTCGGGATGAAAGACCTGACACTGGTGAAAATTATGATCTCGGCTCTAAGAAGTTACCAGTTTAT
GATGATAGTTCATCAAACTATATTTCTGGCAACCAGGATGAGACACTTGATCCAGTAGATGAAATTACAGATTCTTCATTGCAAGGATTTTCTATCATATCTCACGACAC
AGCCAAGGAATCAGGATTATTTGACGGAGAGACTGTGGCTAAATCATCTGAAGGAGTTTCGAGCCCCAGCAGAATTGAACAATTCTCATCTGAGGATAATGCACCAAGCA
TAGAACAGCAACTAGAAAGTGAATTATCTGAAGCAGCATTAGTCTCTATCTCAGATTATCCACTTGCAGATGATCAAGAGAAAAATCATGAAACTATCATGAATGGTACT
GCTGCCAAACGAGAACTACAAGAGATTTCATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCTGCTGTTAAAACACATCCTGGCAAGGTCCTAATTCCTGCAGT
AGTGGATCAGGTTCAGGGGCAAGCCTTGGCAGCACTGCAAGTTTTAAAGGTGATTGAAGTTGATGTTGAGCCTAGTGATCTATGTACTCGTCGGGAATATGCTCGTTGGC
TGGTGTCTGCAAGCAGTGCTCTCTCTAGGAACACAACATCTAAAGTATACCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCTTTTGATGATATTACACCCCAAGAC
CCTGATTTTGCATCTATTCAAGGTTTAGCTGAAGCTGGAATGATATCAAGCAAGCTTTCTAGACATGATATTTCTTCTTCATTGGACGAAGACCAGGGTCCCTTATATTT
CTCTCCTGAAAGCCTCCTGTCACGTCAAGATCTTGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTGCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGAT
TTATTGATACTGATAAGATCCATCCAGATGCTTGTCCTGCACTTGTTGCTGATCTTTCTGTAGGAGAACAGGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTTC
CAGCCAGATAAGCCTGTAACAAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCCGATATAGTAAGTGAGGAGCTTGCAAGGATTGAAGCTGAATCAATGGC
AGAAAATGCTGTTGCTGCGCATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATTAATGCCAGCTTTGAGAAAGAACTTTCCATTGAAAGAGAGAAGGTTGAGGCTGTGG
AGAAAATGGCAGAAGAGGCAAAGCAAGAATTGGAAAGATTAAGATCAGAAAGAGAGAGAGAAGGTCTTGCGTTGATGATGGAACGTGCTTCTATTGAATCTGAAATGGAA
GTTCTTTCAAGGTTAAGGAGTGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGCAACAAAGTAGAGGTATCCTATGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGC
TGAAATTGAAAATCAAGAGATTTCCCGCCTGCAGTATGAGCTTGAAGTTGAGAGAAAGGCGCTGTCCATGGCCAGAGCATGGGCCGAGGATGAAGCAAAAAAAGCAAGAG
AACAAGCAAAAGCCCTTGAAGAGGCTAGGGATCGCTGGGAAAAGCGTGGCATCAAAGTAGTCGTCGACAGTGATCTCCGTGAACAAGAATCTGCTGGCGATACCTGGCTT
GATTCTAGCAAACAGTTCACAGTCGAAGAAACTACAGAACGAGCCGAGAACTTAATGGAGAAGCTGAAAAGAATGGCTGCAGAAGTAAGAGGGCAATCGAGAGATGTAAT
CGAGAAGATCATCCAGAAGATAGCTTTACTGGTATCAAACTTGAGACAATGGATTTCGAAAACTGGAGAACAGGCTGAAGAACTAAAGAATGGGGCGATTTCAAGAGCAG
ATAGATCAGCAAAAGAGCTGCAACAGAGCACTGCAGAGTTGAGCTTGGCTATGAAGGAGGGAGCAAAGAGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAATTTACC
CAAAAGTTTAGAACATCTTATGGTTAA
Protein sequence
MASTSPTCSPTSLQLRLALNCNNCGKFPSLLVRARVRKLDPRLRVICHPIVHNGVKFDRGNGRRGTGVCFAGSESTPDGFSGWSESDSQGEGLDLRRKKWFGGFVGIGIT
GFILVSGITFAAWSINKQNSSRQKPQMEALSTQQELLLDSETGTDRLGEDEKEDTSVDADDETLAGKAGNQEDSSSYTENEETLNKNRVGDGVDVEELAENHVESSSSNN
DVHNVASLQEDFQSDSSLNVTSVAPGSLSSLISPESEFDSNIASCLKDVNNYHPGLEVSTSEPEMNILKDEPDNLPNSNTNSLNLKTDIRDERPDTGENYDLGSKKLPVY
DDSSSNYISGNQDETLDPVDEITDSSLQGFSIISHDTAKESGLFDGETVAKSSEGVSSPSRIEQFSSEDNAPSIEQQLESELSEAALVSISDYPLADDQEKNHETIMNGT
AAKRELQEISFSSAGVPAPLVSAAVKTHPGKVLIPAVVDQVQGQALAALQVLKVIEVDVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPQD
PDFASIQGLAEAGMISSKLSRHDISSSLDEDQGPLYFSPESLLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLF
QPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSEREREGLALMMERASIESEME
VLSRLRSELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKKAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWL
DSSKQFTVEETTERAENLMEKLKRMAAEVRGQSRDVIEKIIQKIALLVSNLRQWISKTGEQAEELKNGAISRADRSAKELQQSTAELSLAMKEGAKRVVGDCREGVEKFT
QKFRTSYG