; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012307 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012307
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 6 growth stages;
Genome locationChr01:19948741..19953963
RNA-Seq ExpressionHG10012307
SyntenyHG10012307
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439251.1 PREDICTED: uncharacterized protein LOC103484091 isoform X1 [Cucumis melo]0.0e+0090.88Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVHNGAK +RGNGLR +G+CFAGS+STADGFSGWSESDSQGEVLDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGGLVGIGITGFILVSGITFAAWSINKQNSSRQK QMEALS QQELLLDS+TG DRLGEDEKEDNSV+ADDET AG+ GNQEDSSSCTENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+N ASLQEDFQSDSSLTVT+VAPGSLSS ISPESEFDSNVASC KDVNNCH G EVSTSEPEMN+LKDEPDN PNSN 
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGEN+D SSKKLPVYD+SSSNY SGNQDETLG VNEITDSSLQGFS +S DT KES LFD  TVA+S +GVLSPSK EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQQ+E GLSEAALVSITDYP ADDQE NHETI N TAAK ELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIE+DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER RDS+ LM ERAS+ESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQES GDTWLDSSKQF VEETTDRAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        KLKRM+ EVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKNVAIS+A+RSATELQQSTAELSLA+KEGAKRVVGDCREGVEKITQKFRTSYG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

XP_008439252.1 PREDICTED: uncharacterized protein LOC103484091 isoform X2 [Cucumis melo]0.0e+0090.78Show/hide
Query:  KWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNR
        ++  GLVGIGITGFILVSGITFAAWSINKQNSSRQK QMEALS QQELLLDS+TG DRLGEDEKEDNSV+ADDET AG+ GNQEDSSSCTENEETLNKNR
Subjt:  KWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNR

Query:  VGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNS
        VGD VDV ELAEN VESSSSNNDV+N ASLQEDFQSDSSLTVT+VAPGSLSS ISPESEFDSNVASC KDVNNCH G EVSTSEPEMN+LKDEPDN PNS
Subjt:  VGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNS

Query:  NTNSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSE
        N NSLNLKTDI+DE PDTGEN+D SSKKLPVYD+SSSNY SGNQDETLG VNEITDSSLQGFS +S DT KES LFD  TVA+S +GVLSPSK EQ  SE
Subjt:  NTNSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSE

Query:  DTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEAD
        D A +IEQQ+E GLSEAALVSITDYP ADDQE NHETI N TAAK ELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIE+D
Subjt:  DTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEAD

Query:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDL
        VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPLYFSPES LSRQDL
Subjt:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDL

Query:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAE
        VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAE
Subjt:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAE

Query:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKE
        NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER RDS+ LM ERAS+ESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKE
Subjt:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKE

Query:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENL
        RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQES GDTWLDSSKQF VEETTDRAENL
Subjt:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENL

Query:  MEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        MEKLKRM+ EVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKNVAIS+A+RSATELQQSTAELSLA+KEGAKRVVGDCREGVEKITQKFRTSYG
Subjt:  MEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

XP_011651163.1 uncharacterized protein LOC101215442 isoform X1 [Cucumis sativus]0.0e+0090.28Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVHNG K +RGNG R +G+CFAGS+ST DGFSGWSESDSQGE LDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS QQELLLDS+TG DRLGEDEKED SV+ADDETLAG+ GNQEDSSS TENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+NVASLQEDFQSDSSL VTSVAPGSLSSLISPESEFD+NVASC KDVNN H G EVSTSEPEMNILKDEPDNLPNSNT
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGENYD  SKKLPVYDDSSSNY SGNQDETL PV+EITDSSLQGFS IS DT KESGLFD ETVA+SS+GV SPS+ EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQ +E  LSEAALVSI+DYP ADDQEKNHETI N TAAK ELQEI FSSAGVPAPLVSAAVKT PGKVL+PAVVDQV GQALAALQVLKVIE DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERER+ + LM ERASIESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEK GIKVVVDSDLREQESAGDTWLDSSKQF VEETT+RAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        KLKRM+ EVRG+SRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKN AIS+A RSA ELQQSTAELSLA+KEGAKRVVGDCREGVEK TQKFRTSYG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

XP_022140920.1 uncharacterized protein LOC111011467 isoform X1 [Momordica charantia]0.0e+0084.41Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MASTP TCSP SLQLRLALNCKNC KFPSVLVRARVRKLDPR+R+ C+PIV+NGA IER NG RRSG+CFA S ST DGFSGWSESDS  EVLDLRRK W
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGGLVGIGITGFILVSGITFAAWSI+KQNSSRQKPQMEALS QQELLLDSDTGNDRLGE+EKEDNSVNADD TLAG+TGN E+SSS TENE+ L+KN VG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHS-GAEVSTSEPEMNILKDEPDNLPNSN
        D VDV +L+ NDVESSSSNNDVNNVAS QEDFQSDS   VTSVA GSLSSL+    E DS+VAS  KD N+CH+ G EV  SEPEMNILKD PDN  NSN
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHS-GAEVSTSEPEMNILKDEPDNLPNSN

Query:  TNSLNLKTDIQDESPDTGENYDFSS--------KKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSK
        TNSLN KTDIQDE+PDT ENYDFSS        +KLP+YDDS+SN+NSGNQ E  GP+NEI+DSSL   S +SGDT KESG  DRETV ESS  VL+P+K
Subjt:  TNSLNLKTDIQDESPDTGENYDFSS--------KKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSK

Query:  TEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQV
        TE+LLSE T ST+EQQ+ERGLSEAA VS+T YP  D QEK+HETI NSTAAKPELQ ILFSSAGVPAPL SAA+KTLPGKVLVPAVVDQV GQAL+ALQV
Subjt:  TEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQV

Query:  LKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPES
        LKVIEA+VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSR+DI SS DEDQGP YFSPES
Subjt:  LKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPES

Query:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRI
        PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RI
Subjt:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRI

Query:  EAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKV
        EAESMAENAVAAH ALVAQVEKDINASFEK+LSIEREKV+AVEKMAEEAKQELERLRSERER+++ LM+E A+IESEMEVFSRLRNELEEQLQGLMSNKV
Subjt:  EAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKV

Query:  EVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEET
        EVS+EKERINKLRKEAEIENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+ GIKVVVDSDLREQESAGDTWLDSSKQF+V+ET
Subjt:  EVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEET

Query:  TDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQ
         DRAENLM+KLK M+ E+RGKS+++++KII+KIALL+SNLRQW+S +G++AEDLK VAIS+ASRS +ELQQSTAEL LALKEGAKRVVGDCREGVEKITQ
Subjt:  TDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQ

Query:  KFRTSYG
        KF+TSYG
Subjt:  KFRTSYG

XP_038892464.1 uncharacterized protein LOC120081550 isoform X1 [Benincasa hispida]0.0e+0089.28Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MASTP TCSPSSLQLRLALNCKNCGKFPSVLVRAR+RKLDPRLRVICHPIV+NGA++ERGNGL  +G+CFAGS+STADGFSGWSESDSQ +VLDL RKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
         GG VGIGITGFIL+SGITFAAWSINKQNSSRQKPQMEALS QQELLL SDTGND+LGED KE+N +NADDET  G+TGNQEDSSSCTENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        DSVDV ELAENDVESSSSNNDVN+V SLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSN+ASCFKDVNN HSGAEVSTSE EMNILKDEPDNLPNSNT
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI DE PDTGENYDFSSKKLP+YDDSSSNYNSGNQD+TLGPVNEITDSSLQ  S                               EQ LSE T
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        ASTIEQ++ERGLSEAALVS+TDYPSADDQEKNHE++ N TAAKPELQEILFSSAGVPAP+VSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIEADV 
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPL FSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLP ADRKMLHQVSGFID DKIHPDACPALVADLSVGEQGI+ALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VA HSALVAQVEKDINASFEKELSIEREKV+AVEKMAEEAKQELERLRS+RERDSITL+RERASIESEME+ SRLR+ELEEQL+GLMSNKVE+SFEKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQESA DTWLDSSKQFAVEETTDRAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        KLKRM+TEVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AEDLKNVAIS+A+RSA ELQQSTAELSLALKEGAKRVVGDCREGVEKITQKF+TSYG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

TrEMBL top hitse value%identityAlignment
A0A0A0L9T9 Uncharacterized protein0.0e+0090.28Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVHNG K +RGNG R +G+CFAGS+ST DGFSGWSESDSQGE LDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS QQELLLDS+TG DRLGEDEKED SV+ADDETLAG+ GNQEDSSS TENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+NVASLQEDFQSDSSL VTSVAPGSLSSLISPESEFD+NVASC KDVNN H G EVSTSEPEMNILKDEPDNLPNSNT
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGENYD  SKKLPVYDDSSSNY SGNQDETL PV+EITDSSLQGFS IS DT KESGLFD ETVA+SS+GV SPS+ EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQ +E  LSEAALVSI+DYP ADDQEKNHETI N TAAK ELQEI FSSAGVPAPLVSAAVKT PGKVL+PAVVDQV GQALAALQVLKVIE DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERER+ + LM ERASIESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEK GIKVVVDSDLREQESAGDTWLDSSKQF VEETT+RAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        KLKRM+ EVRG+SRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKN AIS+A RSA ELQQSTAELSLA+KEGAKRVVGDCREGVEK TQKFRTSYG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

A0A1S3AYZ8 uncharacterized protein LOC103484091 isoform X10.0e+0090.88Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVHNGAK +RGNGLR +G+CFAGS+STADGFSGWSESDSQGEVLDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGGLVGIGITGFILVSGITFAAWSINKQNSSRQK QMEALS QQELLLDS+TG DRLGEDEKEDNSV+ADDET AG+ GNQEDSSSCTENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+N ASLQEDFQSDSSLTVT+VAPGSLSS ISPESEFDSNVASC KDVNNCH G EVSTSEPEMN+LKDEPDN PNSN 
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGEN+D SSKKLPVYD+SSSNY SGNQDETLG VNEITDSSLQGFS +S DT KES LFD  TVA+S +GVLSPSK EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQQ+E GLSEAALVSITDYP ADDQE NHETI N TAAK ELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIE+DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER RDS+ LM ERAS+ESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQES GDTWLDSSKQF VEETTDRAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        KLKRM+ EVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKNVAIS+A+RSATELQQSTAELSLA+KEGAKRVVGDCREGVEKITQKFRTSYG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

A0A1S3AZ05 uncharacterized protein LOC103484091 isoform X20.0e+0090.78Show/hide
Query:  KWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNR
        ++  GLVGIGITGFILVSGITFAAWSINKQNSSRQK QMEALS QQELLLDS+TG DRLGEDEKEDNSV+ADDET AG+ GNQEDSSSCTENEETLNKNR
Subjt:  KWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNR

Query:  VGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNS
        VGD VDV ELAEN VESSSSNNDV+N ASLQEDFQSDSSLTVT+VAPGSLSS ISPESEFDSNVASC KDVNNCH G EVSTSEPEMN+LKDEPDN PNS
Subjt:  VGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNS

Query:  NTNSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSE
        N NSLNLKTDI+DE PDTGEN+D SSKKLPVYD+SSSNY SGNQDETLG VNEITDSSLQGFS +S DT KES LFD  TVA+S +GVLSPSK EQ  SE
Subjt:  NTNSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSE

Query:  DTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEAD
        D A +IEQQ+E GLSEAALVSITDYP ADDQE NHETI N TAAK ELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIE+D
Subjt:  DTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEAD

Query:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDL
        VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPLYFSPES LSRQDL
Subjt:  VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDL

Query:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAE
        VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAE
Subjt:  VSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAE

Query:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKE
        NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER RDS+ LM ERAS+ESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKE
Subjt:  NAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKE

Query:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENL
        RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQES GDTWLDSSKQF VEETTDRAENL
Subjt:  RINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENL

Query:  MEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        MEKLKRM+ EVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKNVAIS+A+RSATELQQSTAELSLA+KEGAKRVVGDCREGVEKITQKFRTSYG
Subjt:  MEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

A0A6J1CGI7 uncharacterized protein LOC111011467 isoform X10.0e+0084.41Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MASTP TCSP SLQLRLALNCKNC KFPSVLVRARVRKLDPR+R+ C+PIV+NGA IER NG RRSG+CFA S ST DGFSGWSESDS  EVLDLRRK W
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGGLVGIGITGFILVSGITFAAWSI+KQNSSRQKPQMEALS QQELLLDSDTGNDRLGE+EKEDNSVNADD TLAG+TGN E+SSS TENE+ L+KN VG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHS-GAEVSTSEPEMNILKDEPDNLPNSN
        D VDV +L+ NDVESSSSNNDVNNVAS QEDFQSDS   VTSVA GSLSSL+    E DS+VAS  KD N+CH+ G EV  SEPEMNILKD PDN  NSN
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHS-GAEVSTSEPEMNILKDEPDNLPNSN

Query:  TNSLNLKTDIQDESPDTGENYDFSS--------KKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSK
        TNSLN KTDIQDE+PDT ENYDFSS        +KLP+YDDS+SN+NSGNQ E  GP+NEI+DSSL   S +SGDT KESG  DRETV ESS  VL+P+K
Subjt:  TNSLNLKTDIQDESPDTGENYDFSS--------KKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSK

Query:  TEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQV
        TE+LLSE T ST+EQQ+ERGLSEAA VS+T YP  D QEK+HETI NSTAAKPELQ ILFSSAGVPAPL SAA+KTLPGKVLVPAVVDQV GQAL+ALQV
Subjt:  TEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQV

Query:  LKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPES
        LKVIEA+VEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSR+DI SS DEDQGP YFSPES
Subjt:  LKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPES

Query:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRI
        PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RI
Subjt:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRI

Query:  EAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKV
        EAESMAENAVAAH ALVAQVEKDINASFEK+LSIEREKV+AVEKMAEEAKQELERLRSERER+++ LM+E A+IESEMEVFSRLRNELEEQLQGLMSNKV
Subjt:  EAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKV

Query:  EVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEET
        EVS+EKERINKLRKEAEIENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+ GIKVVVDSDLREQESAGDTWLDSSKQF+V+ET
Subjt:  EVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEET

Query:  TDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQ
         DRAENLM+KLK M+ E+RGKS+++++KII+KIALL+SNLRQW+S +G++AEDLK VAIS+ASRS +ELQQSTAEL LALKEGAKRVVGDCREGVEKITQ
Subjt:  TDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQ

Query:  KFRTSYG
        KF+TSYG
Subjt:  KFRTSYG

A0A6J1HTP0 uncharacterized protein LOC111467385 isoform X10.0e+0081.08Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MASTP TCSP+SLQLRLALNCKN  KFP V VRA VRKLDPRLRVIC PIVHN AKI R NGLRR G+CFAGS S ADGFSGWSESDS  E L+LRRK W
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
          GLVGIGITGFILVSGITFAAWSINKQN S+QK QMEALS  QELLLDSD+GND+LGED+KEDNSVNADD+       N E+ SS TEN+ETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
           DV E + +DVE SS+N++VNNVA LQED QSDSSL VT VA GS        SE D ++ S  KDVN   SG EV TSEPEMN   DEPDN      
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLG-PVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSED
                    SP+T E YDFSS+KLPVYDDSSSNYNSG QDET   PVNEI DSSL  FS   GD  KE GL ++E V ES +GVL+P KTE+LLSE+
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLG-PVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSED

Query:  TASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADV
        TASTIEQ++ RGLS+AA VS+T YP ADDQE+NHET  NS AA+PELQ  LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIE++V
Subjt:  TASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADV

Query:  EPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLV
        EPS LCTRREYARWLVSAS ALSRNT SKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSR+DI SSLD+D+GP YFSPESPLSRQDLV
Subjt:  EPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLV

Query:  SWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAEN
        SWKMALEKRQLPEADRK LHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAEN
Subjt:  SWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAEN

Query:  AVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKER
        AVAAHSALVAQVEKDINASFEKELSIEREK +AVEKMAEEAKQELERLR E+ERD+I LMRERA+IESEMEV SRLRNELEEQLQGLMSNKVEVS+EKER
Subjt:  AVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKER

Query:  INKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLM
        INKLRKEAEIENQEISRLQYELEVERKALSMAR+WAE+EAKRAREQAKALEEARDRWEK GIKVVVDSDLREQESAGDTWLDSSKQFAVEET DRAENLM
Subjt:  INKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLM

Query:  EKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG
        +KLK M+TEVRGKS+D+IE IIQKIALL+SNLRQW+  +GEKAED+KNVAI++ASRSATELQQS+AE+ LALKEGAKRVVGDCREGVEKI+QKFRTSYG
Subjt:  EKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRTSYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25680.1 FUNCTIONS IN: molecular_function unknown8.4e-5535.66Show/hide
Query:  KVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS
        +V  P  VD    +A+A L+ LK+ E D+   +LCT+REYARWLV ++S L RN    + PA+ +   +  AFDDI   DPDF  IQ LAEAG+ SSKLS
Subjt:  KVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS

Query:  RNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQ
          D  +    D G   F+PES +SR DLV+WK  LE    PE   ++      +IDT  I+PD       D  +G++  I   FG  + FQP++PVTKAQ
Subjt:  RNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQ

Query:  AAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEME
        AA+AL +G+    ++ EL+R+EAES+++ A           + +I   +++++  ER +   +E++      E+E  ++ +E+ S   ++E+A+I+ + +
Subjt:  AAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEME

Query:  VFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE
        + + L  E++E  Q L+S+K     E  ++ ++  + + + + +   +  LE E +AL + R+W EDE K ++ +AK LEEA  RW+
Subjt:  VFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope3.1e-19846.64Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLD-LRRKK
        MAS   T +P+SLQLRLAL+     K P+V +R           ++C   V    +++   G          SKS+AD  +GW +SD+  +    +++K 
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLD-LRRKK

Query:  WFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRV
           G+VG G+ G IL  G+++AA S +K+    +K +M +L+ QQE ++ S   +D +  DE               +  N E+S+   E++        
Subjt:  WFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRV

Query:  GDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPN-S
          S++  ++A+   E S  +  +    S  +   +D +    S+   +  + +   +E D   A   K ++   S  + ST   E  +L  E  NL    
Subjt:  GDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPN-S

Query:  NTNSLNLKTDIQDESPDTG--ENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKES--GLFDRETVAESSDGVLSPSKT-E
        NTNS + ++ +  E  +    EN+  S K     +DS S+ +  +     G V E+ + S Q        T+K     L D ET   +++ +   + T E
Subjt:  NTNSLNLKTDIQDESPDTG--ENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKES--GLFDRETVAESSDGVLSPSKT-E

Query:  QLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPE--LQEI-----LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQAL
           + D +S  +    + L      S +  P + D  K+   I +         L EI      FSSAG+PAP +S  V   PGK+LVP   DQ+  QA 
Subjt:  QLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKPE--LQEI-----LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQAL

Query:  AALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLY
        AALQVLKVIE D +PSDLCTRREYARWL+SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDF+SIQGLAEAGLI+SKLS  D+   LD+ +G   
Subjt:  AALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLY

Query:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE
        FSPES LSRQDL+SWKMALEKRQLPEAD+KML+++SGFID DKI+PDA P+++ADLS GEQGI ALAFG TRLFQP KPVTK QAAIAL++GEASDIVSE
Subjt:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE

Query:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGL
        EL RIEAESMAE AV+AH+ALVA+VEKD+NASFEKELS+EREK+EAVEKMAE AK ELE+LR +RE +++ L++ERA++ESEMEV SRLR + EE+L+ L
Subjt:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGL

Query:  MSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLRE---QESAGDTWLDSS
        MSNK E++FEKER+  LRKEAE E+Q IS+LQYELEVERKALSMAR+WAE+EAK+AREQ +ALEEAR RWE +G++VVVD DL+E   +E+     L+  
Subjt:  MSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLRE---QESAGDTWLDSS

Query:  KQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCR
        ++ +VEET  RA+ LM+KLK M+  V GKSR+VI  +++KI L ++ L+++    G++A ++++ AI +A  +A +++Q T ++S    +  K++  +CR
Subjt:  KQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCR

Query:  EGVEKITQKFRT
        +GV KI+Q+F+T
Subjt:  EGVEKITQKFRT

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119)6.7e-16163.73Show/hide
Query:  AALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLY
        AALQ LKVIE+D  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS N++ SS   +   + 
Subjt:  AALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLY

Query:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE
        FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V E
Subjt:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE

Query:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGL
        EL RIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E V+AVEK+AEEAK EL RLR E+E +++ L RER SIE+EME  +R+RNELEEQLQ L
Subjt:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGL

Query:  MSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQ
         SNK E+S+EKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWEK+G+KV+VDSDL EQ +  + TWL++ KQ
Subjt:  MSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQ

Query:  FAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
          VE T  RA NL+ KLK+M+ +V  KSR+VI  II+KI+LL+S L+Q +     KA+DLK    SKA     +      E+       AK  V + ++ 
Subjt:  FAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

Query:  VEKITQKFRT
        V K+ +KF++
Subjt:  VEKITQKFRT

AT5G52410.2 INVOLVED IN: biological_process unknown9.9e-16555.06Show/hide
Query:  TLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKP
        T   +++I+    +G    S D  +    F    V+  S  VLSP + +  ++  + ST +   E    +A++ S   Y S+ +     + +   T+  P
Subjt:  TLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSTAAKP

Query:  ELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD
          ++      G+PAP     V +L  K + P VVD V  Q  AALQ LKVIE+D  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFD
Subjt:  ELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD

Query:  DITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSV
        DITPEDPDF  IQGLAEAGLISSKLS N++ SS   +   + FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS 
Subjt:  DITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSV

Query:  GEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQEL
        GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V EEL RIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E V+AVEK+AEEAK EL
Subjt:  GEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQEL

Query:  ERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRARE
         RLR E+E +++ L RER SIE+EME  +R+RNELEEQLQ L SNK E+S+EKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RARE
Subjt:  ERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRARE

Query:  QAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAE
        QAK LEEAR RWEK+G+KV+VDSDL EQ +  + TWL++ KQ  VE T  RA NL+ KLK+M+ +V  KSR+VI  II+KI+LL+S L+Q +     KA+
Subjt:  QAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAE

Query:  DLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRT
        DLK    SKA     +      E+       AK  V + ++ V K+ +KF++
Subjt:  DLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKITQKFRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTCCTCCCACGTGTTCACCCAGCTCTCTCCAGCTTCGCCTCGCTCTCAATTGCAAGAATTGCGGCAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAG
GAAATTGGATCCTCGACTCCGCGTAATTTGTCACCCTATTGTTCATAATGGCGCCAAAATCGAGAGAGGAAATGGACTGCGTCGCAGTGGACTTTGTTTTGCTGGATCGA
AGTCGACGGCTGATGGATTCTCTGGGTGGTCGGAATCGGATTCCCAGGGGGAGGTTTTGGACTTGCGGAGAAAGAAGTGGTTTGGAGGATTGGTGGGGATTGGAATTACT
GGATTCATCCTTGTCTCCGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTTCCAGACAAAAGCCGCAAATGGAGGCCTTAAGTATGCAGCAAGAATTATT
GTTGGACTCTGATACTGGAAATGATAGGCTTGGTGAAGATGAAAAGGAAGATAACAGTGTGAATGCAGATGATGAGACTCTCGCTGGTAGAACAGGTAACCAAGAGGACT
CTTCTTCATGTACAGAAAATGAAGAAACTCTCAACAAAAATAGGGTTGGTGACAGTGTTGATGTTGGGGAGTTAGCAGAAAATGATGTTGAATCTTCATCCAGTAATAAC
GATGTCAATAATGTTGCTTCCTTGCAAGAAGATTTCCAATCTGATTCCTCATTAACTGTTACATCAGTTGCTCCCGGAAGTTTGAGTTCACTTATCTCACCTGAATCTGA
GTTTGATTCTAATGTAGCTTCTTGTTTTAAAGATGTAAACAACTGTCATTCTGGTGCAGAAGTCTCAACTTCTGAACCTGAAATGAATATATTGAAAGATGAACCAGACA
ACTTGCCTAACTCTAATACTAATTCATTAAACCTTAAAACTGATATTCAGGATGAAAGTCCTGACACTGGTGAAAATTATGATTTCAGCTCCAAGAAGTTACCAGTTTAT
GATGATAGTTCATCAAACTATAATTCTGGCAATCAGGATGAGACACTTGGTCCTGTAAATGAAATTACAGATTCTTCATTGCAAGGATTTTCTAGAATATCTGGCGACAC
AACCAAGGAATCAGGATTATTTGACAGAGAGACTGTGGCTGAATCATCTGACGGAGTTCTGAGCCCCAGCAAAACTGAACAGCTCTTATCTGAGGATACTGCATCAACCA
TAGAACAGCAAGTAGAAAGAGGATTATCTGAAGCAGCATTAGTCTCTATCACAGATTATCCATCGGCAGATGATCAAGAGAAAAATCATGAAACTATCAAGAATAGTACT
GCTGCCAAACCGGAACTACAAGAGATTTTATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGTTAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAGT
AGTGGATCAGGTTCATGGGCAGGCATTGGCAGCACTGCAAGTTTTAAAGGTGATAGAGGCTGACGTTGAACCTAGTGATCTATGTACTCGTCGGGAATATGCTCGTTGGC
TGGTGTCTGCAAGCAGTGCTCTTTCAAGGAACACAACATCCAAAGTATACCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCTTTTGATGATATTACACCCGAAGAC
CCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCAAGCAAGCTTTCGAGAAATGATATTTCTTCTTCATTGGACGAAGACCAGGGTCCTTTATATTT
CTCTCCCGAAAGCCCCCTATCACGTCAAGATCTCGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTTCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGAT
TTATTGATACTGATAAGATCCATCCAGATGCTTGTCCTGCACTTGTTGCTGATCTTTCTGTAGGAGAACAAGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTTC
CAGCCAGATAAGCCTGTAACGAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCAGATATAGTAAGTGAGGAGCTTACAAGGATTGAAGCTGAATCAATGGC
AGAAAATGCTGTTGCTGCGCATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATAAATGCTAGCTTTGAGAAAGAACTTTCCATCGAGAGAGAAAAGGTTGAGGCTGTGG
AGAAAATGGCAGAAGAGGCCAAGCAAGAATTGGAAAGATTAAGATCTGAAAGAGAGAGAGATAGTATCACCTTGATGAGGGAACGCGCTTCTATTGAATCGGAAATGGAA
GTTTTTTCAAGGTTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAGGTATCCTTTGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGC
TGAAATTGAAAATCAGGAGATTTCCCGCCTGCAGTATGAGCTTGAAGTTGAGAGAAAGGCGCTTTCCATGGCCAGAGCTTGGGCCGAGGACGAAGCAAAAAGAGCAAGAG
AACAAGCAAAAGCACTCGAAGAGGCTAGGGATCGCTGGGAAAAGCATGGCATCAAAGTAGTCGTTGACAGCGATCTTCGTGAACAAGAATCTGCTGGCGATACCTGGCTT
GATTCTAGCAAACAGTTTGCAGTCGAAGAAACCACGGACAGGGCCGAGAACTTAATGGAGAAGCTGAAAAGAATGTCTACAGAAGTAAGAGGGAAATCGAGAGATGTAAT
TGAGAAGATCATCCAGAAGATAGCTTTGCTAGTATCAAACTTGAGACAATGGATTTCCAAAAGTGGAGAAAAGGCTGAAGATCTAAAGAATGTGGCGATTTCAAAGGCAA
GTAGATCAGCAACAGAGTTGCAACAGAGCACTGCAGAGTTGAGCTTGGCCCTGAAGGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTACC
CAAAAGTTCAGAACATCTTATGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCACTCCTCCCACGTGTTCACCCAGCTCTCTCCAGCTTCGCCTCGCTCTCAATTGCAAGAATTGCGGCAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAG
GAAATTGGATCCTCGACTCCGCGTAATTTGTCACCCTATTGTTCATAATGGCGCCAAAATCGAGAGAGGAAATGGACTGCGTCGCAGTGGACTTTGTTTTGCTGGATCGA
AGTCGACGGCTGATGGATTCTCTGGGTGGTCGGAATCGGATTCCCAGGGGGAGGTTTTGGACTTGCGGAGAAAGAAGTGGTTTGGAGGATTGGTGGGGATTGGAATTACT
GGATTCATCCTTGTCTCCGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTTCCAGACAAAAGCCGCAAATGGAGGCCTTAAGTATGCAGCAAGAATTATT
GTTGGACTCTGATACTGGAAATGATAGGCTTGGTGAAGATGAAAAGGAAGATAACAGTGTGAATGCAGATGATGAGACTCTCGCTGGTAGAACAGGTAACCAAGAGGACT
CTTCTTCATGTACAGAAAATGAAGAAACTCTCAACAAAAATAGGGTTGGTGACAGTGTTGATGTTGGGGAGTTAGCAGAAAATGATGTTGAATCTTCATCCAGTAATAAC
GATGTCAATAATGTTGCTTCCTTGCAAGAAGATTTCCAATCTGATTCCTCATTAACTGTTACATCAGTTGCTCCCGGAAGTTTGAGTTCACTTATCTCACCTGAATCTGA
GTTTGATTCTAATGTAGCTTCTTGTTTTAAAGATGTAAACAACTGTCATTCTGGTGCAGAAGTCTCAACTTCTGAACCTGAAATGAATATATTGAAAGATGAACCAGACA
ACTTGCCTAACTCTAATACTAATTCATTAAACCTTAAAACTGATATTCAGGATGAAAGTCCTGACACTGGTGAAAATTATGATTTCAGCTCCAAGAAGTTACCAGTTTAT
GATGATAGTTCATCAAACTATAATTCTGGCAATCAGGATGAGACACTTGGTCCTGTAAATGAAATTACAGATTCTTCATTGCAAGGATTTTCTAGAATATCTGGCGACAC
AACCAAGGAATCAGGATTATTTGACAGAGAGACTGTGGCTGAATCATCTGACGGAGTTCTGAGCCCCAGCAAAACTGAACAGCTCTTATCTGAGGATACTGCATCAACCA
TAGAACAGCAAGTAGAAAGAGGATTATCTGAAGCAGCATTAGTCTCTATCACAGATTATCCATCGGCAGATGATCAAGAGAAAAATCATGAAACTATCAAGAATAGTACT
GCTGCCAAACCGGAACTACAAGAGATTTTATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGTTAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAGT
AGTGGATCAGGTTCATGGGCAGGCATTGGCAGCACTGCAAGTTTTAAAGGTGATAGAGGCTGACGTTGAACCTAGTGATCTATGTACTCGTCGGGAATATGCTCGTTGGC
TGGTGTCTGCAAGCAGTGCTCTTTCAAGGAACACAACATCCAAAGTATACCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCTTTTGATGATATTACACCCGAAGAC
CCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCAAGCAAGCTTTCGAGAAATGATATTTCTTCTTCATTGGACGAAGACCAGGGTCCTTTATATTT
CTCTCCCGAAAGCCCCCTATCACGTCAAGATCTCGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTTCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGAT
TTATTGATACTGATAAGATCCATCCAGATGCTTGTCCTGCACTTGTTGCTGATCTTTCTGTAGGAGAACAAGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTTC
CAGCCAGATAAGCCTGTAACGAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCAGATATAGTAAGTGAGGAGCTTACAAGGATTGAAGCTGAATCAATGGC
AGAAAATGCTGTTGCTGCGCATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATAAATGCTAGCTTTGAGAAAGAACTTTCCATCGAGAGAGAAAAGGTTGAGGCTGTGG
AGAAAATGGCAGAAGAGGCCAAGCAAGAATTGGAAAGATTAAGATCTGAAAGAGAGAGAGATAGTATCACCTTGATGAGGGAACGCGCTTCTATTGAATCGGAAATGGAA
GTTTTTTCAAGGTTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAGGTATCCTTTGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGC
TGAAATTGAAAATCAGGAGATTTCCCGCCTGCAGTATGAGCTTGAAGTTGAGAGAAAGGCGCTTTCCATGGCCAGAGCTTGGGCCGAGGACGAAGCAAAAAGAGCAAGAG
AACAAGCAAAAGCACTCGAAGAGGCTAGGGATCGCTGGGAAAAGCATGGCATCAAAGTAGTCGTTGACAGCGATCTTCGTGAACAAGAATCTGCTGGCGATACCTGGCTT
GATTCTAGCAAACAGTTTGCAGTCGAAGAAACCACGGACAGGGCCGAGAACTTAATGGAGAAGCTGAAAAGAATGTCTACAGAAGTAAGAGGGAAATCGAGAGATGTAAT
TGAGAAGATCATCCAGAAGATAGCTTTGCTAGTATCAAACTTGAGACAATGGATTTCCAAAAGTGGAGAAAAGGCTGAAGATCTAAAGAATGTGGCGATTTCAAAGGCAA
GTAGATCAGCAACAGAGTTGCAACAGAGCACTGCAGAGTTGAGCTTGGCCCTGAAGGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTACC
CAAAAGTTCAGAACATCTTATGGTTAA
Protein sequenceShow/hide protein sequence
MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGIT
GFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVGDSVDVGELAENDVESSSSNN
DVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNTNSLNLKTDIQDESPDTGENYDFSSKKLPVY
DDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNST
AAKPELQEILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPED
PDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLF
QPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEME
VFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWL
DSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREGVEKIT
QKFRTSYG