; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020026 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020026
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 6 growth stages;
Genome locationchr5:47473061..47477756
RNA-Seq ExpressionLag0020026
SyntenyLag0020026
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439251.1 PREDICTED: uncharacterized protein LOC103484091 isoform X1 [Cucumis melo]0.0e+0083.82Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVH     +R +GLR  GVCFAGS+   DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
        KWFGGLVGIG+TGFILVSGITFAAWSINKQNSSRQK QMEALSTQQE LLDS+TG D+LGEDEKED +V ADD   A K GN E+SSS TENEETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DVEEL+ N VESS SN    N ASLQED QSDSSL VT+VA GSLSS ISPESE DSNVA+  KDVNNCH G EV TSEPEMN+LKDEPDNSPNS
Subjt:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
        N NSLNLKT I DE PDT EN+D SS+KL VYD+ SSNY S NQDET G  VNE TDSSL  FSS+S DT+KES L D  TVA+S +GV++PSK EQF S
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQQ   GLSEAA VS+T +PLADDQE NHETIMN +AAK ELQ ILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLC+RREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTD+IHPDACPA+VADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVE+MAEEAKQELERLRSER  D +ALM ERA++ESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQF VEET DRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RGKS+D+I+KIIQKIALLVSNLRQW+S TGEQAE+LKNVAI+RA+RSA+ELQQSTAEL LA+K+GAKRVVGDCREGVEKI+QKFRTSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

XP_011651163.1 uncharacterized protein LOC101215442 isoform X1 [Cucumis sativus]0.0e+0084.12Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVH     +R +G R  GVCFAGS+   DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
        KWFGG VGIG+TGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQE LLDS+TG D+LGEDEKED +V ADD  LA K GN E+SSSYTENEETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DVEEL+ N VESS SN    NVASLQED QSDSSL VTSVA GSLSSLISPESE D+NVA+  KDVNN H G EV TSEPEMNILKDEPDN PNS
Subjt:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
        NTNSLNLKT I DE PDT ENYD  S+KL VYDD SSNY S NQDET   PV+E TDSSL  FSSIS DT+KESGL D ETVA+SS+GV +PS+ EQF S
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQ     LSEAA VS++ +PLADDQEKNHETIMN +AAK ELQ I FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIEV
Subjt:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLC+RREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTD+IHPDACPALVADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVEKMAEEAKQELERLRSERE + +ALM ERA+IESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQF VEET +RAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RG+S+D+I+KIIQKIALLVSNLRQW+S TGEQAE+LKN AI+RA RSA ELQQSTAEL LA+K+GAKRVVGDCREGVEK +QKFRTSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

XP_022140920.1 uncharacterized protein LOC111011467 isoform X1 [Momordica charantia]0.0e+0084.36Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MAST ATCSP SLQLRLALNCKNC KFPSVLVRARVRKLDPR+R+ C+PIV+     ERA+G RR+GVCFA SD   DGFSGWSESDSG  EE LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
         WFGGLVGIG+TGFILVSGITFAAWSI+KQNSSRQKPQMEALSTQQE LLDSDTGND+LGE+EKED +V ADD  LA KTGNHEESSSYTENE+ L+KN 
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GAEVLTSEPEMNILKDEPDNSPN
        VG   DVE+LSGNDVESS S    NNVAS QED QSDS  AVTSVA+GSLSSL+    E DS+VA+ SKD N+CHA G EVL SEPEMNILKD PDNS N
Subjt:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GAEVLTSEPEMNILKDEPDNSPN

Query:  SNTNSLNLKTGIPDETPDTSENYDFSS--------EKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMN
        SNTNSLN KT I DETPDTSENYDFSS        EKL +YDD +SN+NS NQ E PG P+NE +DSSLHE SS+SGDT+KESG VD+ETV ESSK V+N
Subjt:  SNTNSLNLKTGIPDETPDTSENYDFSS--------EKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMN

Query:  PSKTEQFLSEETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA
        P+KTE+ LSE T STLEQQ   GLSEAAFVSVTA+PL D QEK+HETIMNS+AAKPELQGILFSSAGVPAPL SAA+KTLPGKVLVPAVVDQVQGQAL+A
Subjt:  PSKTEQFLSEETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA

Query:  LQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS
        LQVLKVIE +VEP DLC+RREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SS DEDQGPF FS
Subjt:  LQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS

Query:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL
        PESPLSRQDLVSWKMALEKRQLPEADRK LHQVSGFIDTD+IHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL
Subjt:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL

Query:  TRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMS
         RIEAESMAENAVAAH ALVAQVEKDINASFEK+LSIEREKVDAVEKMAEEAKQELERLRSERE + +ALM+E AAIESEMEV SRLRNELEEQLQGLMS
Subjt:  TRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMS

Query:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV
        NKVEVSYEKERINKLRKEAEIENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+RGIKVVVDSDLREQESAGDTWLDSSKQF+V
Subjt:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV

Query:  EETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK
        +ETVDRAENLMDKLK MA ELRGKSK+I+DKII+KIALL+SNLRQW+S+ G+QAEDLK VAI+RASRS SELQQSTAEL LALK+GAKRVVGDCREGVEK
Subjt:  EETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK

Query:  ISQKFRTSYG
        I+QKF+TSYG
Subjt:  ISQKFRTSYG

XP_022968011.1 uncharacterized protein LOC111467385 isoform X1 [Cucurbita maxima]0.0e+0082.12Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSP+SLQLRLALNCKN  KFP V VRA VRKLDPRLRVIC PIVH      R +GLRR+G+CFAGSD K DGFSGWSESDSGEE+  L+LRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
         W  GLVGIG+TGFILVSGITFAAWSINKQN S+QK QMEALST QE LLDSD+GNDKLGED+KED +V ADD        NHEE SSYTEN+ETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DVEE SG+DVE S +    NNVA LQED QSDSSLAVT VA GS        SEID ++ + SKDVN   +G EVLTSEPEMN   DEPDNS   
Subjt:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
                       P+TSE YDFSSEKL VYDD SSNYNS  QDET  PPVNE  DSSLHEFS+  GD +KE GLV+KE V ES +GV+NP KTE+ LS
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        EETAST+EQ   +GLS+AAFVSVTA+PLADDQE+NHET MNS AA+PELQG LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        +VEP  LC+RREYARWLVSAS ALSRNT SKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SSLD+D+GPF FSPESPLSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLPEADRKTLHQVSGFIDTD+IHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREK DAVEKMAEEAKQELERLR E+E D IALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMAR+WAE+EAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LMDKLK MATE+RGKSKDII+ IIQKIALL+SNLRQW+ N GE+AED+KNVAIARASRSA+ELQQS+AE+GLALK+GAKRVVGDCREGVEKISQKFRTSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

XP_038892464.1 uncharacterized protein LOC120081550 isoform X1 [Benincasa hispida]0.0e+0082.32Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSPSSLQLRLALNCKNCGKFPSVLVRAR+RKLDPRLRVICHPIV+     ER +GL   GVCFAGS+   DGFSGWSESDS  +E+ LDL RK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
        KW GG VGIG+TGFIL+SGITFAAWSINKQNSSRQKPQMEALSTQQE LL SDTGNDKLGED KE+  + ADD     KTGN E+SSS TENEETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPSNN----VASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DV+EL+ NDVESS SNN    V SLQED QSDSSL VTSVA GSLSSLISPESE DSN+A+  KDVNN H+GAEV TSE EMNILKDEPDN PNS
Subjt:  VG---DVEELSGNDVESSPSNN----VASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
        NTNSLNLKT I DE PDT ENYDFSS+KL +YDD SSNYNS NQD+T G PVNE TDSSL E S                               EQFLS
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E TAST+EQ   +GLSEAA VSVT +P ADDQEKNHE++MN +AAKPELQ ILFSSAGVPAP+VSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DV P DLC+RREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLP ADRK LHQVSGFID D+IHPDACPALVADLSVGEQGI+ALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVA HSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRS+RE D I L+RERA+IESEME+LSRLR+ELEEQL+GLMSNKVE+S+EK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESA DTWLDSSKQFAVEET DRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MATE+RGKS+D+I+KIIQKIALLVSNLRQW+S TGEQAEDLKNVAI+RA+RSA ELQQSTAEL LALK+GAKRVVGDCREGVEKI+QKF+TSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A0A0L9T9 Uncharacterized protein0.0e+0084.12Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVH     +R +G R  GVCFAGS+   DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
        KWFGG VGIG+TGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQE LLDS+TG D+LGEDEKED +V ADD  LA K GN E+SSSYTENEETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DVEEL+ N VESS SN    NVASLQED QSDSSL VTSVA GSLSSLISPESE D+NVA+  KDVNN H G EV TSEPEMNILKDEPDN PNS
Subjt:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
        NTNSLNLKT I DE PDT ENYD  S+KL VYDD SSNY S NQDET   PV+E TDSSL  FSSIS DT+KESGL D ETVA+SS+GV +PS+ EQF S
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQ     LSEAA VS++ +PLADDQEKNHETIMN +AAK ELQ I FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIEV
Subjt:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLC+RREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTD+IHPDACPALVADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVEKMAEEAKQELERLRSERE + +ALM ERA+IESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQF VEET +RAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RG+S+D+I+KIIQKIALLVSNLRQW+S TGEQAE+LKN AI+RA RSA ELQQSTAEL LA+K+GAKRVVGDCREGVEK +QKFRTSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

A0A1S3AYZ8 uncharacterized protein LOC103484091 isoform X10.0e+0083.82Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVH     +R +GLR  GVCFAGS+   DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
        KWFGGLVGIG+TGFILVSGITFAAWSINKQNSSRQK QMEALSTQQE LLDS+TG D+LGEDEKED +V ADD   A K GN E+SSS TENEETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DVEEL+ N VESS SN    N ASLQED QSDSSL VT+VA GSLSS ISPESE DSNVA+  KDVNNCH G EV TSEPEMN+LKDEPDNSPNS
Subjt:  VG---DVEELSGNDVESSPSN----NVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
        N NSLNLKT I DE PDT EN+D SS+KL VYD+ SSNY S NQDET G  VNE TDSSL  FSS+S DT+KES L D  TVA+S +GV++PSK EQF S
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQQ   GLSEAA VS+T +PLADDQE NHETIMN +AAK ELQ ILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLC+RREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTD+IHPDACPA+VADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVE+MAEEAKQELERLRSER  D +ALM ERA++ESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQF VEET DRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RGKS+D+I+KIIQKIALLVSNLRQW+S TGEQAE+LKNVAI+RA+RSA+ELQQSTAEL LA+K+GAKRVVGDCREGVEKI+QKFRTSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

A0A6J1CGI7 uncharacterized protein LOC111011467 isoform X10.0e+0084.36Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MAST ATCSP SLQLRLALNCKNC KFPSVLVRARVRKLDPR+R+ C+PIV+     ERA+G RR+GVCFA SD   DGFSGWSESDSG  EE LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
         WFGGLVGIG+TGFILVSGITFAAWSI+KQNSSRQKPQMEALSTQQE LLDSDTGND+LGE+EKED +V ADD  LA KTGNHEESSSYTENE+ L+KN 
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GAEVLTSEPEMNILKDEPDNSPN
        VG   DVE+LSGNDVESS S    NNVAS QED QSDS  AVTSVA+GSLSSL+    E DS+VA+ SKD N+CHA G EVL SEPEMNILKD PDNS N
Subjt:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GAEVLTSEPEMNILKDEPDNSPN

Query:  SNTNSLNLKTGIPDETPDTSENYDFSS--------EKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMN
        SNTNSLN KT I DETPDTSENYDFSS        EKL +YDD +SN+NS NQ E PG P+NE +DSSLHE SS+SGDT+KESG VD+ETV ESSK V+N
Subjt:  SNTNSLNLKTGIPDETPDTSENYDFSS--------EKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMN

Query:  PSKTEQFLSEETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA
        P+KTE+ LSE T STLEQQ   GLSEAAFVSVTA+PL D QEK+HETIMNS+AAKPELQGILFSSAGVPAPL SAA+KTLPGKVLVPAVVDQVQGQAL+A
Subjt:  PSKTEQFLSEETASTLEQQ---GLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA

Query:  LQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS
        LQVLKVIE +VEP DLC+RREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SS DEDQGPF FS
Subjt:  LQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS

Query:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL
        PESPLSRQDLVSWKMALEKRQLPEADRK LHQVSGFIDTD+IHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL
Subjt:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL

Query:  TRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMS
         RIEAESMAENAVAAH ALVAQVEKDINASFEK+LSIEREKVDAVEKMAEEAKQELERLRSERE + +ALM+E AAIESEMEV SRLRNELEEQLQGLMS
Subjt:  TRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMS

Query:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV
        NKVEVSYEKERINKLRKEAEIENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+RGIKVVVDSDLREQESAGDTWLDSSKQF+V
Subjt:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV

Query:  EETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK
        +ETVDRAENLMDKLK MA ELRGKSK+I+DKII+KIALL+SNLRQW+S+ G+QAEDLK VAI+RASRS SELQQSTAEL LALK+GAKRVVGDCREGVEK
Subjt:  EETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK

Query:  ISQKFRTSYG
        I+QKF+TSYG
Subjt:  ISQKFRTSYG

A0A6J1HSB9 uncharacterized protein LOC111467385 isoform X20.0e+0081.52Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSP+SLQLRLALNCKN  KFP V VRA VRKLDPRLRVIC PIVH      R +GLRR+G+CFAGSD K DGFSGWSESDSGEE+  L+LRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
         W  GLVGIG+TGFILVSGITFAAWSINKQN S+QK QMEALST QE LLDSD+GNDKLGED+KED +V ADD        NHEE SSYTEN+ETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DVEE SG+DVE S +    NNVA LQED QSDSSLAVT VA GS        SEID ++ + SKDVN   +G EVLTSEPEMN   DEPDNS   
Subjt:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
                       P+TSE YDFSSEKL VYDD SSNYNS  QDET  PPVNE  DSSLHE            GLV+KE V ES +GV+NP KTE+ LS
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        EETAST+EQ   +GLS+AAFVSVTA+PLADDQE+NHET MNS AA+PELQG LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        +VEP  LC+RREYARWLVSAS ALSRNT SKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SSLD+D+GPF FSPESPLSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLPEADRKTLHQVSGFIDTD+IHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREK DAVEKMAEEAKQELERLR E+E D IALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMAR+WAE+EAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LMDKLK MATE+RGKSKDII+ IIQKIALL+SNLRQW+ N GE+AED+KNVAIARASRSA+ELQQS+AE+GLALK+GAKRVVGDCREGVEKISQKFRTSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

A0A6J1HTP0 uncharacterized protein LOC111467385 isoform X10.0e+0082.12Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSP+SLQLRLALNCKN  KFP V VRA VRKLDPRLRVIC PIVH      R +GLRR+G+CFAGSD K DGFSGWSESDSGEE+  L+LRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK
         W  GLVGIG+TGFILVSGITFAAWSINKQN S+QK QMEALST QE LLDSD+GNDKLGED+KED +V ADD        NHEE SSYTEN+ETLNKN+
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNK

Query:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS
        VG   DVEE SG+DVE S +    NNVA LQED QSDSSLAVT VA GS        SEID ++ + SKDVN   +G EVLTSEPEMN   DEPDNS   
Subjt:  VG---DVEELSGNDVESSPS----NNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS
                       P+TSE YDFSSEKL VYDD SSNYNS  QDET  PPVNE  DSSLHEFS+  GD +KE GLV+KE V ES +GV+NP KTE+ LS
Subjt:  NTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLS

Query:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        EETAST+EQ   +GLS+AAFVSVTA+PLADDQE+NHET MNS AA+PELQG LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQ---QGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        +VEP  LC+RREYARWLVSAS ALSRNT SKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SSLD+D+GPF FSPESPLSRQD
Subjt:  DVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA
        LVSWKMALEKRQLPEADRKTLHQVSGFIDTD+IHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL RIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREK DAVEKMAEEAKQELERLR E+E D IALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMAR+WAE+EAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LMDKLK MATE+RGKSKDII+ IIQKIALL+SNLRQW+ N GE+AED+KNVAIARASRSA+ELQQS+AE+GLALK+GAKRVVGDCREGVEKISQKFRTSY
Subjt:  LMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25680.1 FUNCTIONS IN: molecular_function unknown1.4e-5435.92Show/hide
Query:  KVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS
        +V  P  VD  Q +A+A L+ LK+ E D+   +LC++REYARWLV ++S L RN    + PA+ +   +  AFDDI   DPDF  IQ LAEAG+ SSKLS
Subjt:  KVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS

Query:  RHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQ
          D       D G  +F+PES +SR DLV+WK  LE    PE   +       +IDT  I+PD       D  +G++  I   FG  + FQP +PVTKAQ
Subjt:  RHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQ

Query:  AAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEME
        AA+AL +G+    ++ EL+R+EAES+++ A           + +I   +++++  ER +   +E++      E+E  ++ +E      ++E+AAI+ + +
Subjt:  AAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEME

Query:  VLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE
        +L+ L  E++E  Q L+S+K     E  ++ ++  + + + + +   +  LE E +AL + R+W EDE K ++ +AK LEEA  RW+
Subjt:  VLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope6.5e-20147.05Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHERASGLRRNGVCFAGSD---LKVDGFSGWSESDSGEEEEALDLRRKKW
        MAS +AT +P+SLQLRLAL+     K P+V +R           ++C           +     + GSD      D  +GW +SD+ +++++  +++K  
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHERASGLRRNGVCFAGSD---LKVDGFSGWSESDSGEEEEALDLRRKKW

Query:  FGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNKVG
          G+VG GV G IL  G+++AA S +K+    +K +M +L++QQES++ S   +D++  DE               K  N EES+   E++   + +   
Subjt:  FGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNKVG

Query:  DVEELSGNDVESSPSNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPN-SNTNSLNLK
          +E SG D          S  +   +D + A  S+   +  + +   +E D   A   K ++       +L S  E  +L  E  N     NTNS + +
Subjt:  DVEELSGNDVESSPSNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPN-SNTNSLNLK

Query:  TGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKES--GLVDKETVAESSKGVMNPSKTEQFLSEETAST
        + +  E  + S   D  +   S  +D  S+ +  +     G      T + L E SS    TSK     L D ET   +++ +   + T ++      S+
Subjt:  TGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSSNQDETPGPPVNEFTDSSLHEFSSISGDTSKES--GLVDKETVAESSKGVMNPSKTEQFLSEETAST

Query:  LEQQGLSEAAFVSVTAHPLADDQEKNHETIM-------NSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDV
        +     ++    S +  P + D  K+   I        N    +    G  FSSAG+PAP +S  V   PGK+LVP   DQ+Q QA AALQVLKVIE D 
Subjt:  LEQQGLSEAAFVSVTAHPLADDQEKNHETIM-------NSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDV

Query:  EPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLV
        +P DLC+RREYARWL+SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDF+SIQGLAEAGLI+SKLS  D+   LD+ +G F FSPES LSRQDL+
Subjt:  EPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLV

Query:  SWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMAEN
        SWKMALEKRQLPEAD+K L+++SGFID D+I+PDA P+++ADLS GEQGI ALAFG TRLFQP KPVTK QAAIAL++GEASDIVSEEL RIEAESMAE 
Subjt:  SWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMAEN

Query:  AVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKER
        AV+AH+ALVA+VEKD+NASFEKELS+EREK++AVEKMAE AK ELE+LR +RE + +AL++ERAA+ESEMEVLSRLR + EE+L+ LMSNK E+++EKER
Subjt:  AVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKER

Query:  INKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGDTWLDSSKQFAVEETVDRAE
        +  LRKEAE E+Q IS+LQYELEVERKALSMAR+WAE+EAK+AREQ +ALEEAR RWE  G++VVVD DL+E   +E+     L+  ++ +VEET  RA+
Subjt:  INKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGDTWLDSSKQFAVEETVDRAE

Query:  NLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRT
         LMDKLK MA  + GKS+++I  +++KI L ++ L+++  N G++A ++++ AI RA  +A++++Q T +    + D  K++  +CR+GV KISQ+F+T
Subjt:  NLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRT

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119)1.4e-15862.75Show/hide
Query:  AALQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFD
        AALQ LKVIE D  P DLC+RRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS +++ SS   +     
Subjt:  AALQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFD

Query:  FSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSE
        FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D D+I+P+A PAL+ADLS GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V E
Subjt:  FSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSE

Query:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGL
        EL RIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E VDAVEK+AEEAK EL RLR E+E + +AL RER +IE+EME L+R+RNELEEQLQ L
Subjt:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQLQGL

Query:  MSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQ
         SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWEK G+KV+VDSDL EQ +  + TWL++ KQ
Subjt:  MSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQ

Query:  FAVEETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREG
          VE T+ RA NL+ KLK MA ++  KS+++I  II+KI+LL+S L+Q +     +A+DLK    ++A     +      E+       AK  V + ++ 
Subjt:  FAVEETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREG

Query:  VEKISQKFRT
        V K+ +KF++
Subjt:  VEKISQKFRT

AT5G52410.2 INVOLVED IN: biological_process unknown1.4e-16361.62Show/hide
Query:  GVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFA
        G+PAP     V +L  K + P VVD VQ Q  AALQ LKVIE D  P DLC+RRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF 
Subjt:  GVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFA

Query:  SIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAF
         IQGLAEAGLISSKLS +++ SS   +     FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D D+I+P+A PAL+ADLS GE GI AL+F
Subjt:  SIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAF

Query:  GYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESD
        G TRLFQP K VTKAQ A++LA G+A ++V EEL RIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E VDAVEK+AEEAK EL RLR E+E +
Subjt:  GYTRLFQPEKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESD

Query:  KIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARD
         +AL RER +IE+EME L+R+RNELEEQLQ L SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR 
Subjt:  KIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARD

Query:  RWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARA
        RWEK G+KV+VDSDL EQ +  + TWL++ KQ  VE T+ RA NL+ KLK MA ++  KS+++I  II+KI+LL+S L+Q +     +A+DLK    ++A
Subjt:  RWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETVDRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARA

Query:  SRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRT
             +      E+       AK  V + ++ V K+ +KF++
Subjt:  SRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTTCTGCCACGTGTTCTCCGAGCTCTCTCCAGCTTCGTCTCGCTCTGAATTGCAAGAATTGCGGCAAATTCCCTTCGGTTCTTGTTCGGGCGAGAGTCAG
GAAGCTCGATCCTCGGCTCCGGGTGATTTGTCACCCCATTGTTCATGAGAGAGCAAGTGGGCTGCGTCGCAATGGAGTTTGTTTTGCTGGGTCGGATTTGAAGGTCGATG
GATTCTCTGGGTGGTCGGAATCGGATTCCGGCGAGGAGGAGGAGGCTTTGGACTTGCGGAGAAAGAAGTGGTTTGGAGGGTTGGTGGGGATTGGAGTTACTGGATTCATC
CTTGTCTCGGGAATCACCTTTGCAGCATGGTCTATAAACAAGCAGAATAGTTCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATCATTGTTGGACTC
TGATACTGGAAATGATAAGCTTGGTGAAGATGAAAAAGAAGATAAAAATGTGTATGCAGATGATGGAATTCTCGCTAGTAAAACAGGTAATCATGAGGAATCTTCTTCAT
ATACAGAAAACGAAGAAACTCTCAACAAAAATAAAGTTGGTGATGTTGAGGAGTTATCAGGGAATGATGTTGAATCTTCACCCAGCAATAATGTTGCTTCCTTGCAAGAA
GATTCCCAATCTGATTCCTCGTTAGCTGTTACATCAGTTGCTTCCGGAAGTTTGAGTTCTCTTATCTCACCTGAATCTGAGATTGATTCTAATGTAGCTGCTGATTCTAA
AGATGTAAACAACTGTCATGCTGGTGCAGAAGTCTTAACTTCTGAACCTGAAATGAATATATTGAAAGATGAACCAGATAACTCACCTAACTCTAATACTAATTCACTAA
ACCTTAAAACTGGTATTCCAGATGAAACACCTGATACTAGTGAAAATTATGATTTCAGCTCTGAGAAATTATCAGTCTATGATGATGGTTCATCAAACTATAATTCTAGC
AACCAGGATGAGACACCTGGTCCTCCTGTAAATGAATTTACAGATTCTTCATTGCATGAATTTTCTAGCATATCTGGCGATACATCCAAGGAATCAGGATTAGTTGACAA
AGAGACTGTAGCTGAATCATCCAAAGGAGTGATGAACCCCAGTAAAACTGAACAGTTCTTATCTGAGGAAACTGCATCAACCTTAGAACAGCAAGGATTATCTGAAGCAG
CATTTGTCTCTGTCACAGCTCATCCATTGGCAGATGATCAAGAGAAAAACCATGAAACTATCATGAATAGTAGTGCTGCCAAACCAGAACTACAAGGCATTTTATTTTCT
TCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGTTAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAGTTGTGGATCAGGTTCAGGGTCAGGCATTGGCAGCACT
GCAAGTTTTAAAGGTGATAGAGGTTGACGTTGAACCTGGTGATCTATGTTCTCGTAGGGAATATGCTCGGTGGCTGGTGTCTGCAAGCAGTGCTCTTTCGAGGAACACAA
CATCTAAAGTATATCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCATTTGATGATATTACTCCTGAAGACCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCT
GGACTCATTTCAAGCAAGCTCTCGAGACACGATATTTTTTCTTCATTGGACGAAGACCAGGGTCCTTTTGATTTCTCTCCCGAAAGTCCCCTATCACGTCAAGATCTTGT
GAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTGCCAGAGGCAGATAGAAAGACGCTCCACCAAGTTTCTGGATTTATAGATACTGATAGGATCCATCCAGATGCTTGTC
CTGCGCTCGTTGCCGATCTTTCTGTAGGAGAACAGGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTTCCAGCCAGAAAAGCCTGTAACGAAAGCACAAGCTGCC
ATTGCTCTTGCAACTGGGGAGGCTTCTGATATAGTAAGTGAGGAGCTTACAAGGATTGAAGCTGAATCAATGGCAGAAAATGCTGTTGCTGCACATAGTGCTTTGGTAGC
TCAAGTTGAGAAAGATATTAATGCCAGCTTCGAGAAAGAACTTTCCATTGAAAGAGAAAAGGTTGACGCTGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGAGTTGGAAA
GATTAAGATCCGAAAGAGAGAGCGATAAGATCGCCTTGATGAGGGAACGTGCTGCCATTGAATCAGAAATGGAAGTTCTTTCGAGGTTAAGGAATGAGTTGGAGGAGCAG
TTGCAAGGCCTGATGAGCAATAAAGTAGAAGTATCATATGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGCTGAAATTGAAAATCAAGAGATTTCCCGCCTGCAATA
TGAGCTTGAGGTTGAGAGGAAGGCGTTGTCCATGGCCAGAGCATGGGCTGAGGACGAAGCAAAAAGAGCAAGAGAACAAGCAAAAGCACTCGAAGAGGCTAGAGATCGCT
GGGAAAAGCGTGGCATCAAAGTAGTCGTCGACAGCGATCTCCGTGAACAGGAATCAGCTGGCGATACTTGGCTTGATTCTAGCAAACAGTTTGCAGTCGAAGAAACCGTC
GACAGGGCTGAAAACTTAATGGACAAGCTGAAAGCAATGGCTACAGAACTAAGAGGGAAATCCAAAGACATCATCGACAAGATCATCCAGAAGATAGCTTTGCTAGTATC
AAACTTGAGACAGTGGCTTTCCAATACTGGAGAACAGGCTGAAGATCTGAAGAATGTGGCCATTGCAAGGGCAAGTAGATCAGCAAGTGAGCTGCAACAGAGCACTGCAG
AGTTGGGCTTGGCCCTGAAGGATGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTTCCCAAAAGTTCAGAACATCGTACGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCACTTCTGCCACGTGTTCTCCGAGCTCTCTCCAGCTTCGTCTCGCTCTGAATTGCAAGAATTGCGGCAAATTCCCTTCGGTTCTTGTTCGGGCGAGAGTCAG
GAAGCTCGATCCTCGGCTCCGGGTGATTTGTCACCCCATTGTTCATGAGAGAGCAAGTGGGCTGCGTCGCAATGGAGTTTGTTTTGCTGGGTCGGATTTGAAGGTCGATG
GATTCTCTGGGTGGTCGGAATCGGATTCCGGCGAGGAGGAGGAGGCTTTGGACTTGCGGAGAAAGAAGTGGTTTGGAGGGTTGGTGGGGATTGGAGTTACTGGATTCATC
CTTGTCTCGGGAATCACCTTTGCAGCATGGTCTATAAACAAGCAGAATAGTTCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATCATTGTTGGACTC
TGATACTGGAAATGATAAGCTTGGTGAAGATGAAAAAGAAGATAAAAATGTGTATGCAGATGATGGAATTCTCGCTAGTAAAACAGGTAATCATGAGGAATCTTCTTCAT
ATACAGAAAACGAAGAAACTCTCAACAAAAATAAAGTTGGTGATGTTGAGGAGTTATCAGGGAATGATGTTGAATCTTCACCCAGCAATAATGTTGCTTCCTTGCAAGAA
GATTCCCAATCTGATTCCTCGTTAGCTGTTACATCAGTTGCTTCCGGAAGTTTGAGTTCTCTTATCTCACCTGAATCTGAGATTGATTCTAATGTAGCTGCTGATTCTAA
AGATGTAAACAACTGTCATGCTGGTGCAGAAGTCTTAACTTCTGAACCTGAAATGAATATATTGAAAGATGAACCAGATAACTCACCTAACTCTAATACTAATTCACTAA
ACCTTAAAACTGGTATTCCAGATGAAACACCTGATACTAGTGAAAATTATGATTTCAGCTCTGAGAAATTATCAGTCTATGATGATGGTTCATCAAACTATAATTCTAGC
AACCAGGATGAGACACCTGGTCCTCCTGTAAATGAATTTACAGATTCTTCATTGCATGAATTTTCTAGCATATCTGGCGATACATCCAAGGAATCAGGATTAGTTGACAA
AGAGACTGTAGCTGAATCATCCAAAGGAGTGATGAACCCCAGTAAAACTGAACAGTTCTTATCTGAGGAAACTGCATCAACCTTAGAACAGCAAGGATTATCTGAAGCAG
CATTTGTCTCTGTCACAGCTCATCCATTGGCAGATGATCAAGAGAAAAACCATGAAACTATCATGAATAGTAGTGCTGCCAAACCAGAACTACAAGGCATTTTATTTTCT
TCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGTTAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAGTTGTGGATCAGGTTCAGGGTCAGGCATTGGCAGCACT
GCAAGTTTTAAAGGTGATAGAGGTTGACGTTGAACCTGGTGATCTATGTTCTCGTAGGGAATATGCTCGGTGGCTGGTGTCTGCAAGCAGTGCTCTTTCGAGGAACACAA
CATCTAAAGTATATCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCATTTGATGATATTACTCCTGAAGACCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCT
GGACTCATTTCAAGCAAGCTCTCGAGACACGATATTTTTTCTTCATTGGACGAAGACCAGGGTCCTTTTGATTTCTCTCCCGAAAGTCCCCTATCACGTCAAGATCTTGT
GAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTGCCAGAGGCAGATAGAAAGACGCTCCACCAAGTTTCTGGATTTATAGATACTGATAGGATCCATCCAGATGCTTGTC
CTGCGCTCGTTGCCGATCTTTCTGTAGGAGAACAGGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTTCCAGCCAGAAAAGCCTGTAACGAAAGCACAAGCTGCC
ATTGCTCTTGCAACTGGGGAGGCTTCTGATATAGTAAGTGAGGAGCTTACAAGGATTGAAGCTGAATCAATGGCAGAAAATGCTGTTGCTGCACATAGTGCTTTGGTAGC
TCAAGTTGAGAAAGATATTAATGCCAGCTTCGAGAAAGAACTTTCCATTGAAAGAGAAAAGGTTGACGCTGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGAGTTGGAAA
GATTAAGATCCGAAAGAGAGAGCGATAAGATCGCCTTGATGAGGGAACGTGCTGCCATTGAATCAGAAATGGAAGTTCTTTCGAGGTTAAGGAATGAGTTGGAGGAGCAG
TTGCAAGGCCTGATGAGCAATAAAGTAGAAGTATCATATGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGCTGAAATTGAAAATCAAGAGATTTCCCGCCTGCAATA
TGAGCTTGAGGTTGAGAGGAAGGCGTTGTCCATGGCCAGAGCATGGGCTGAGGACGAAGCAAAAAGAGCAAGAGAACAAGCAAAAGCACTCGAAGAGGCTAGAGATCGCT
GGGAAAAGCGTGGCATCAAAGTAGTCGTCGACAGCGATCTCCGTGAACAGGAATCAGCTGGCGATACTTGGCTTGATTCTAGCAAACAGTTTGCAGTCGAAGAAACCGTC
GACAGGGCTGAAAACTTAATGGACAAGCTGAAAGCAATGGCTACAGAACTAAGAGGGAAATCCAAAGACATCATCGACAAGATCATCCAGAAGATAGCTTTGCTAGTATC
AAACTTGAGACAGTGGCTTTCCAATACTGGAGAACAGGCTGAAGATCTGAAGAATGTGGCCATTGCAAGGGCAAGTAGATCAGCAAGTGAGCTGCAACAGAGCACTGCAG
AGTTGGGCTTGGCCCTGAAGGATGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTTCCCAAAAGTTCAGAACATCGTACGGTTGA
Protein sequenceShow/hide protein sequence
MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHERASGLRRNGVCFAGSDLKVDGFSGWSESDSGEEEEALDLRRKKWFGGLVGIGVTGFI
LVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVYADDGILASKTGNHEESSSYTENEETLNKNKVGDVEELSGNDVESSPSNNVASLQE
DSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGAEVLTSEPEMNILKDEPDNSPNSNTNSLNLKTGIPDETPDTSENYDFSSEKLSVYDDGSSNYNSS
NQDETPGPPVNEFTDSSLHEFSSISGDTSKESGLVDKETVAESSKGVMNPSKTEQFLSEETASTLEQQGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFS
SAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCSRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEA
GLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDRIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAA
IALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERESDKIALMRERAAIESEMEVLSRLRNELEEQ
LQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETV
DRAENLMDKLKAMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNTGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSYG