; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002621 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002621
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 6 growth stages;
Genome locationscaffold6:1456289..1462757
RNA-Seq ExpressionSpg002621
SyntenySpg002621
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439251.1 PREDICTED: uncharacterized protein LOC103484091 isoform X1 [Cucumis melo]0.0e+0084.72Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVH     +R +GLR  GVCFAGS+S  DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
        KWFGGLVGIG+TGFILVSGITFAAWSINKQNSSRQK QMEALSTQQE LLDS+TG D+LGEDEKED +V+ADD   AGK GN E+SSS TENEE+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VGDGVDVEEL+ N VESS +NNDV+N ASLQED QSDSSL VT+VA GSLSS ISPESE DSNVA+  KDVNNCH G EV TSEPEMN+LKDEPDNSPNS
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
        N NSLNLKTDI DE PDT EN+D SS+KL VYD+ SSNY SGNQDET G  VNE TDSSL  FSS+  DT+KES L D  TVA+S +GV +PSK EQ  S
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQQ+  GLSEAA VS+T +PLADDQE NHETIMN +AAK ELQ ILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLCTRREYARWLV ASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVE+MAEEAKQELERLRSER RD++ALM ERA++ESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQF VEET DRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RGKS+D+I+KIIQKIALLVSNLRQW+S  GEQAE+LKNVAI+RA+RSA+ELQQSTAEL LA+K+GAKRVVGDCREGVEKI+QKFRTSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

XP_011651163.1 uncharacterized protein LOC101215442 isoform X1 [Cucumis sativus]0.0e+0085.01Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVH     +R +G R  GVCFAGS+S  DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
        KWFGG VGIG+TGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQE LLDS+TG D+LGEDEKED +V+ADD  LAGK GN E+SSSYTENEE+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VGDGVDVEEL+ N VESS +NNDV+NVASLQED QSDSSL VTSVA GSLSSLISPESE D+NVA+  KDVNN H G EV TSEPEMNILKDEPDN PNS
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
        NTNSLNLKTDI DE PDT ENYD  S+KL VYDD SSNY SGNQDET   PV+E TDSSL  FSSI  DT+KESGL D ETVA+SS+GV +PS+ EQ  S
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQ +   LSEAA VS++ +PLADDQEKNHETIMN +AAK ELQ I FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIEV
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLCTRREYARWLV ASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVEKMAEEAKQELERLRSERER+ +ALM ERA+IESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQF VEET +RAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RG+S+D+I+KIIQKIALLVSNLRQW+S  GEQAE+LKN AI+RA RSA ELQQSTAEL LA+K+GAKRVVGDCREGVEK +QKFRTSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

XP_022140920.1 uncharacterized protein LOC111011467 isoform X1 [Momordica charantia]0.0e+0085.84Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MAST ATCSP SLQLRLALNCKNC KFPSVLVRARVRKLDPR+R+ C+PIV+     ERA+G RR+GVCFA SDS  DGFSGWSESDSG  EE LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
         WFGGLVGIG+TGFILVSGITFAAWSI+KQNSSRQKPQMEALSTQQE LLDSDTGND+LGE+EKED +VNADD  LAGKTGNHEESSSYTENE+ L+KN 
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GSEVLTSEPEMNILKDEPDNSPN
        VGDGVDVE+LSGNDVESS +NNDVNNVAS QED QSDS  AVTSVA+GSLSSL+    E DS+VA+ SKD N+CHA G+EVL SEPEMNILKD PDNS N
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GSEVLTSEPEMNILKDEPDNSPN

Query:  SNTNSLNLKTDIPDETPDTTENYDFSS--------EKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQN
        SNTNSLN KTDI DETPDT+ENYDFSS        EKL +YDD +SN+NSGNQ E PG P+NE +DSSLH  SS+ GDT+KESG +D+ETV ESSK V N
Subjt:  SNTNSLNLKTDIPDETPDTTENYDFSS--------EKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQN

Query:  PSKTEQLLSEETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA
        P+KTE+LLSE T STLEQQI RGLSEAAFVSVTA+PL D QEK+HETIMNS+AAKPELQGILFSSAGVPAPL SAA+KTLPGKVLVPAVVDQVQGQAL+A
Subjt:  PSKTEQLLSEETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA

Query:  LQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS
        LQVLKVIE +VEP DLCTRREYARWLV ASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SS DEDQGPF FS
Subjt:  LQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS

Query:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL
        PESPLSRQDLVSWKMALEKRQLPEADRK LHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL
Subjt:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL

Query:  ARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMS
        ARIEAESMAENAVAAH ALVAQVEKDINASFEK+LSIEREKVDAVEKMAEEAKQELERLRSERER+N+ALM+E AAIESEMEV SRLRNELEEQLQGLMS
Subjt:  ARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMS

Query:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV
        NKVEVSYEKERINKLRKEAEIENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+RGIKVVVDSDLREQESAGDTWLDSSKQF+V
Subjt:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV

Query:  EETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK
        +ETVDRAENLMDKLK MA ELRGKSK+I+DKII+KIALL+SNLRQW+S+AG+QAEDLK VAI+RASRS SELQQSTAEL LALK+GAKRVVGDCREGVEK
Subjt:  EETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK

Query:  ISQKFRTSYG
        I+QKF+TSYG
Subjt:  ISQKFRTSYG

XP_022968011.1 uncharacterized protein LOC111467385 isoform X1 [Cucurbita maxima]0.0e+0083.32Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSP+SLQLRLALNCKN  KFP V VRA VRKLDPRLRVIC PIVH      R +GLRR+G+CFAGSDSK DGFSGWSESDSGEE+  L+LRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
         W  GLVGIG+TGFILVSGITFAAWSINKQN S+QK QMEALST QE LLDSD+GNDKLGED+KED +VNADD        NHEE SSYTEN+E+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VG   DVEE SG+DVE S NN++VNNVA LQED QSDSSLAVT VA GS        SEID ++ + SKDVN   +G+EVLTSEPEMN   DEPDNS   
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
                       P+T+E YDFSSEKL VYDD SSNYNSG QDET  PPVNE  DSSLH FS+  GD +KE GL++KE V ES +GV NP KTE+LLS
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        EETAST+EQ+IGRGLS+AAFVSVTA+PLADDQE+NHET MNS AA+PELQG LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        +VEP  LCTRREYARWLV AS ALSRNT SKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SSLD+D+GPF FSPESPLSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREK DAVEKMAEEAKQELERLR E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMAR+WAE+EAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LMDKLK MATE+RGKSKDII+ IIQKIALL+SNLRQW+ NAGE+AED+KNVAIARASRSA+ELQQS+AE+GLALK+GAKRVVGDCREGVEKISQKFRTSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

XP_038892464.1 uncharacterized protein LOC120081550 isoform X1 [Benincasa hispida]0.0e+0083.32Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSPSSLQLRLALNCKNCGKFPSVLVRAR+RKLDPRLRVICHPIV+     ER +GL   GVCFAGS+S  DGFSGWSESDS  +E+ LDL RK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
        KW GG VGIG+TGFIL+SGITFAAWSINKQNSSRQKPQMEALSTQQE LL SDTGNDKLGED KE+  +NADD    GKTGN E+SSS TENEE+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VGD VDV+EL+ NDVESS +NNDVN+V SLQED QSDSSL VTSVA GSLSSLISPESE DSN+A+  KDVNN H+G+EV TSE EMNILKDEPDN PNS
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
        NTNSLNLKTDI DE PDT ENYDFSS+KL +YDD SSNYNSGNQD+T G PVNE TDSSL   S                               EQ LS
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E TAST+EQ+I RGLSEAA VSVT +P ADDQEKNHE++MN +AAKPELQ ILFSSAGVPAP+VSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DV P DLCTRREYARWLV ASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLP ADRK LHQVSGFID DKIHPDACPALVADLSVGEQGI+ALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVA HSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRS+RERD+I L+RERA+IESEME+LSRLR+ELEEQL+GLMSNKVE+S+EK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESA DTWLDSSKQFAVEET DRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MATE+RGKS+D+I+KIIQKIALLVSNLRQW+S  GEQAEDLKNVAI+RA+RSA ELQQSTAEL LALK+GAKRVVGDCREGVEKI+QKF+TSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A0A0L9T9 Uncharacterized protein0.0e+0085.01Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVH     +R +G R  GVCFAGS+S  DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
        KWFGG VGIG+TGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQE LLDS+TG D+LGEDEKED +V+ADD  LAGK GN E+SSSYTENEE+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VGDGVDVEEL+ N VESS +NNDV+NVASLQED QSDSSL VTSVA GSLSSLISPESE D+NVA+  KDVNN H G EV TSEPEMNILKDEPDN PNS
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
        NTNSLNLKTDI DE PDT ENYD  S+KL VYDD SSNY SGNQDET   PV+E TDSSL  FSSI  DT+KESGL D ETVA+SS+GV +PS+ EQ  S
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQ +   LSEAA VS++ +PLADDQEKNHETIMN +AAK ELQ I FSSAGVPAPLVSAAVKT PGKVL+PAVVDQVQGQALAALQVLKVIEV
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLCTRREYARWLV ASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVEKMAEEAKQELERLRSERER+ +ALM ERA+IESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQF VEET +RAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RG+S+D+I+KIIQKIALLVSNLRQW+S  GEQAE+LKN AI+RA RSA ELQQSTAEL LA+K+GAKRVVGDCREGVEK +QKFRTSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

A0A1S3AYZ8 uncharacterized protein LOC103484091 isoform X10.0e+0084.72Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MASTS TCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVH     +R +GLR  GVCFAGS+S  DGFSGWSESDS  + E LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
        KWFGGLVGIG+TGFILVSGITFAAWSINKQNSSRQK QMEALSTQQE LLDS+TG D+LGEDEKED +V+ADD   AGK GN E+SSS TENEE+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VGDGVDVEEL+ N VESS +NNDV+N ASLQED QSDSSL VT+VA GSLSS ISPESE DSNVA+  KDVNNCH G EV TSEPEMN+LKDEPDNSPNS
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
        N NSLNLKTDI DE PDT EN+D SS+KL VYD+ SSNY SGNQDET G  VNE TDSSL  FSS+  DT+KES L D  TVA+S +GV +PSK EQ  S
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        E+ A ++EQQ+  GLSEAA VS+T +PLADDQE NHETIMN +AAK ELQ ILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        DVEP DLCTRREYARWLV ASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSRHDI SSLDEDQGP  FSPES LSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLPEADRK LHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREKV+AVE+MAEEAKQELERLRSER RD++ALM ERA++ESEMEVLSRLR+ELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQES GDTWLDSSKQF VEET DRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LM+KLK MA E+RGKS+D+I+KIIQKIALLVSNLRQW+S  GEQAE+LKNVAI+RA+RSA+ELQQSTAEL LA+K+GAKRVVGDCREGVEKI+QKFRTSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

A0A6J1CGI7 uncharacterized protein LOC111011467 isoform X10.0e+0085.84Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MAST ATCSP SLQLRLALNCKNC KFPSVLVRARVRKLDPR+R+ C+PIV+     ERA+G RR+GVCFA SDS  DGFSGWSESDSG  EE LDLRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVH-----ERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
         WFGGLVGIG+TGFILVSGITFAAWSI+KQNSSRQKPQMEALSTQQE LLDSDTGND+LGE+EKED +VNADD  LAGKTGNHEESSSYTENE+ L+KN 
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GSEVLTSEPEMNILKDEPDNSPN
        VGDGVDVE+LSGNDVESS +NNDVNNVAS QED QSDS  AVTSVA+GSLSSL+    E DS+VA+ SKD N+CHA G+EVL SEPEMNILKD PDNS N
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHA-GSEVLTSEPEMNILKDEPDNSPN

Query:  SNTNSLNLKTDIPDETPDTTENYDFSS--------EKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQN
        SNTNSLN KTDI DETPDT+ENYDFSS        EKL +YDD +SN+NSGNQ E PG P+NE +DSSLH  SS+ GDT+KESG +D+ETV ESSK V N
Subjt:  SNTNSLNLKTDIPDETPDTTENYDFSS--------EKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQN

Query:  PSKTEQLLSEETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA
        P+KTE+LLSE T STLEQQI RGLSEAAFVSVTA+PL D QEK+HETIMNS+AAKPELQGILFSSAGVPAPL SAA+KTLPGKVLVPAVVDQVQGQAL+A
Subjt:  PSKTEQLLSEETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAA

Query:  LQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS
        LQVLKVIE +VEP DLCTRREYARWLV ASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SS DEDQGPF FS
Subjt:  LQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFS

Query:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL
        PESPLSRQDLVSWKMALEKRQLPEADRK LHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEEL
Subjt:  PESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEEL

Query:  ARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMS
        ARIEAESMAENAVAAH ALVAQVEKDINASFEK+LSIEREKVDAVEKMAEEAKQELERLRSERER+N+ALM+E AAIESEMEV SRLRNELEEQLQGLMS
Subjt:  ARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMS

Query:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV
        NKVEVSYEKERINKLRKEAEIENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+RGIKVVVDSDLREQESAGDTWLDSSKQF+V
Subjt:  NKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAV

Query:  EETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK
        +ETVDRAENLMDKLK MA ELRGKSK+I+DKII+KIALL+SNLRQW+S+AG+QAEDLK VAI+RASRS SELQQSTAEL LALK+GAKRVVGDCREGVEK
Subjt:  EETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEK

Query:  ISQKFRTSYG
        I+QKF+TSYG
Subjt:  ISQKFRTSYG

A0A6J1HSB9 uncharacterized protein LOC111467385 isoform X20.0e+0082.82Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSP+SLQLRLALNCKN  KFP V VRA VRKLDPRLRVIC PIVH      R +GLRR+G+CFAGSDSK DGFSGWSESDSGEE+  L+LRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
         W  GLVGIG+TGFILVSGITFAAWSINKQN S+QK QMEALST QE LLDSD+GNDKLGED+KED +VNADD        NHEE SSYTEN+E+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VG   DVEE SG+DVE S NN++VNNVA LQED QSDSSLAVT VA GS        SEID ++ + SKDVN   +G+EVLTSEPEMN   DEPDNS   
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
                       P+T+E YDFSSEKL VYDD SSNYNSG QDET  PPVNE  DSSLH           E GL++KE V ES +GV NP KTE+LLS
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        EETAST+EQ+IGRGLS+AAFVSVTA+PLADDQE+NHET MNS AA+PELQG LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        +VEP  LCTRREYARWLV AS ALSRNT SKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SSLD+D+GPF FSPESPLSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREK DAVEKMAEEAKQELERLR E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMAR+WAE+EAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LMDKLK MATE+RGKSKDII+ IIQKIALL+SNLRQW+ NAGE+AED+KNVAIARASRSA+ELQQS+AE+GLALK+GAKRVVGDCREGVEKISQKFRTSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

A0A6J1HTP0 uncharacterized protein LOC111467385 isoform X10.0e+0083.32Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK
        MAST +TCSP+SLQLRLALNCKN  KFP V VRA VRKLDPRLRVIC PIVH      R +GLRR+G+CFAGSDSK DGFSGWSESDSGEE+  L+LRRK
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHE-----RASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRK

Query:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
         W  GLVGIG+TGFILVSGITFAAWSINKQN S+QK QMEALST QE LLDSD+GNDKLGED+KED +VNADD        NHEE SSYTEN+E+LNKNR
Subjt:  KWFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
        VG   DVEE SG+DVE S NN++VNNVA LQED QSDSSLAVT VA GS        SEID ++ + SKDVN   +G+EVLTSEPEMN   DEPDNS   
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS
                       P+T+E YDFSSEKL VYDD SSNYNSG QDET  PPVNE  DSSLH FS+  GD +KE GL++KE V ES +GV NP KTE+LLS
Subjt:  NTNSLNLKTDIPDETPDTTENYDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLS

Query:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV
        EETAST+EQ+IGRGLS+AAFVSVTA+PLADDQE+NHET MNS AA+PELQG LFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIE 
Subjt:  EETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEV

Query:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD
        +VEP  LCTRREYARWLV AS ALSRNT SKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDI SSLD+D+GPF FSPESPLSRQD
Subjt:  DVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQD

Query:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA
        LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGE GIIALAFGYTRLFQP+KPVTKAQAAIALATGEASDIVSEELARIEAESMA
Subjt:  LVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMA

Query:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
        ENAVAAHSALVAQVEKDINASFEKELSIEREK DAVEKMAEEAKQELERLR E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK
Subjt:  ENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEK

Query:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
        ERINKLRKEAEIENQEISRLQYELEVERKALSMAR+WAE+EAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN
Subjt:  ERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVDRAEN

Query:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY
        LMDKLK MATE+RGKSKDII+ IIQKIALL+SNLRQW+ NAGE+AED+KNVAIARASRSA+ELQQS+AE+GLALK+GAKRVVGDCREGVEKISQKFRTSY
Subjt:  LMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRTSY

Query:  G
        G
Subjt:  G

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25680.1 FUNCTIONS IN: molecular_function unknown2.9e-5536.18Show/hide
Query:  KVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS
        +V  P  VD  Q +A+A L+ LK+ E D+   +LCT+REYARWLV ++S L RN    + PA+ +   +  AFDDI   DPDF  IQ LAEAG+ SSKLS
Subjt:  KVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS

Query:  RHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQ
          D       D G  +F+PES +SR DLV+WK  LE    PE   +       +IDT  I+PD       D  +G++  I   FG  + FQP +PVTKAQ
Subjt:  RHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQ

Query:  AAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEME
        AA+AL +G+    ++ EL+R+EAES+++ A           + +I   +++++  ER +   +E++      E+E  ++ +E+ +   ++E+AAI+ + +
Subjt:  AAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEME

Query:  VLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE
        +L+ L  E++E  Q L+S+K     E  ++ ++  + + + + +   +  LE E +AL + R+W EDE K ++ +AK LEEA  RW+
Subjt:  VLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope9.6e-20046.76Show/hide
Query:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHERASGLRRNGVCFAGSD---SKVDGFSGWSESDSGEEEEALDLRRKKW
        MAS +AT +P+SLQLRLAL+     K P+V +R           ++C           +     + GSD   S  D  +GW +SD+ +++++  +++K  
Subjt:  MASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHERASGLRRNGVCFAGSD---SKVDGFSGWSESDSGEEEEALDLRRKKW

Query:  FGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDS--DTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR
          G+VG GV G IL  G+++AA S +K+    +K +M +L++QQES++ S  +  +D++     E+ N+  +D  +       +      E++    +  
Subjt:  FGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDS--DTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNR

Query:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS
          DGV  +E   +  ES P N      A L  ++++D   A +        SL+   +E    +  D++  N    G E   SE   ++L  EP N  + 
Subjt:  VGDGVDVEELSGNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNS

Query:  NTNSLNLKTD---------------IPDETPDTTENYDFSSEK--LSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVA
          +  + K D                  E P+ +   D +S+   + + D  ++   +    E  G P   F  S   + S I  DT+KE     K  V 
Subjt:  NTNSLNLKTD---------------IPDETPDTTENYDFSSEK--LSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVA

Query:  ESSKGVQNPSKTEQLLSEETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQ
        ES+ G    SK E                        +++ +    DD     E     SA         FSSAG+PAP +S  V   PGK+LVP   DQ
Subjt:  ESSKGVQNPSKTEQLLSEETASTLEQQIGRGLSEAAFVSVTAHPLADDQEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQ

Query:  VQGQALAALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDE
        +Q QA AALQVLKVIE D +P DLCTRREYARWL+ ASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDF+SIQGLAEAGLI+SKLS  D+   LD+
Subjt:  VQGQALAALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDE

Query:  DQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEA
         +G F FSPES LSRQDL+SWKMALEKRQLPEAD+K L+++SGFID DKI+PDA P+++ADLS GEQGI ALAFG TRLFQP KPVTK QAAIAL++GEA
Subjt:  DQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEA

Query:  SDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELE
        SDIVSEELARIEAESMAE AV+AH+ALVA+VEKD+NASFEKELS+EREK++AVEKMAE AK ELE+LR +RE +N+AL++ERAA+ESEMEVLSRLR + E
Subjt:  SDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELE

Query:  EQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGD
        E+L+ LMSNK E+++EKER+  LRKEAE E+Q IS+LQYELEVERKALSMAR+WAE+EAK+AREQ +ALEEAR RWE  G++VVVD DL+E   +E+   
Subjt:  EQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGD

Query:  TWLDSSKQFAVEETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKR
          L+  ++ +VEET  RA+ LMDKLK MA  + GKS+++I  +++KI L ++ L+++  N G++A ++++ AI RA  +A++++Q T +    + D  K+
Subjt:  TWLDSSKQFAVEETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKR

Query:  VVGDCREGVEKISQKFRT
        +  +CR+GV KISQ+F+T
Subjt:  VVGDCREGVEKISQKFRT

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119)3.7e-15963.14Show/hide
Query:  AALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFD
        AALQ LKVIE D  P DLCTRRE+ARW+V AS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS +++ SS   +     
Subjt:  AALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFD

Query:  FSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSE
        FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V E
Subjt:  FSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSE

Query:  ELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGL
        ELARIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E VDAVEK+AEEAK EL RLR E+E + +AL RER +IE+EME L+R+RNELEEQLQ L
Subjt:  ELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGL

Query:  MSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQ
         SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWEK G+KV+VDSDL EQ +  + TWL++ KQ
Subjt:  MSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQ

Query:  FAVEETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREG
          VE T+ RA NL+ KLK MA ++  KS+++I  II+KI+LL+S L+Q +     +A+DLK    ++A     +      E+       AK  V + ++ 
Subjt:  FAVEETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRVVGDCREG

Query:  VEKISQKFRT
        V K+ +KF++
Subjt:  VEKISQKFRT

AT5G52410.2 INVOLVED IN: biological_process unknown3.8e-16461.99Show/hide
Query:  GVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFA
        G+PAP     V +L  K + P VVD VQ Q  AALQ LKVIE D  P DLCTRRE+ARW+V AS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF 
Subjt:  GVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFA

Query:  SIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAF
         IQGLAEAGLISSKLS +++ SS   +     FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GE GI AL+F
Subjt:  SIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAF

Query:  GYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERD
        G TRLFQP K VTKAQ A++LA G+A ++V EELARIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E VDAVEK+AEEAK EL RLR E+E +
Subjt:  GYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERD

Query:  NIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARD
         +AL RER +IE+EME L+R+RNELEEQLQ L SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR 
Subjt:  NIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARD

Query:  RWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARA
        RWEK G+KV+VDSDL EQ +  + TWL++ KQ  VE T+ RA NL+ KLK MA ++  KS+++I  II+KI+LL+S L+Q +     +A+DLK    ++A
Subjt:  RWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARA

Query:  SRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRT
             +      E+       AK  V + ++ V K+ +KF++
Subjt:  SRSASELQQSTAELGLALKDGAKRVVGDCREGVEKISQKFRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGCTACCTTAACGCCTGCTATTGGAGTAGGATTTTTCATGGCTTCCACTTCTGCCACGTGTTCTCCGAGCTCTCTCCAGCTTCGTCTCGCTCTGAATTGCAAGAA
TTGCGGCAAATTCCCTTCGGTTCTTGTTCGGGCGAGAGTGAGGAAGCTGGATCCTCGGCTCCGGGTGATTTGTCACCCCATTGTTCATGAGAGAGCAAGTGGGTTGCGTC
GCAATGGAGTTTGTTTTGCTGGGTCGGATTCGAAGGTCGATGGATTCTCTGGGTGGTCGGAATCGGATTCCGGCGAGGAGGAGGAGGCTTTGGACTTGCGGAGAAAGAAG
TGGTTTGGAGGGTTGGTGGGGATTGGAGTTACTGGATTCATCCTTGTCTCGGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTTCCAGACAAAAGCCACA
AATGGAGGCCTTAAGTACGCAGCAAGAATCATTGTTGGACTCTGATACTGGAAATGATAAGCTTGGTGAAGATGAAAAAGAAGATAAAAATGTGAATGCAGATGATGGAA
TTCTCGCTGGTAAAACAGGTAATCATGAGGAATCTTCTTCATATACAGAAAATGAAGAATCTCTCAACAAAAATAGAGTTGGTGATGGTGTTGATGTTGAGGAGTTATCA
GGGAATGATGTTGAATCTTCACCCAACAATAATGATGTCAATAATGTTGCTTCCTTGCAAGAAGATTCCCAATCTGATTCCTCGTTAGCTGTTACATCAGTTGCTTCCGG
AAGTTTGAGTTCTCTTATCTCACCTGAATCTGAGATTGATTCTAATGTAGCTGCTGATTCTAAAGATGTAAACAACTGTCATGCTGGTTCAGAAGTCTTAACTTCTGAAC
CTGAAATGAATATATTGAAAGATGAACCAGATAACTCACCTAACTCTAATACTAATTCACTAAACCTTAAAACTGATATTCCGGATGAAACACCAGACACTACTGAAAAT
TATGATTTCAGCTCTGAGAAATTATCAGTCTATGATGATGGTTCATCAAACTATAATTCTGGCAACCAGGATGAGACACCTGGTCCTCCTGTAAATGAATTTACAGATTC
TTCATTGCATGCATTTTCTAGCATCTATGGCGATACATCCAAGGAATCGGGATTAATTGACAAAGAGACTGTAGCTGAATCATCCAAAGGAGTTCAGAACCCCAGTAAAA
CTGAACAGCTCTTATCTGAGGAAACTGCATCAACCTTAGAACAGCAAATAGGAAGGGGATTATCTGAAGCAGCATTTGTCTCTGTCACAGCTCATCCATTGGCAGATGAT
CAAGAGAAAAATCATGAAACTATCATGAATAGTAGTGCTGCCAAACCAGAACTACAAGGCATTTTATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGT
TAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAGTTGTGGATCAGGTTCAGGGGCAGGCATTGGCAGCACTGCAAGTTTTAAAGGTGATAGAGGTTGACGTTGAACCTG
GTGATCTATGTACTCGTAGGGAATATGCTCGTTGGCTGGTGTATGCAAGCAGTGCTCTTTCGAGGAACACAACATCTAAAGTATATCCAGCAATGTATATAGAGAATGTT
ACCGAGCTTGCATTTGATGATATTACTCCCGAAGACCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCAAGCAAGCTCTCGAGACATGATATTTT
TTCTTCATTGGACGAAGACCAGGGTCCTTTTGATTTCTCTCCCGAAAGTCCCCTATCACGTCAAGATCTTGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTGCCAG
AGGCAGATAGAAAGACGCTCCACCAAGTTTCTGGATTTATAGATACTGATAAGATCCATCCAGATGCTTGTCCTGCGCTCGTTGCCGATCTTTCTGTAGGAGAACAGGGA
ATAATAGCTCTTGCATTTGGATATACAAGGCTTTTCCAGCCAGAAAAGCCTGTAACGAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCTGATATAGTAAG
TGAGGAGCTTGCAAGGATTGAAGCTGAATCAATGGCAGAAAATGCTGTTGCTGCACATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATTAATGCCAGCTTCGAGAAAG
AACTTTCCATTGAAAGAGAAAAGGTTGACGCTGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGAGTTGGAAAGATTAAGATCCGAAAGAGAGAGAGATAACATCGCCTTG
ATGAGGGAACGTGCTGCGATTGAATCAGAAATGGAAGTTCTTTCGAGGTTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAAGTATCATA
TGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGCTGAAATTGAAAATCAGGAGATTTCCCGCCTGCAATATGAGCTTGAGGTTGAGAGGAAGGCGTTGTCCATGGCCA
GAGCATGGGCCGAGGACGAAGCAAAAAGAGCAAGAGAACAAGCAAAAGCACTCGAAGAGGCTAGAGATCGCTGGGAAAAGCGTGGCATCAAAGTAGTCGTCGACAGCGAT
CTCCGCGAACAGGAATCAGCTGGCGATACTTGGCTTGATTCTAGCAAACAGTTTGCAGTCGAAGAAACCGTCGACAGGGCCGAAAACTTAATGGACAAGCTGAAATCAAT
GGCTACAGAACTAAGAGGGAAATCCAAAGACATCATCGACAAGATCATCCAGAAGATAGCATTACTAGTATCAAACTTGAGACAGTGGCTTTCCAATGCTGGAGAACAGG
CTGAAGATCTGAAGAATGTGGCCATTGCAAGGGCAAGTAGATCAGCAAGTGAGCTGCAACAGAGCACTGCAGAGTTGGGCTTGGCCCTGAAGGATGGAGCAAAGCGAGTT
GTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTTCCCAAAAGTTCAGAACATCGTACGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGCTACCTTAACGCCTGCTATTGGAGTAGGATTTTTCATGGCTTCCACTTCTGCCACGTGTTCTCCGAGCTCTCTCCAGCTTCGTCTCGCTCTGAATTGCAAGAA
TTGCGGCAAATTCCCTTCGGTTCTTGTTCGGGCGAGAGTGAGGAAGCTGGATCCTCGGCTCCGGGTGATTTGTCACCCCATTGTTCATGAGAGAGCAAGTGGGTTGCGTC
GCAATGGAGTTTGTTTTGCTGGGTCGGATTCGAAGGTCGATGGATTCTCTGGGTGGTCGGAATCGGATTCCGGCGAGGAGGAGGAGGCTTTGGACTTGCGGAGAAAGAAG
TGGTTTGGAGGGTTGGTGGGGATTGGAGTTACTGGATTCATCCTTGTCTCGGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTTCCAGACAAAAGCCACA
AATGGAGGCCTTAAGTACGCAGCAAGAATCATTGTTGGACTCTGATACTGGAAATGATAAGCTTGGTGAAGATGAAAAAGAAGATAAAAATGTGAATGCAGATGATGGAA
TTCTCGCTGGTAAAACAGGTAATCATGAGGAATCTTCTTCATATACAGAAAATGAAGAATCTCTCAACAAAAATAGAGTTGGTGATGGTGTTGATGTTGAGGAGTTATCA
GGGAATGATGTTGAATCTTCACCCAACAATAATGATGTCAATAATGTTGCTTCCTTGCAAGAAGATTCCCAATCTGATTCCTCGTTAGCTGTTACATCAGTTGCTTCCGG
AAGTTTGAGTTCTCTTATCTCACCTGAATCTGAGATTGATTCTAATGTAGCTGCTGATTCTAAAGATGTAAACAACTGTCATGCTGGTTCAGAAGTCTTAACTTCTGAAC
CTGAAATGAATATATTGAAAGATGAACCAGATAACTCACCTAACTCTAATACTAATTCACTAAACCTTAAAACTGATATTCCGGATGAAACACCAGACACTACTGAAAAT
TATGATTTCAGCTCTGAGAAATTATCAGTCTATGATGATGGTTCATCAAACTATAATTCTGGCAACCAGGATGAGACACCTGGTCCTCCTGTAAATGAATTTACAGATTC
TTCATTGCATGCATTTTCTAGCATCTATGGCGATACATCCAAGGAATCGGGATTAATTGACAAAGAGACTGTAGCTGAATCATCCAAAGGAGTTCAGAACCCCAGTAAAA
CTGAACAGCTCTTATCTGAGGAAACTGCATCAACCTTAGAACAGCAAATAGGAAGGGGATTATCTGAAGCAGCATTTGTCTCTGTCACAGCTCATCCATTGGCAGATGAT
CAAGAGAAAAATCATGAAACTATCATGAATAGTAGTGCTGCCAAACCAGAACTACAAGGCATTTTATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGT
TAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAGTTGTGGATCAGGTTCAGGGGCAGGCATTGGCAGCACTGCAAGTTTTAAAGGTGATAGAGGTTGACGTTGAACCTG
GTGATCTATGTACTCGTAGGGAATATGCTCGTTGGCTGGTGTATGCAAGCAGTGCTCTTTCGAGGAACACAACATCTAAAGTATATCCAGCAATGTATATAGAGAATGTT
ACCGAGCTTGCATTTGATGATATTACTCCCGAAGACCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCAAGCAAGCTCTCGAGACATGATATTTT
TTCTTCATTGGACGAAGACCAGGGTCCTTTTGATTTCTCTCCCGAAAGTCCCCTATCACGTCAAGATCTTGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTGCCAG
AGGCAGATAGAAAGACGCTCCACCAAGTTTCTGGATTTATAGATACTGATAAGATCCATCCAGATGCTTGTCCTGCGCTCGTTGCCGATCTTTCTGTAGGAGAACAGGGA
ATAATAGCTCTTGCATTTGGATATACAAGGCTTTTCCAGCCAGAAAAGCCTGTAACGAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCTGATATAGTAAG
TGAGGAGCTTGCAAGGATTGAAGCTGAATCAATGGCAGAAAATGCTGTTGCTGCACATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATTAATGCCAGCTTCGAGAAAG
AACTTTCCATTGAAAGAGAAAAGGTTGACGCTGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGAGTTGGAAAGATTAAGATCCGAAAGAGAGAGAGATAACATCGCCTTG
ATGAGGGAACGTGCTGCGATTGAATCAGAAATGGAAGTTCTTTCGAGGTTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAAGTATCATA
TGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGCTGAAATTGAAAATCAGGAGATTTCCCGCCTGCAATATGAGCTTGAGGTTGAGAGGAAGGCGTTGTCCATGGCCA
GAGCATGGGCCGAGGACGAAGCAAAAAGAGCAAGAGAACAAGCAAAAGCACTCGAAGAGGCTAGAGATCGCTGGGAAAAGCGTGGCATCAAAGTAGTCGTCGACAGCGAT
CTCCGCGAACAGGAATCAGCTGGCGATACTTGGCTTGATTCTAGCAAACAGTTTGCAGTCGAAGAAACCGTCGACAGGGCCGAAAACTTAATGGACAAGCTGAAATCAAT
GGCTACAGAACTAAGAGGGAAATCCAAAGACATCATCGACAAGATCATCCAGAAGATAGCATTACTAGTATCAAACTTGAGACAGTGGCTTTCCAATGCTGGAGAACAGG
CTGAAGATCTGAAGAATGTGGCCATTGCAAGGGCAAGTAGATCAGCAAGTGAGCTGCAACAGAGCACTGCAGAGTTGGGCTTGGCCCTGAAGGATGGAGCAAAGCGAGTT
GTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTTCCCAAAAGTTCAGAACATCGTACGGTTGA
Protein sequenceShow/hide protein sequence
MVATLTPAIGVGFFMASTSATCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHERASGLRRNGVCFAGSDSKVDGFSGWSESDSGEEEEALDLRRKK
WFGGLVGIGVTGFILVSGITFAAWSINKQNSSRQKPQMEALSTQQESLLDSDTGNDKLGEDEKEDKNVNADDGILAGKTGNHEESSSYTENEESLNKNRVGDGVDVEELS
GNDVESSPNNNDVNNVASLQEDSQSDSSLAVTSVASGSLSSLISPESEIDSNVAADSKDVNNCHAGSEVLTSEPEMNILKDEPDNSPNSNTNSLNLKTDIPDETPDTTEN
YDFSSEKLSVYDDGSSNYNSGNQDETPGPPVNEFTDSSLHAFSSIYGDTSKESGLIDKETVAESSKGVQNPSKTEQLLSEETASTLEQQIGRGLSEAAFVSVTAHPLADD
QEKNHETIMNSSAAKPELQGILFSSAGVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIEVDVEPGDLCTRREYARWLVYASSALSRNTTSKVYPAMYIENV
TELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDIFSSLDEDQGPFDFSPESPLSRQDLVSWKMALEKRQLPEADRKTLHQVSGFIDTDKIHPDACPALVADLSVGEQG
IIALAFGYTRLFQPEKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVDAVEKMAEEAKQELERLRSERERDNIAL
MRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKRGIKVVVDSD
LREQESAGDTWLDSSKQFAVEETVDRAENLMDKLKSMATELRGKSKDIIDKIIQKIALLVSNLRQWLSNAGEQAEDLKNVAIARASRSASELQQSTAELGLALKDGAKRV
VGDCREGVEKISQKFRTSYG