; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G009940 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G009940
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein KOKOPELLI isoform X1
Genome locationCmo_Chr02:6098498..6101890
RNA-Seq ExpressionCmoCh02G009940
SyntenyCmoCh02G009940
Gene Ontology termsGO:0012511 - monolayer-surrounded lipid storage body (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000136 - Oleosin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605646.1 Protein KOKOPELLI, partial [Cucurbita argyrosperma subsp. sororia]1.4e-24695.75Show/hide
Query:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI
        MEAD+LYLDLLALRQLY FLLKCCLRDANSELVVG RAKIL KHLLDDATTGLLEFHSKT P YNF RKDDKQTKPLDEKVAEWMEHNQTAR M NPEKI
Subjt:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI

Query:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE
        EHKP RD ASASNVAANDLSSGISSALRRIELHILSLQRYTRSH+SETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE
Subjt:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE

Query:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS
        LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS
Subjt:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS

Query:  PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS
        PPATGSEASSQ GNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS
Subjt:  PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS

Query:  RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK
        RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNC     GKKLHWWKMIRRRRGVKLPNKGRVKIGY+ K
Subjt:  RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK

KAG7035555.1 Protein KOKOPELLI, partial [Cucurbita argyrosperma subsp. argyrosperma]4.6e-23494.29Show/hide
Query:  NLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMA
        NLSDKMEAD+LYLDLLALRQLY FLLKCCLRDANSELVVG RAKIL KHLLDDATTGLLEFHSKT P YNF RKDDKQT PLDEKVAEWMEHNQTAR M 
Subjt:  NLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMA

Query:  NPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEA
        NPEKIEHKP RD ASASNVAANDLSSGISSALRRIELHILSLQRYTRS++SETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEA
Subjt:  NPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEA

Query:  MKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSP
        MKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHN+THLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSP
Subjt:  MKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSP

Query:  GDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTT
        GDQSSPPATGSEASSQ GNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKE TSKEEKNG+LRKTT
Subjt:  GDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTT

Query:  IRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMI
        IRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNC     GKKLHWW  +
Subjt:  IRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMI

XP_022958321.1 uncharacterized protein LOC111459571 isoform X1 [Cucurbita moschata]3.2e-26799.59Show/hide
Query:  YSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHN
        YSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHN
Subjt:  YSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHN

Query:  QTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPL
        QTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPL
Subjt:  QTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPL

Query:  TQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESET
        TQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESET
Subjt:  TQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESET

Query:  TADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKN
        TADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKN
Subjt:  TADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKN

Query:  GMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK
        GMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGY+ K
Subjt:  GMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK

XP_022958322.1 uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata]2.9e-26099.58Show/hide
Query:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI
        MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI
Subjt:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI

Query:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE
        EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE
Subjt:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE

Query:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS
        LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS
Subjt:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS

Query:  PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS
        PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS
Subjt:  PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS

Query:  RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK
        RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGY+ K
Subjt:  RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK

XP_022996025.1 uncharacterized protein LOC111491355 isoform X1 [Cucurbita maxima]6.3e-23186.85Show/hide
Query:  LYYSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWME
        ++YSEALAYNLSDKMEADELYLDLLALRQLY FLLKCCLRDANSELVVGARAKIL KHLLDDATTGLLEFHSKTL FYNFLRKDDKQTKPLDEKVAEWME
Subjt:  LYYSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWME

Query:  HNQTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRI
        HNQTAR MANPEKIEHKP RDRASASNVAANDLSSGI+SALRRIELHILSLQRYTRSHISETKLAYYGQSV+QGNES N QKVKPMVANHCSKFV+GFRI
Subjt:  HNQTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRI

Query:  PLTQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-------------HNKTHLAAQQESEYTNSESESA
        PLTQDK+EAMKQHEL LPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVM+PTLW             HN+ HLAAQQESE+TNSES S 
Subjt:  PLTQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-------------HNKTHLAAQQESEYTNSESESA

Query:  PSSSPATRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFH------HHHHHYHNGHNSMWK
          SSPAT QTSESETT DSSSP +QSSP ATGSEASSQ GNSSSNI+R+AFKFSHGKKES  AVGRFKSLRNKLGLIFH      HHHHH+H+GHNSMWK
Subjt:  PSSSPATRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFH------HHHHHYHNGHNSMWK

Query:  QVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIG
        QVR +FHRT KKELTSKEEK G LRKTTIRSVSRNNQVGKFQAL EGLRSHVWKSKAMKKKEQRGLNC     GKKLHWWKMIRRRRGVK PNKGRVKIG
Subjt:  QVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIG

Query:  YL
        Y+
Subjt:  YL

TrEMBL top hitse value%identityAlignment
A0A6J1DLN1 protein KOKOPELLI isoform X11.8e-12255.95Show/hide
Query:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI
        ME +ELYLDLLALR+LY+ LLK CLRDANSEL +  RA+IL KHLLDDAT  +++FHSK              TKP++EKVAEWME+NQ+ R        
Subjt:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI

Query:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRY------TRSHISETKLAYYGQSVHQGNESLNHQKVKPMVA----NHCSKFVHGFRIPLTQ
                    NVAANDLS+GI  ALRRIE HILSLQ Y      TRSHI+  KL+       +    ++H  +K  VA     HCS+FVHGFR+PL+Q
Subjt:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRY------TRSHISETKLAYYGQSVHQGNESLNHQKVKPMVA----NHCSKFVHGFRIPLTQ

Query:  DKNEAM----------KQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-HNKTHLAAQQESEYTNSESESAPSSSPA
        D  EAM          KQ+++  P  L+DKS C  GSKAT R    +NRT I E+R +N  G ++MRPTL  H KT +  QQESE+TNSESES  SSS A
Subjt:  DKNEAM----------KQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-HNKTHLAAQQESEYTNSESESAPSSSPA

Query:  TRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIF----HHHHHHYHNGHNS--MWKQVRRMF
        T+QTSE+ETT   SS   Q   PATGSE SS+    SS IS +AF+ SHGKK SKKA+GRFK LRNKLGLIF    HHHHHH+HN HN+  MWKQ+R++F
Subjt:  TRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIF----HHHHHHYHNGHNS--MWKQVRRMF

Query:  HRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYKAR
        H T KK +TSK  ++  L+KT IRSVSR NQVG+FQALAEGLRSHVWK  AMKKKE R    GK  G KKLHWW+M  RRRGVKLPNKGRVKIGY+ +  
Subjt:  HRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYKAR

Query:  QTSNELPRSHKVV
        Q        HK+V
Subjt:  QTSNELPRSHKVV

A0A6J1H1S0 uncharacterized protein LOC111459571 isoform X11.5e-26799.59Show/hide
Query:  YSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHN
        YSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHN
Subjt:  YSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHN

Query:  QTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPL
        QTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPL
Subjt:  QTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPL

Query:  TQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESET
        TQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESET
Subjt:  TQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESET

Query:  TADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKN
        TADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKN
Subjt:  TADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKN

Query:  GMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK
        GMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGY+ K
Subjt:  GMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK

A0A6J1H2T7 uncharacterized protein LOC111459571 isoform X21.4e-26099.58Show/hide
Query:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI
        MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI
Subjt:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI

Query:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE
        EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE
Subjt:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE

Query:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS
        LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS
Subjt:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTADSSSPGDQSS

Query:  PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS
        PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS
Subjt:  PPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVS

Query:  RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK
        RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGY+ K
Subjt:  RNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYK

A0A6J1K0S1 uncharacterized protein LOC111491355 isoform X26.1e-22486.89Show/hide
Query:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI
        MEADELYLDLLALRQLY FLLKCCLRDANSELVVGARAKIL KHLLDDATTGLLEFHSKTL FYNFLRKDDKQTKPLDEKVAEWMEHNQTAR MANPEKI
Subjt:  MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKI

Query:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE
        EHKP RDRASASNVAANDLSSGI+SALRRIELHILSLQRYTRSHISETKLAYYGQSV+QGNES N QKVKPMVANHCSKFV+GFRIPLTQDK+EAMKQHE
Subjt:  EHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRIPLTQDKNEAMKQHE

Query:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-------------HNKTHLAAQQESEYTNSESESAPSSSPATRQTSESE
        L LPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVM+PTLW             HN+ HLAAQQESE+TNSES S   SSPAT QTSESE
Subjt:  LALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-------------HNKTHLAAQQESEYTNSESESAPSSSPATRQTSESE

Query:  TTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFH------HHHHHYHNGHNSMWKQVRRMFHRTGKKEL
        TT DSSSP +QSSP ATGSEASSQ GNSSSNI+R+AFKFSHGKKES  AVGRFKSLRNKLGLIFH      HHHHH+H+GHNSMWKQVR +FHRT KKEL
Subjt:  TTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFH------HHHHHYHNGHNSMWKQVRRMFHRTGKKEL

Query:  TSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYL
        TSKEEK G LRKTTIRSVSRNNQVGKFQAL EGLRSHVWKSKAMKKKEQRGLNC     GKKLHWWKMIRRRRGVK PNKGRVKIGY+
Subjt:  TSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYL

A0A6J1K5J4 uncharacterized protein LOC111491355 isoform X13.0e-23186.85Show/hide
Query:  LYYSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWME
        ++YSEALAYNLSDKMEADELYLDLLALRQLY FLLKCCLRDANSELVVGARAKIL KHLLDDATTGLLEFHSKTL FYNFLRKDDKQTKPLDEKVAEWME
Subjt:  LYYSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKTLPFYNFLRKDDKQTKPLDEKVAEWME

Query:  HNQTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRI
        HNQTAR MANPEKIEHKP RDRASASNVAANDLSSGI+SALRRIELHILSLQRYTRSHISETKLAYYGQSV+QGNES N QKVKPMVANHCSKFV+GFRI
Subjt:  HNQTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKVKPMVANHCSKFVHGFRI

Query:  PLTQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-------------HNKTHLAAQQESEYTNSESESA
        PLTQDK+EAMKQHEL LPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVM+PTLW             HN+ HLAAQQESE+TNSES S 
Subjt:  PLTQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLW-------------HNKTHLAAQQESEYTNSESESA

Query:  PSSSPATRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFH------HHHHHYHNGHNSMWK
          SSPAT QTSESETT DSSSP +QSSP ATGSEASSQ GNSSSNI+R+AFKFSHGKKES  AVGRFKSLRNKLGLIFH      HHHHH+H+GHNSMWK
Subjt:  PSSSPATRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFH------HHHHHYHNGHNSMWK

Query:  QVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIG
        QVR +FHRT KKELTSKEEK G LRKTTIRSVSRNNQVGKFQAL EGLRSHVWKSKAMKKKEQRGLNC     GKKLHWWKMIRRRRGVK PNKGRVKIG
Subjt:  QVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIG

Query:  YL
        Y+
Subjt:  YL

SwissProt top hitse value%identityAlignment
P29111 Major oleosin NAP-II (Fragment)3.8e-2157.69Show/hide
Query:  RSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDLPGKKSYK
        +S ++ KA+TA T GGSLL LS LT+ GTVIAL +ATPL VIFSP+LVPA ITV+LLI GFL SGGFGIAA  V  W ++Y  G     ++ L   +  K
Subjt:  RSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDLPGKKSYK

Query:  FGGK
         GGK
Subjt:  FGGK

P29525 Oleosin 18.5 kDa1.1e-2050.79Show/hide
Query:  PNKGRVKIGYLYKARQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWF
        P  GR +  Y    R   ++  +S ++ KA TA T GGSLL LS LT+ GTVIAL +ATPL VIFSP+LVPA ITV+LLI GFL SGGFGIAA  V  W 
Subjt:  PNKGRVKIGYLYKARQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWF

Query:  FRYVAGRRQDANNDLPGKKSYKFGGK
        ++Y  G     ++ L   +  K G K
Subjt:  FRYVAGRRQDANNDLPGKKSYKFGGK

Q43804 Oleosin 19.1e-2358.18Show/hide
Query:  PRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDLPGKKSY
        PRS++V KA TA T GGSLL LSGL + GTVIAL IATPL VIFSPVLVPA ITV+L+ MGFL SGGFG+AA  V  W ++YV G +Q    D   +  +
Subjt:  PRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDLPGKKSY

Query:  KFGGKGIQVE
        K  GK   ++
Subjt:  KFGGKGIQVE

Q45W87 Oleosin Ara h 11.01011.0e-2152.89Show/hide
Query:  LYKARQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQD
        LY   +   E PRS ++VKA TA   GGSLL L+GL + GTVI L   TPLFVIFSPVLVPA ITV+LL +GFL SGGFG+AA  V  W +RYV G+   
Subjt:  LYKARQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQD

Query:  ANNDLPGKKSYKFGGKGIQVE
          N L   + +K  GK  +++
Subjt:  ANNDLPGKKSYKFGGKGIQVE

Q9XHP2 Oleosin L1.0e-2161.62Show/hide
Query:  PRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDLPGKKS
        PR+ +VVKA TA T GGSLL LSGLT+ GTVIAL IATPL VIFSPVLVPA IT+ LL  GFL SGGFG+AA  V  W +RY+ G+     + L   K+
Subjt:  PRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDLPGKKS

Arabidopsis top hitse value%identityAlignment
AT2G25890.1 Oleosin family protein2.5e-2058.06Show/hide
Query:  RQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGR
        R      P + ++V+ +TAAT G SLL LSGLT+TGTVI L +ATPL V+FSPVLVPA IT+ LL MGFL SGG G+AAA    W ++YV G+
Subjt:  RQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGR

AT4G25140.1 oleosin 17.9e-2250.79Show/hide
Query:  PNKGRVKIGYLYKARQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWF
        P  GR +  Y    R   ++  +S ++ KA TA T GGSLL LS LT+ GTVIAL +ATPL VIFSP+LVPA ITV+LLI GFL SGGFGIAA  V  W 
Subjt:  PNKGRVKIGYLYKARQTSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWF

Query:  FRYVAGRRQDANNDLPGKKSYKFGGK
        ++Y  G     ++ L   +  K G K
Subjt:  FRYVAGRRQDANNDLPGKKSYKFGGK

AT5G40420.1 oleosin 25.6e-1243.62Show/hide
Query:  PRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDL
        P S +V+  +      GSLLAL+GL + G+VI L +A PLF++FSPV+VPAA+T+ L + GFL SG FG+       W   Y+ G R+     L
Subjt:  PRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDL

AT5G51210.1 oleosin37.9e-2260.64Show/hide
Query:  TSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQD
        T    P++ ++VKA TA T GGSLL LSGLT+ GTVIAL +ATPL VIFSPVLVPA +TV+L+I GFL SGGFGIAA     W +R++ G   D
Subjt:  TSNELPRSHKVVKAITAATTGGSLLALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQD

AT5G63720.1 kokopelli9.9e-1734.26Show/hide
Query:  VMRPTLWHNKT--------HLAAQQESEYTNSESESAP-------SSSPATRQTSESETTA--DSSSPGDQSSPPATG---SEASSQCGNSSSNISREAF
        +M+PTL   +T           A Q    T SESE          S    +   SE ET A  D+ S  + S PP      SE S+   ++  + SRE  
Subjt:  VMRPTLWHNKT--------HLAAQQESEYTNSESESAP-------SSSPATRQTSESETTA--DSSSPGDQSSPPATG---SEASSQCGNSSSNISREAF

Query:  KFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHN------SMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSH
              K+ +  +GRFK ++NK+G IFHHHHHH+H+ H+      S W +++  FH   K +  SKE K  M     + +  + +Q G F AL EGL  H
Subjt:  KFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHN------SMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQALAEGLRSH

Query:  VWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRR--GVKLPNKGRVKIG
           SK  K+K Q        +  KK  WWK++++R+  GVK+P +GRVK+G
Subjt:  VWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRR--GVKLPNKGRVKIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGGAAATCTGCGAAGGAAAGATCAGGCAGCCTACTTCTTCCCCATTGAGATCAGTTTATATGCACTTCATTACAATAATTGGCTTCCCAGAATGACAACAGAACT
GCTTTATTATTCTGAAGCTTTGGCCTATAATTTATCAGACAAGATGGAAGCTGATGAACTATATCTTGATCTCCTAGCACTGAGGCAACTGTACGTCTTTCTCTTAAAAT
GCTGTTTGCGGGATGCAAATTCAGAACTTGTGGTGGGTGCAAGGGCAAAGATTTTATTCAAGCATTTGCTCGATGATGCCACTACTGGACTTCTCGAGTTTCACTCGAAG
ACTCTGCCATTTTACAACTTTTTACGCAAAGATGATAAACAGACAAAGCCACTGGACGAGAAAGTTGCTGAATGGATGGAACATAATCAAACTGCAAGAACGATGGCAAA
TCCAGAGAAGATTGAACACAAACCAGGAAGGGACAGAGCTTCAGCGTCAAATGTTGCCGCTAATGACTTGTCAAGTGGGATCAGTTCAGCACTCAGAAGAATTGAACTCC
ACATTTTATCCCTGCAACGTTATACAAGAAGCCATATCAGTGAAACTAAATTAGCTTACTATGGGCAGTCTGTTCATCAGGGGAATGAGTCATTAAACCACCAAAAAGTT
AAGCCGATGGTGGCAAACCATTGTTCCAAGTTCGTGCATGGATTCAGAATACCTCTGACTCAAGACAAGAACGAGGCCATGAAACAGCACGAACTTGCGCTTCCACCGAC
TCTGATGGATAAATCAGGATGTCCAGAAGGATCCAAGGCAACTGCCAGGCGCGCTATGAAACTGAATCGAACTTGGATACAAGAAAAGAGGAGCAAGAATTCACGTGGTC
GTATTGTAATGAGACCAACTTTGTGGCATAACAAGACCCATCTGGCTGCCCAGCAAGAATCAGAATACACAAACTCAGAATCAGAATCAGCACCTTCTTCAAGTCCGGCA
ACTCGACAAACCAGTGAAAGTGAAACCACTGCTGATTCTTCTTCTCCCGGTGACCAATCCAGTCCACCGGCAACTGGTTCAGAGGCAAGTAGCCAGTGCGGAAACAGCAG
TAGCAACATTTCAAGAGAAGCATTCAAGTTCAGCCATGGGAAGAAAGAGTCCAAGAAAGCAGTAGGACGGTTCAAGAGTTTAAGAAACAAATTGGGCCTTATCTTCCACC
ACCACCATCACCATTACCATAACGGTCACAACTCCATGTGGAAGCAAGTAAGAAGGATGTTCCACCGTACAGGTAAGAAAGAACTAACAAGTAAAGAAGAAAAAAATGGG
ATGCTAAGGAAAACAACAATCAGAAGTGTGTCTCGGAATAACCAAGTTGGGAAGTTTCAAGCACTTGCTGAAGGGCTTCGAAGCCATGTATGGAAATCGAAAGCCATGAA
GAAGAAAGAGCAAAGAGGGCTTAACTGTGGGAAGACGAATGGTGGGAAGAAGCTGCATTGGTGGAAAATGATTCGACGCCGCCGTGGAGTGAAGTTGCCCAATAAAGGAC
GTGTGAAAATAGGGTATCTCTACAAAGCCCGACAGACCTCGAATGAATTACCCCGCTCCCACAAAGTCGTCAAGGCCATTACCGCAGCCACAACTGGCGGCTCCCTCCTC
GCCCTCTCCGGCCTCACAATGACTGGTACAGTCATTGCTCTAGCCATCGCCACACCGCTGTTTGTCATATTCAGTCCAGTTCTCGTCCCGGCGGCGATCACTGTCTCACT
TTTGATCATGGGGTTCTTGATATCTGGTGGATTCGGCATCGCTGCGGCGGTTGTTTCGTGGTGGTTTTTCAGGTACGTGGCTGGGAGAAGGCAAGATGCTAACAATGATT
TGCCCGGAAAAAAATCGTATAAGTTTGGCGGTAAGGGGATACAAGTTGAAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGGAAATCTGCGAAGGAAAGATCAGGCAGCCTACTTCTTCCCCATTGAGATCAGTTTATATGCACTTCATTACAATAATTGGCTTCCCAGAATGACAACAGAACT
GCTTTATTATTCTGAAGCTTTGGCCTATAATTTATCAGACAAGATGGAAGCTGATGAACTATATCTTGATCTCCTAGCACTGAGGCAACTGTACGTCTTTCTCTTAAAAT
GCTGTTTGCGGGATGCAAATTCAGAACTTGTGGTGGGTGCAAGGGCAAAGATTTTATTCAAGCATTTGCTCGATGATGCCACTACTGGACTTCTCGAGTTTCACTCGAAG
ACTCTGCCATTTTACAACTTTTTACGCAAAGATGATAAACAGACAAAGCCACTGGACGAGAAAGTTGCTGAATGGATGGAACATAATCAAACTGCAAGAACGATGGCAAA
TCCAGAGAAGATTGAACACAAACCAGGAAGGGACAGAGCTTCAGCGTCAAATGTTGCCGCTAATGACTTGTCAAGTGGGATCAGTTCAGCACTCAGAAGAATTGAACTCC
ACATTTTATCCCTGCAACGTTATACAAGAAGCCATATCAGTGAAACTAAATTAGCTTACTATGGGCAGTCTGTTCATCAGGGGAATGAGTCATTAAACCACCAAAAAGTT
AAGCCGATGGTGGCAAACCATTGTTCCAAGTTCGTGCATGGATTCAGAATACCTCTGACTCAAGACAAGAACGAGGCCATGAAACAGCACGAACTTGCGCTTCCACCGAC
TCTGATGGATAAATCAGGATGTCCAGAAGGATCCAAGGCAACTGCCAGGCGCGCTATGAAACTGAATCGAACTTGGATACAAGAAAAGAGGAGCAAGAATTCACGTGGTC
GTATTGTAATGAGACCAACTTTGTGGCATAACAAGACCCATCTGGCTGCCCAGCAAGAATCAGAATACACAAACTCAGAATCAGAATCAGCACCTTCTTCAAGTCCGGCA
ACTCGACAAACCAGTGAAAGTGAAACCACTGCTGATTCTTCTTCTCCCGGTGACCAATCCAGTCCACCGGCAACTGGTTCAGAGGCAAGTAGCCAGTGCGGAAACAGCAG
TAGCAACATTTCAAGAGAAGCATTCAAGTTCAGCCATGGGAAGAAAGAGTCCAAGAAAGCAGTAGGACGGTTCAAGAGTTTAAGAAACAAATTGGGCCTTATCTTCCACC
ACCACCATCACCATTACCATAACGGTCACAACTCCATGTGGAAGCAAGTAAGAAGGATGTTCCACCGTACAGGTAAGAAAGAACTAACAAGTAAAGAAGAAAAAAATGGG
ATGCTAAGGAAAACAACAATCAGAAGTGTGTCTCGGAATAACCAAGTTGGGAAGTTTCAAGCACTTGCTGAAGGGCTTCGAAGCCATGTATGGAAATCGAAAGCCATGAA
GAAGAAAGAGCAAAGAGGGCTTAACTGTGGGAAGACGAATGGTGGGAAGAAGCTGCATTGGTGGAAAATGATTCGACGCCGCCGTGGAGTGAAGTTGCCCAATAAAGGAC
GTGTGAAAATAGGGTATCTCTACAAAGCCCGACAGACCTCGAATGAATTACCCCGCTCCCACAAAGTCGTCAAGGCCATTACCGCAGCCACAACTGGCGGCTCCCTCCTC
GCCCTCTCCGGCCTCACAATGACTGGTACAGTCATTGCTCTAGCCATCGCCACACCGCTGTTTGTCATATTCAGTCCAGTTCTCGTCCCGGCGGCGATCACTGTCTCACT
TTTGATCATGGGGTTCTTGATATCTGGTGGATTCGGCATCGCTGCGGCGGTTGTTTCGTGGTGGTTTTTCAGGTACGTGGCTGGGAGAAGGCAAGATGCTAACAATGATT
TGCCCGGAAAAAAATCGTATAAGTTTGGCGGTAAGGGGATACAAGTTGAAGGTTGA
Protein sequenceShow/hide protein sequence
MTGNLRRKDQAAYFFPIEISLYALHYNNWLPRMTTELLYYSEALAYNLSDKMEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSK
TLPFYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKIEHKPGRDRASASNVAANDLSSGISSALRRIELHILSLQRYTRSHISETKLAYYGQSVHQGNESLNHQKV
KPMVANHCSKFVHGFRIPLTQDKNEAMKQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGRIVMRPTLWHNKTHLAAQQESEYTNSESESAPSSSPA
TRQTSESETTADSSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIFHHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNG
MLRKTTIRSVSRNNQVGKFQALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYLYKARQTSNELPRSHKVVKAITAATTGGSLL
ALSGLTMTGTVIALAIATPLFVIFSPVLVPAAITVSLLIMGFLISGGFGIAAAVVSWWFFRYVAGRRQDANNDLPGKKSYKFGGKGIQVEG