CuGenDBv2

Sed0019931 (gene) of Chayote v1 genome

Gene ID: Sed0019931
Organism: Sechium edule (Chayote v1)
Description: Unknown protein
Genome location: LG06:35553724..35557138
RNA-Seq Expression: Sed0019931
Synteny: Sed0019931
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
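
The genome location above is written as `sequence:start..end`. A minimal sketch of parsing it follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` and `Location` are hypothetical names chosen for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG06:35553724..35557138")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3415 bp on LG06. With a half-open convention the figure would be 3414.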


Homology
GenBank top hits
KAG6606925.1 hypothetical protein SDJN03_00267, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.0e-88; % identity: 77.5)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        MSGLDCMAT+D+EFEIDLEGG  TSEDD+SSE+DS SKPH RKA SRLRSGFLC DGSINRGCSFAS SN TK  KLGVDENVELL+DKS DGEKRRELG
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM
         LAE  KN+KE  K NGK+HKPPRPPRGPSLDAADRIFVKE+T+LAVKKRA+VERIKALKKMKAEKTSSF+S+LPALFITLLFF VIIFQGMSA G+ +M
Subjt:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM

Query:  LESPVPSVGGSAG--SIQPSSIPSLKVDELQSRALKFAGK
         ESPVPS   SAG  S+Q SS     V+  QS  L FAG+
Subjt:  LESPVPSVGGSAG--SIQPSSIPSLKVDELQSRALKFAGK

KAG7036629.1 hypothetical protein SDJN02_00248 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.5e-87; % identity: 77.08)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        MSGLDCMAT+D+EFEIDLEGG  TSEDD+SSE+DS SKPH RKA SRLRSGFLC DGSIN+GCSFAS SN TK  KLGVDENVELL+DKS DGEKRRELG
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM
         LAE  KN+KE  K NGK+HKPPRPPRGPSLDAADRIFVKE+T+LAVKKRA+VERIKALKKMKAEKTSSF+S+LPALFITLLFF VIIFQGMSA G+ +M
Subjt:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM

Query:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK
         ESPVPS   SAG I  Q SS     V+  QS  L FAG+
Subjt:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK

XP_004147994.1 uncharacterized protein LOC101214824 [Cucumis sativus] (e value: 1.9e-87; % identity: 75.9)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        M  +DCMATKD+EFEIDLEGGGNTSEDD+SSETDS SKPHARK F RLRSGFL +D S++R   FAS SN TK VKLGVDENVELL++ SSDGEKRRE G
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML
          AEK NVK K KKNGK+HKPPRPPRGPSLDAADRIFV+EI ELAVKKRA+VERIKALKKMKAEKTSSF+SSLPALFITLLFF VIIFQGMSAKG+TM+ 
Subjt:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML

Query:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPAT
          +SP PSVGGSAG I   S+    S  V+E +S  L FAGKQ  DPAT
Subjt:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPAT

XP_022948547.1 uncharacterized protein LOC111452194 [Cucurbita moschata] (e value: 4.7e-86; % identity: 76.67)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        MSGLDCMAT+D+EFEIDLEGG  TSEDD+SSE+DS SKPH RKA SRLRSGFLC DGSINRGCSFAS SN TK  KLGVDENVELL+DKS DGEKRRELG
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM
         LAE  KN+KE  K NGK+HKPPRPPR PSLDAADRIFVKE+T+LAVKKRA+VERIKALKK KAEKTSSF+S+LPALFITLLFF VIIFQGMSA G+ +M
Subjt:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM

Query:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK
         ESPVPS   SAG I  Q SS     V+  QS  L FAG+
Subjt:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK

XP_038903260.1 uncharacterized protein LOC120089897 [Benincasa hispida] (e value: 2.8e-86; % identity: 77.51)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        M GLDCMATKD+EFEIDLEGGGNTSEDD+SSETDS SK HARK F+RLRSGFL +DGSI+R  SFAS SN TK VKLGVDENVEL +D SSDGEKRRE G
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAE-KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML
          AE KNVK K    GK+HKPPRPPRGPSLDAADRIFVKE+TELAVKKRA+VERIKALKKMKAEK SSF+SSLPALFITLLFF VIIFQGMSAKG+TM+L
Subjt:  TLAE-KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML

Query:  --ESPVPSVGGSAGSI-QPSSIPSLKVDELQSRALKFAGKQRPDPATIR
          +SP PSVGGSAG I Q S   S  V+E QS  L FAGKQ  DP  IR
Subjt:  --ESPVPSVGGSAGSI-QPSSIPSLKVDELQSRALKFAGKQRPDPATIR

TrEMBL top hits
A0A0A0LB07 Uncharacterized protein (e value: 9.3e-88; % identity: 75.9)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        M  +DCMATKD+EFEIDLEGGGNTSEDD+SSETDS SKPHARK F RLRSGFL +D S++R   FAS SN TK VKLGVDENVELL++ SSDGEKRRE G
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML
          AEK NVK K KKNGK+HKPPRPPRGPSLDAADRIFV+EI ELAVKKRA+VERIKALKKMKAEKTSSF+SSLPALFITLLFF VIIFQGMSAKG+TM+ 
Subjt:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML

Query:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPAT
          +SP PSVGGSAG I   S+    S  V+E +S  L FAGKQ  DPAT
Subjt:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPAT

A0A1S3BHI9 uncharacterized protein LOC103489909 (e value: 1.1e-85; % identity: 74.6)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        M  LDCMATKD+EFEIDLEGGGNTSEDD+SSETDS SK HARK F RLRSGFL +D S++R  +FAS SN TK VKLGVD+NVELL++ SSDGEKRRE G
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML
          AEK NVK K KKNGK+HKPPRPPRGPSLDAADRIFV+EI ELAVKKRA+VERIKALKKMKAEK SSF+SSLPALFITLLFF VIIFQGMSAKG+TM++
Subjt:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML

Query:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPA
          +SP PSVGG AG I   S+    S  V+E +S  L FAGKQ  DPA
Subjt:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPA

A0A5D3DAG0 Putative transmembrane protein (e value: 1.1e-85; % identity: 74.6)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        M  LDCMATKD+EFEIDLEGGGNTSEDD+SSETDS SK HARK F RLRSGFL +D S++R  +FAS SN TK VKLGVD+NVELL++ SSDGEKRRE G
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML
          AEK NVK K KKNGK+HKPPRPPRGPSLDAADRIFV+EI ELAVKKRA+VERIKALKKMKAEK SSF+SSLPALFITLLFF VIIFQGMSAKG+TM++
Subjt:  TLAEK-NVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMML

Query:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPA
          +SP PSVGG AG I   S+    S  V+E +S  L FAGKQ  DPA
Subjt:  --ESPVPSVGGSAGSIQPSSI---PSLKVDELQSRALKFAGKQRPDPA

A0A6J1G9J4 uncharacterized protein LOC111452194 (e value: 2.3e-86; % identity: 76.67)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        MSGLDCMAT+D+EFEIDLEGG  TSEDD+SSE+DS SKPH RKA SRLRSGFLC DGSINRGCSFAS SN TK  KLGVDENVELL+DKS DGEKRRELG
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM
         LAE  KN+KE  K NGK+HKPPRPPR PSLDAADRIFVKE+T+LAVKKRA+VERIKALKK KAEKTSSF+S+LPALFITLLFF VIIFQGMSA G+ +M
Subjt:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM

Query:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK
         ESPVPS   SAG I  Q SS     V+  QS  L FAG+
Subjt:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK

A0A6J1KGV9 uncharacterized protein LOC111493099 (e value: 1.1e-85; % identity: 76.67)
Query:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG
        MSGLDCMAT+D+EFEIDLEGG  TSE D+SSE+DS SKPH RKA SRLRSGFLC +GSINRGCSF S SN TK  KLGVDENVELL+DKS DGEKRRELG
Subjt:  MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELG

Query:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM
         LAE  KNVKE  K NGK+HKPPRPPRGPSLDAADRIFVKE+T+LAVKKRA VERIKALKKMKAEKTSSF+S+LPALFITLLFF VII QGMSA G+ +M
Subjt:  TLAE--KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMM

Query:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK
         ESPVPS   SAG I  Q SS     V+ LQS  L FAG+
Subjt:  LESPVPSVGGSAGSI--QPSSIPSLKVDELQSRALKFAGK

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G02380.1 unknown protein (e value: 9.4e-16; % identity: 31.4)
Query:  LDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELGTLA
        +D M + +++ E+D+E G +    + +S+T S +   + +A                         N     K+  D +  L+ D++      + L    
Subjt:  LDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELGTLA

Query:  EKNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERI-KALKKMKAEKTSSFSSSLP--ALFITLLFFAVIIFQGMSAKGNTMMLE
        +K    K KK+ K  KPPRPP+GPSL   DR  +++I ELA++KRA +ER+ K+LK++KA KTS  S  +   ++ IT +FFA ++FQG S   ++M  +
Subjt:  EKNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERI-KALKKMKAEKTSSFSSSLP--ALFITLLFFAVIIFQGMSAKGNTMMLE

Query:  -SPVPSV
         SP P+V
Subjt:  -SPVPSV

AT3G17120.1 unknown protein (e value: 5.5e-16; % identity: 47.06)
Query:  KEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLP---ALFITLLFFAVIIFQGMSAKGNTMMLESPVP
        KEK KK+    KPPRPPRGPSLDAAD+  ++EI ELA+ KRA +ER++ALKK +A K +S +SSL    A   T +FF V++FQG+S +      +S + 
Subjt:  KEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLP---ALFITLLFFAVIIFQGMSAKGNTMMLESPVP

Query:  SVGGSAG---SIQPSSIPS
          G + G   S+Q +  PS
Subjt:  SVGGSAG---SIQPSSIPS

AT3G17120.2 unknown protein (e value: 5.5e-16; % identity: 47.06)
Query:  KEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLP---ALFITLLFFAVIIFQGMSAKGNTMMLESPVP
        KEK KK+    KPPRPPRGPSLDAAD+  ++EI ELA+ KRA +ER++ALKK +A K +S +SSL    A   T +FF V++FQG+S +      +S + 
Subjt:  KEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLP---ALFITLLFFAVIIFQGMSAKGNTMMLESPVP

Query:  SVGGSAG---SIQPSSIPS
          G + G   S+Q +  PS
Subjt:  SVGGSAG---SIQPSSIPS

AT4G01960.1 unknown protein (e value: 1.4e-16; % identity: 32.67)
Query:  KDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRE-----LGTLAE
        ++++ ++D+E G      +I++   S ++  +    + + SG L  DGS                     +++ + LVD      KRRE     L     
Subjt:  KDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRE-----LGTLAE

Query:  KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMMLE-SPV
        K   +K KK  K  KPPRPP+GP L A D+  ++EITELA++KRA +ER+K L+++KA K+SS  SS+ A+ +T++FF  +IFQG      ++  + SP 
Subjt:  KNVKEKTKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMMLE-SPV

Query:  PS
        P+
Subjt:  PS
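
In the alignments above, the unlabeled middle line under each Query line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of counting these for a single block follows. The function name `midline_stats` is hypothetical, and the listed % identity values are computed over whole alignments, so counts from one block are not expected to reproduce them.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one aligned block.

    Both strings must already have the 8-character 'Query:  ' prefix
    (or the matching 8 spaces on the middle line) removed.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    # A letter on the midline means the two residues are identical.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the AT1G02380.1 alignment, prefixes stripped.
query = "-SPVPSV"
midline = " SP P+V"
identities, positives, columns = midline_stats(query, midline)
print(identities, positives, columns)
```

Summing these counts over every block of one hit and dividing identities by total columns gives a per-alignment identity fraction, assuming gap columns are included in the denominator.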


Sequences
CDS sequence
ATGAGTGGTTTAGATTGTATGGCTACAAAAGATAAAGAGTTTGAAATTGATCTCGAAGGTGGTGGTAATACTAGCGAAGATGATATAAGCAGTGAAACTGACTCAATATC
AAAACCCCATGCTAGAAAAGCCTTCAGCAGGCTTCGGAGTGGCTTCCTATGTGCCGATGGATCTATAAATCGAGGTTGTAGCTTTGCTTCCTGCAGTAATTTGACCAAGC
ACGTTAAACTTGGTGTTGATGAAAATGTGGAATTGTTGGTGGACAAGAGTTCAGATGGAGAAAAGAGAAGAGAACTCGGGACTCTTGCAGAGAAGAACGTGAAAGAGAAG
ACTAAGAAGAATGGAAAGATTCACAAGCCACCACGACCTCCCAGGGGTCCGTCGTTGGATGCTGCTGACAGGATCTTCGTTAAAGAGATCACTGAGTTGGCAGTGAAAAA
GCGTGCCTCGGTTGAGCGAATAAAAGCTTTGAAGAAGATGAAAGCAGAGAAAACATCTTCATTCAGCAGCAGCTTGCCTGCCTTATTTATCACATTGCTTTTCTTTGCTG
TCATTATCTTTCAAGGAATGAGTGCTAAAGGTAACACAATGATGTTGGAGTCTCCTGTGCCCTCTGTTGGCGGTAGCGCAGGCTCGATACAGCCTTCATCTATACCTTCC
CTTAAAGTTGACGAACTTCAATCCCGTGCTCTGAAATTTGCAGGAAAGCAAAGGCCTGATCCTGCTACCATACGATAA
mRNA sequence
ATTAGAAATACTCTTTTAAGGGCCGCACCTCACAATTTTCACTTTTCAGATCTGAAAATTTCTCCCCATTTTGATTTTGTCCCCAAAATCCCTTCTCATCTCACGATCTG
AGTTCTCTCGCTTTCTTTCTTTCTTTCTCTCTCATCGTTTTCGTTCCTGAAATCGTTCGTATTTTCAGCCCTTTTTTAATCCCCAGATCTCTCTCTGTTGTTCGTCTGCC
ACTTGGATCTGTGTAGAACTCTAGCTTGAAAATGGCGGAAATGGCTTCTTCAACTCATTGATTCACTGTTTGTCGTTTAGAGGTTCACCAGAGTGTTGACAAGATGAGAA
GAAAAGAAGGCCCCTTCCATTTGGGATCTTGGTAGAATGAATGCCCTGCCGAAATGGGAAGTGAAACTTCTTTAGATGAGTGGTTTAGATTGTATGGCTACAAAAGATAA
AGAGTTTGAAATTGATCTCGAAGGTGGTGGTAATACTAGCGAAGATGATATAAGCAGTGAAACTGACTCAATATCAAAACCCCATGCTAGAAAAGCCTTCAGCAGGCTTC
GGAGTGGCTTCCTATGTGCCGATGGATCTATAAATCGAGGTTGTAGCTTTGCTTCCTGCAGTAATTTGACCAAGCACGTTAAACTTGGTGTTGATGAAAATGTGGAATTG
TTGGTGGACAAGAGTTCAGATGGAGAAAAGAGAAGAGAACTCGGGACTCTTGCAGAGAAGAACGTGAAAGAGAAGACTAAGAAGAATGGAAAGATTCACAAGCCACCACG
ACCTCCCAGGGGTCCGTCGTTGGATGCTGCTGACAGGATCTTCGTTAAAGAGATCACTGAGTTGGCAGTGAAAAAGCGTGCCTCGGTTGAGCGAATAAAAGCTTTGAAGA
AGATGAAAGCAGAGAAAACATCTTCATTCAGCAGCAGCTTGCCTGCCTTATTTATCACATTGCTTTTCTTTGCTGTCATTATCTTTCAAGGAATGAGTGCTAAAGGTAAC
ACAATGATGTTGGAGTCTCCTGTGCCCTCTGTTGGCGGTAGCGCAGGCTCGATACAGCCTTCATCTATACCTTCCCTTAAAGTTGACGAACTTCAATCCCGTGCTCTGAA
ATTTGCAGGAAAGCAAAGGCCTGATCCTGCTACCATACGATAAGCGGGCTCGATGGAAGATTTGAAGGATCATTGAAGTTTAGTGTTAATAATCTGGTAAATCATCGCCA
GCTCTTTTAGACCGAAATTTTCGTACCTTATAACTTGATTAAAGGCTTCAGATCGTGTTGGTATCTGTAAATATGCAAAGGCCTTAAACGCTCTTTTGCTAGGCTTTTTC
TTGTACAGAGAGTTCACATTTGATTGATCCATATTATTTTTGTTATAATCAAATTGGCCATGCTGTCAACAGAACCCAGAACTGTAACTTTGGTGGTGC
Protein sequence
MSGLDCMATKDKEFEIDLEGGGNTSEDDISSETDSISKPHARKAFSRLRSGFLCADGSINRGCSFASCSNLTKHVKLGVDENVELLVDKSSDGEKRRELGTLAEKNVKEK
TKKNGKIHKPPRPPRGPSLDAADRIFVKEITELAVKKRASVERIKALKKMKAEKTSSFSSSLPALFITLLFFAVIIFQGMSAKGNTMMLESPVPSVGGSAGSIQPSSIPS
LKVDELQSRALKFAGKQRPDPATIR
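
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1). The page does not name the table, and `translate` is a hypothetical helper. Only the first 30 and last 24 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the CDS listed above.
print(translate("ATGAGTGGTTTAGATTGTATGGCTACAAAA"))
print(translate("CCTGATCCTGCTACCATACGATAA"))
```

The two fragments translate to the first ten and last seven residues of the protein sequence shown above. Passing the full CDS with its line breaks joined should yield the whole protein under the same assumption.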