; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G06840 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G06840
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionproactivator polypeptide-like 1
Genome locationClcChr04:20359807..20368581
RNA-Seq ExpressionClc04G06840
SyntenyClc04G06840
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006665 - sphingolipid metabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR007856 - Saposin-like type B, region 1
IPR008138 - Saposin B type, region 2
IPR008139 - Saposin B type domain
IPR011001 - Saposin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603544.1 WEB family protein, partial [Cucurbita argyrosperma subsp. sororia]1.8e-10883.19Show/hide
Query:  EASGAMDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECI
        E  GAMDLRFGIVFLLVVGVAWDCDARNLASFDSELSYL++ KDV ALSEASS PK+C+LCE+LVSQAVEY A+NQTQSEII ILRQTC   G+FKEEC+
Subjt:  EASGAMDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECI

Query:  SLVDSYVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANS
        SLVDSYVPLFFSE SSIEPASICQSVRFCEQVT+ISSQIQ+H+CEFCHQT++KILDKLKDPDTQ+EILQ LLN+CDS G R KECKKLVFEYGPLILANS
Subjt:  SLVDSYVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANS

Query:  EKILEQTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        EKILEQTDICKAIHAC G   GD  +SSVG  S LADA
Subjt:  EKILEQTDICKAIHACPGEPRGDNTVSSVGIASLLADA

XP_004149924.1 proactivator polypeptide-like 1 [Cucumis sativus]9.0e-10885.84Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MDLRF IVFLLV+GVA  C+ARNLASFDSELSYLE+EKDV ALSEASSN KIC LCE+L+SQAVEYFADNQTQSEIIG+LRQTCG AGVFKEECISLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFS+ISSIEP+SICQS   CEQVTIISS  QDHNCEFCHQT++KILDKLKDPDTQIEILQTLLNMCDS  YRVKECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDI KAIHACP +P GDN VSSVG    LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

XP_008464263.1 PREDICTED: proactivator polypeptide-like 1 [Cucumis melo]6.7e-11186.7Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MD RF IVFLLV+ VAW CDARNLASFDSELSYLE+EKDV ALSEASSNPKIC LCE+L+SQAVEYFADNQTQSEIIG+LRQTCG AGVFKEECISLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEP+SICQS  FCEQVTIISS  QDHNCEFCHQT++KILDKLKDPDTQIEILQTLL++CDS  YRVKECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDICKAIHACP +P GDN VSSVG    LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

XP_022949668.1 proactivator polypeptide-like 1 [Cucurbita moschata]5.9e-10783.69Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYL++ KDV ALSEASS PK+C+LCE+LVSQAVEY A+NQTQSEII ILRQTC   G+FKEEC+SLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFSE SSIEPASICQSVRFCEQVT+ISSQIQ+H+CEFCHQT++KILDKLKDPDTQ+EILQ LLN+CDS G R KECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDICKAIHAC G   GD  +SSVG  S LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

XP_038882493.1 proactivator polypeptide-like 1 [Benincasa hispida]3.1e-11691.88Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MDLRFGIVFLLVVG AWDCDARNLASFDS LSYLE+ KDV +LSEASSN KICKLCE+LVSQAVEYFADNQTQSEII ILRQTCGAAGVFKEECI LVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPASICQS RFCEQVTIISSQIQDHNCEFCHQT++KILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTV-SSVGIASLLADA
        QTDICKAIHACPG PRGDNT+  SVG AS LADA
Subjt:  QTDICKAIHACPGEPRGDNTV-SSVGIASLLADA

TrEMBL top hitse value%identityAlignment
A0A0A0L045 Uncharacterized protein4.4e-10885.84Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MDLRF IVFLLV+GVA  C+ARNLASFDSELSYLE+EKDV ALSEASSN KIC LCE+L+SQAVEYFADNQTQSEIIG+LRQTCG AGVFKEECISLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFS+ISSIEP+SICQS   CEQVTIISS  QDHNCEFCHQT++KILDKLKDPDTQIEILQTLLNMCDS  YRVKECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDI KAIHACP +P GDN VSSVG    LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

A0A1S3CLJ1 proactivator polypeptide-like 13.2e-11186.7Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MD RF IVFLLV+ VAW CDARNLASFDSELSYLE+EKDV ALSEASSNPKIC LCE+L+SQAVEYFADNQTQSEIIG+LRQTCG AGVFKEECISLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEP+SICQS  FCEQVTIISS  QDHNCEFCHQT++KILDKLKDPDTQIEILQTLL++CDS  YRVKECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDICKAIHACP +P GDN VSSVG    LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

A0A5A7U5A0 Proactivator polypeptide-like 13.2e-11186.7Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MD RF IVFLLV+ VAW CDARNLASFDSELSYLE+EKDV ALSEASSNPKIC LCE+L+SQAVEYFADNQTQSEIIG+LRQTCG AGVFKEECISLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEP+SICQS  FCEQVTIISS  QDHNCEFCHQT++KILDKLKDPDTQIEILQTLL++CDS  YRVKECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDICKAIHACP +P GDN VSSVG    LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

A0A6J1GCQ1 proactivator polypeptide-like 12.8e-10783.69Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYL++ KDV ALSEASS PK+C+LCE+LVSQAVEY A+NQTQSEII ILRQTC   G+FKEEC+SLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFSE SSIEPASICQSVRFCEQVT+ISSQIQ+H+CEFCHQT++KILDKLKDPDTQ+EILQ LLN+CDS G R KECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDICKAIHAC G   GD  +SSVG  S LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

A0A6J1IPM0 proactivator polypeptide-like 14.1e-10682.4Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        MDLRFGIVF LVVG AWDCDARNLASFDSELSYL++ KDV ALSEASS PK+C+LCE+LVSQAVEY A+NQTQSEII ILRQTC   G+FKEEC+SLVDS
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFFSE SSIEPASICQSVRFCEQVT+ISSQIQ+H+CEFCHQT++KILDKLKDPDTQ+EILQ LLN+CDS G R KECKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA
        QTDICKAIH+CPG    D  +SSVG  S LADA
Subjt:  QTDICKAIHACPGEPRGDNTVSSVGIASLLADA

SwissProt top hitse value%identityAlignment
P07602 Prosaposin1.5e-0429.36Show/hide
Query:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTK
        C +C+ +V+ A +   DN T+ EI+  L +TC           C  +VDSY+P+    I      P  +C ++  CE        +Q H  E  HQ   K
Subjt:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTK

Query:  ILDKLKDPD
         L+  K P+
Subjt:  ILDKLKDPD

P10960 Prosaposin2.5e-0429.11Show/hide
Query:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQV
        C +C+T+V++A     DN T+ EI+  L +TC           C  +VDSY+P+    I      P  +C ++  C+ +
Subjt:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQV

P26779 Prosaposin4.3e-0428.12Show/hide
Query:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQVTIISSQIQDHNCEFCHQ
        C +C+ +++ A     DN T+ EI+  L +TC           C  +VDSY+P+    I      P  +C ++  CE        +Q H  E  HQ
Subjt:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQVTIISSQIQDHNCEFCHQ

Q61207 Prosaposin1.9e-0430.38Show/hide
Query:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQV
        C +C+T+V++A     DN TQ EI+  L +TC           C  +VDSY+P+    I      P  +C ++  C+ +
Subjt:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCG--AAGVFKEECISLVDSYVPLFFSEISS--IEPASICQSVRFCEQV

Q8C1C1 Proactivator polypeptide-like 16.3e-1124.16Show/hide
Query:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAA-GVFKEECISLVDSYVPLFFSEISSIEPASICQSVRFCEQ-----------VTIISSQIQDHN--
        C +C  LV +  ++   N T++ I   L + C        ++CI+LVD+Y P     +S + P  +C++++ C              T  S  + + N  
Subjt:  CKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAA-GVFKEECISLVDSYVPLFFSEISSIEPASICQSVRFCEQ-----------VTIISSQIQDHN--

Query:  --CEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGY-RVKECKKLVFEYGPLILANSEKILEQTDICKAIHACPG
          C+ C + +      L    T+ +IL      C       V +C + V EY P+++ + + ++  TD+CK + AC G
Subjt:  --CEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGY-RVKECKKLVFEYGPLILANSEKILEQTDICKAIHACPG

Arabidopsis top hitse value%identityAlignment
AT3G51730.1 saposin B domain-containing protein5.9e-4139.72Show/hide
Query:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        M L+ G   LL++G+    DAR+    DS +S            + S+   +C LCE  V+ A+ Y   N TQ+EII  L   C     + ++CISLVD 
Subjt:  MDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE
        YVPLFF ++ S +P   C+ +  C +V  +  + +  +C  CH+TV++IL KL+DPDTQ++I++ L+  C S     K+CK LVFEYGPLIL N+E+ L 
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKILE

Query:  QTDICKAIHACPGE
        + D+C  + ACP E
Subjt:  QTDICKAIHACPGE

AT5G01800.1 saposin B domain-containing protein5.0e-4035.98Show/hide
Query:  RFGIVFLLVVGVAWDCDARN---LASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS
        RFG++ +L + ++W C A N   L  F+S                A  + ++C+LC+  V+  ++Y  D   Q+E++  L  +C      K++C+S+VD 
Subjt:  RFGIVFLLVVGVAWDCDARN---LASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYFADNQTQSEIIGILRQTCGAAGVFKEECISLVDS

Query:  YVPLFFSEISSIEPASICQSVRFCEQVT-IISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKIL
        Y  LFF+++S+I+   IC+ +  C+ VT   +SQ+   NCE C +TV++++ KLKDP+T+++I++ LL  C S      +CKK+VFEYGPL+L + +K L
Subjt:  YVPLFFSEISSIEPASICQSVRFCEQVT-IISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRVKECKKLVFEYGPLILANSEKIL

Query:  EQTDICKAIHACPG
        E+ D+C  +H CPG
Subjt:  EQTDICKAIHACPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCCCGAGCGAGAAGCGAGACTTGAGCGAGAGCGCGAGAGCCGAGACAGCGAAACCGGCGAGAGAGCGATACCACGAGGCTGCGACGACACTAGGCGCTGGGAAAG
CCGACACTCAAGGCGACAACCGGCGAGACCCGACGCACGAACCCTCCGGGAACTCCCGGAAACTGAAGGAGAGCGAGAGAAAGAGGTATCGCGGCAGAGGGAGGCTGGCG
GCGGCGAAAGGGAAAGGAAGGGGAGAGCGAGAGATGCGGCGGTGAGAGAAAGAGATGGTTACTTTCTTTTCTTTTTCTTTTTGACTGCACGGAAAACGAGCATACCCTTT
GTAGGTCCACCATTGGAGGCTATTAAAGGCCATTTGGCCCGTAAGCCCGCCTTGGTTAGTCTTTCTCAATTGGTTTTCTTTATAGGTTCGACGAGTCCTGTTTTGAAAGG
AATCGAGGCATCAGGCGCTATGGATTTGAGGTTTGGAATTGTTTTCCTTCTTGTGGTGGGTGTTGCTTGGGATTGTGATGCTAGAAATTTGGCATCATTTGATTCTGAGT
TAAGCTACCTGGAGCGAGAGAAGGACGTTGTGGCTTTATCTGAAGCTTCGAGCAATCCAAAGATATGTAAACTTTGTGAGACTTTGGTCAGTCAGGCAGTTGAATATTTT
GCAGATAACCAGACCCAGAGTGAGATTATTGGTATTCTCCGGCAAACATGTGGTGCGGCGGGCGTGTTCAAGGAGGAGTGCATCAGTCTGGTGGACAGCTATGTTCCTCT
CTTCTTCTCAGAGATTTCCTCAATTGAACCTGCTAGCATCTGCCAATCAGTCCGCTTCTGTGAGCAAGTTACTATAATCTCCTCGCAGATTCAGGATCATAACTGTGAAT
TCTGCCATCAGACTGTTACAAAAATATTGGATAAGTTGAAGGATCCTGACACACAGATAGAGATACTTCAGACCCTTCTGAATATGTGTGACTCTTTCGGGTACCGCGTG
AAAGAGTGCAAGAAATTGGTATTTGAATATGGGCCTCTGATCCTTGCCAACTCGGAGAAAATTCTGGAACAAACAGATATTTGCAAAGCAATACACGCTTGTCCGGGCGA
ACCTCGTGGTGACAACACTGTATCATCTGTTGGAATTGCGTCTCTGCTTGCCGACGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTCCCGAGCGAGAAGCGAGACTTGAGCGAGAGCGCGAGAGCCGAGACAGCGAAACCGGCGAGAGAGCGATACCACGAGGCTGCGACGACACTAGGCGCTGGGAAAG
CCGACACTCAAGGCGACAACCGGCGAGACCCGACGCACGAACCCTCCGGGAACTCCCGGAAACTGAAGGAGAGCGAGAGAAAGAGGTATCGCGGCAGAGGGAGGCTGGCG
GCGGCGAAAGGGAAAGGAAGGGGAGAGCGAGAGATGCGGCGGTGAGAGAAAGAGATGGTTACTTTCTTTTCTTTTTCTTTTTGACTGCACGGAAAACGAGCATACCCTTT
GTAGGTCCACCATTGGAGGCTATTAAAGGCCATTTGGCCCGTAAGCCCGCCTTGGTTAGTCTTTCTCAATTGGTTTTCTTTATAGGTTCGACGAGTCCTGTTTTGAAAGG
AATCGAGGCATCAGGCGCTATGGATTTGAGGTTTGGAATTGTTTTCCTTCTTGTGGTGGGTGTTGCTTGGGATTGTGATGCTAGAAATTTGGCATCATTTGATTCTGAGT
TAAGCTACCTGGAGCGAGAGAAGGACGTTGTGGCTTTATCTGAAGCTTCGAGCAATCCAAAGATATGTAAACTTTGTGAGACTTTGGTCAGTCAGGCAGTTGAATATTTT
GCAGATAACCAGACCCAGAGTGAGATTATTGGTATTCTCCGGCAAACATGTGGTGCGGCGGGCGTGTTCAAGGAGGAGTGCATCAGTCTGGTGGACAGCTATGTTCCTCT
CTTCTTCTCAGAGATTTCCTCAATTGAACCTGCTAGCATCTGCCAATCAGTCCGCTTCTGTGAGCAAGTTACTATAATCTCCTCGCAGATTCAGGATCATAACTGTGAAT
TCTGCCATCAGACTGTTACAAAAATATTGGATAAGTTGAAGGATCCTGACACACAGATAGAGATACTTCAGACCCTTCTGAATATGTGTGACTCTTTCGGGTACCGCGTG
AAAGAGTGCAAGAAATTGGTATTTGAATATGGGCCTCTGATCCTTGCCAACTCGGAGAAAATTCTGGAACAAACAGATATTTGCAAAGCAATACACGCTTGTCCGGGCGA
ACCTCGTGGTGACAACACTGTATCATCTGTTGGAATTGCGTCTCTGCTTGCCGACGCCTGA
Protein sequenceShow/hide protein sequence
MTPEREARLERERESRDSETGERAIPRGCDDTRRWESRHSRRQPARPDARTLRELPETEGEREKEVSRQREAGGGERERKGRARDAAVRERDGYFLFFFFLTARKTSIPF
VGPPLEAIKGHLARKPALVSLSQLVFFIGSTSPVLKGIEASGAMDLRFGIVFLLVVGVAWDCDARNLASFDSELSYLEREKDVVALSEASSNPKICKLCETLVSQAVEYF
ADNQTQSEIIGILRQTCGAAGVFKEECISLVDSYVPLFFSEISSIEPASICQSVRFCEQVTIISSQIQDHNCEFCHQTVTKILDKLKDPDTQIEILQTLLNMCDSFGYRV
KECKKLVFEYGPLILANSEKILEQTDICKAIHACPGEPRGDNTVSSVGIASLLADA