CuGenDBv2

Tan0004537 (gene) of Snake gourd v1 genome

Gene ID: Tan0004537
Organism: Trichosanthes anguina (Snake gourd v1)
Description: proactivator polypeptide-like 1
Genome location: LG03:62944222..62949149
RNA-Seq Expression: Tan0004537
Synteny: Tan0004537
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0006665 - sphingolipid metabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR007856 - Saposin-like type B, region 1
IPR008138 - Saposin B type, region 2
IPR008139 - Saposin B type domain
IPR011001 - Saposin-like
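
The genome location field above follows the common `sequence:start..end` form. The following is a minimal Python sketch for splitting such a string and computing the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'SEQ:START..END' string into (seq_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seq_id, start, end = parse_location("LG03:62944222..62949149")
# Span length under the ASSUMED 1-based, inclusive coordinate convention.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the gene spans 4928 bp on LG03; with a 0-based, half-open convention the `+ 1` would be dropped.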


Homology
GenBank top hits (e value, % identity, alignment)
KAG6595562.1 WEB family protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.6e-120; % identity: 90.12)
Query:  LGRREPEPGCLRLSSKASGAMDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGIL
        L R EPEPGCL LSSKASGAMDLRFGLVFLLVV AAW+C ARKLASSD E SYLE EKDVEA SEASSNPK CKLCESLVSQAVEYLADN TQNEITGIL
Subjt:  LGRREPEPGCLRLSSKASGAMDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGIL

Query:  RQTCAVLGVFKEECISLVDSYVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQC
        +QTCAVLGVFKEECISLVD+YVPLFFSEISSIEPSSICQSA  CEQVTIISSQIQDNSCGFC ETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQC
Subjt:  RQTCAVLGVFKEECISLVDSYVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQC

Query:  KKLVFEYGPLILANSEKILEQTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        KKLVFEYGPLILANSEK+LEQT+IC+AIHAC A AGGDN  SSV TVSSLADA
Subjt:  KKLVFEYGPLILANSEKILEQTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

KAG7027542.1 Proactivator polypeptide-like 1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 9.6e-120; % identity: 90.12)
Query:  LGRREPEPGCLRLSSKASGAMDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGIL
        L R EPEPGCL LSSKASGAMDLRFGLVFLLVV AAW+C ARKLASSD E SYLE EKDVEA SEASSNPK CKLCESLVSQAVEYLADN TQNEITGIL
Subjt:  LGRREPEPGCLRLSSKASGAMDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGIL

Query:  RQTCAVLGVFKEECISLVDSYVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQC
        +QTCAVLGVFKEECISLVD+YVPLFFSEISSIEPSSICQSA  CEQVTIISSQIQDNSCGFC ETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQC
Subjt:  RQTCAVLGVFKEECISLVDSYVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQC

Query:  KKLVFEYGPLILANSEKILEQTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        KKLVFEYGPLILANSEK+LEQT+IC+AIHAC A AGGDN  SSV TVSSLADA
Subjt:  KKLVFEYGPLILANSEKILEQTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

XP_022925048.1 proactivator polypeptide-like 1 [Cucurbita moschata] (e value: 2.6e-109; % identity: 90.13)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MDLRFGLVFLLVV AAW+C ARKLASSD E SYLE EKDVEA SEASSNPK CKLCESLVSQAVEYLADN TQNEITGIL+QTCAVLGVFKEECISLVD+
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPSSICQSA  CEQVTIISSQIQDNSCGFC ETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEK+LE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QT+IC+AIHAC A AGGDN  SSV TVS LADA
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

XP_022966507.1 proactivator polypeptide-like 1 [Cucurbita maxima] (e value: 1.1e-110; % identity: 90.56)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MDLRFGLVFLLVVGAAWNC ARKLASSD   SYLE EKDVEA SEASSNPK CKLCESLVSQAVEYLADN TQNEITGIL+QTCAVLGVFKEEC+SLVD+
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPSSICQSA  CEQVTIISSQIQDNSCGFC ETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEK+LE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QT+IC+AIHAC A AGGDN  SSV TVSSLADA
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

XP_023518446.1 proactivator polypeptide-like 1 [Cucurbita pepo subsp. pepo] (e value: 7.7e-109; % identity: 89.7)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MDLRFGLVFLLVV AAW+C ARKLASSD E SYLE EKDVEA SEASSNPK CKLCESLVSQAVEYLADN TQNEITGIL+QTCAVLGVFKEEC+SLVD+
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPSSICQSA  CEQVTIISSQIQDNSCGFC ETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEK+LE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QT+IC+AIHAC A AGGDN  SSV TVSSLA A
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CLJ1 proactivator polypeptide-like 1 (e value: 5.2e-103; % identity: 83.69)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MD RF +VFLLV+  AW C AR LAS DSELSYLEQEKDVEALSEASSNPK C LCESL+SQAVEY ADN+TQ+EI G+LRQTC V GVFKEECISLVDS
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPSSICQSAHFCEQVTIISS  QD++C FCH+TISKILDKLKDPDTQIEILQTLL++CDS+ YRVK+CKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QTDIC+AIHAC A   GDNAVSSV TV SLADA
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

A0A5A7U5A0 Proactivator polypeptide-like 1 (e value: 5.2e-103; % identity: 83.69)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MD RF +VFLLV+  AW C AR LAS DSELSYLEQEKDVEALSEASSNPK C LCESL+SQAVEY ADN+TQ+EI G+LRQTC V GVFKEECISLVDS
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPSSICQSAHFCEQVTIISS  QD++C FCH+TISKILDKLKDPDTQIEILQTLL++CDS+ YRVK+CKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QTDIC+AIHAC A   GDNAVSSV TV SLADA
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

A0A6J1EE54 proactivator polypeptide-like 1 (e value: 1.3e-109; % identity: 90.13)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MDLRFGLVFLLVV AAW+C ARKLASSD E SYLE EKDVEA SEASSNPK CKLCESLVSQAVEYLADN TQNEITGIL+QTCAVLGVFKEECISLVD+
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPSSICQSA  CEQVTIISSQIQDNSCGFC ETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEK+LE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QT+IC+AIHAC A AGGDN  SSV TVS LADA
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

A0A6J1GCQ1 proactivator polypeptide-like 1 (e value: 1.5e-102; % identity: 82.4)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MDLRFG+VFLLVVG AW+C AR LAS DSELSYL+Q KDVEALSEASS PK C+LCESLVSQAVEYLA+N+TQ+EI  ILRQTCAV+G+FKEEC+SLVDS
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSE SSIEP+SICQS  FCEQVT+ISSQIQ++SC FCH+TISKILDKLKDPDTQ+EILQ LLN+CDSLG R K+CKKLVFEYGPLILANSEKILE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QTDIC+AIHAC+  AGGD A+SSV TVSSLADA
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

A0A6J1HN59 proactivator polypeptide-like 1 (e value: 5.2e-111; % identity: 90.56)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS
        MDLRFGLVFLLVVGAAWNC ARKLASSD   SYLE EKDVEA SEASSNPK CKLCESLVSQAVEYLADN TQNEITGIL+QTCAVLGVFKEEC+SLVD+
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDS

Query:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE
        YVPLFFSEISSIEPSSICQSA  CEQVTIISSQIQDNSCGFC ETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEK+LE
Subjt:  YVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILE

Query:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
        QT+IC+AIHAC A AGGDN  SSV TVSSLADA
Subjt:  QTDICRAIHACSAGAGGDNAVSSVVTVSSLADA

SwissProt top hits (e value, % identity, alignment)
P07602 Prosaposin (e value: 9.1e-04; % identity: 31.65)
Query:  CKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGV--FKEECISLVDSYVPLFFSEISS--IEPSSICQSAHFCEQV
        C +C+ +V+ A + L DN T+ EI   L +TC  L        C  +VDSY+P+    I      P  +C + + CE +
Subjt:  CKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGV--FKEECISLVDSYVPLFFSEISS--IEPSSICQSAHFCEQV

P10960 Prosaposin (e value: 2.4e-04; % identity: 28.41)
Query:  SEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVL--GVFKEECISLVDSYVPLFFSEISS--IEPSSICQSAHFCEQV
        S+ ++    C +C+++V++A   L DN T+ EI   L +TCA +        C  +VDSY+P+    I      P  +C + + C+ +
Subjt:  SEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVL--GVFKEECISLVDSYVPLFFSEISS--IEPSSICQSAHFCEQV

Q61207 Prosaposin (e value: 4.1e-04; % identity: 28.41)
Query:  SEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVL--GVFKEECISLVDSYVPLFFSEISS--IEPSSICQSAHFCEQV
        S+ ++    C +C+++V++A   L DN TQ EI   L +TC  +        C  +VDSY+P+    I      P  +C + + C+ +
Subjt:  SEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVL--GVFKEECISLVDSYVPLFFSEISS--IEPSSICQSAHFCEQV

Q8C1C1 Proactivator polypeptide-like 1 (e value: 1.5e-11; % identity: 25.99)
Query:  TCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVL-GVFKEECISLVDSYVPLFFSEISSIEPSSICQSAHFC---EQVTIISSQI------------Q
        TC +C +LV +  ++L  N T+  I+  L + C V+     ++CI+LVD+Y P     +S + P  +C++   C    +   IS  +            Q
Subjt:  TCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVL-GVFKEECISLVDSYVPLFFSEISSIEPSSICQSAHFC---EQVTIISSQI------------Q

Query:  DNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGY-RVKQCKKLVFEYGPLILANSEKILEQTDICRAIHAC
         + C  C   +      L    T+ +IL      C  L    V QC + V EY P+++ + + ++  TD+C+ + AC
Subjt:  DNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGY-RVKQCKKLVFEYGPLILANSEKILEQTDICRAIHAC

Arabidopsis top hits (e value, % identity, alignment)
AT3G51730.1 saposin B domain-containing protein (e value: 3.6e-40; % identity: 41.04)
Query:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPK-TCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVD
        M L+ G   LL++G      AR    S               +SE  SN +  C LCE  V+ A+ YL  N TQ EI   L   C+ L  + ++CISLVD
Subjt:  MDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPK-TCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVD

Query:  SYVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKIL
         YVPLFF ++ S +P   C+  + C +V  +  + + +SCG CH T+S+IL KL+DPDTQ++I++ L+  C SL    K+CK LVFEYGPLIL N+E+ L
Subjt:  SYVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKIL

Query:  EQTDICRAIHAC
         + D+C  + AC
Subjt:  EQTDICRAIHAC

AT5G01800.1 saposin B domain-containing protein (e value: 1.7e-37; % identity: 35.06)
Query:  RFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDSYVP
        RFG++ +L +  +W+C A                  +E    A  + + C+LC+  V+  ++YL D   QNE+   L  +C+ +   K++C+S+VD Y  
Subjt:  RFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEYLADNKTQNEITGILRQTCAVLGVFKEECISLVDSYVP

Query:  LFFSEISSIEPSSICQSAHFCEQVT-IISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILEQT
        LFF+++S+I+   IC+  + C+ VT   +SQ+   +C  C ET+S+++ KLKDP+T+++I++ LL  C SL     +CKK+VFEYGPL+L + +K LE+ 
Subjt:  LFFSEISSIEPSSICQSAHFCEQVT-IISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYRVKQCKKLVFEYGPLILANSEKILEQT

Query:  DICRAIHACSAGAGGDNAVSSVVTVSSLADA
        D+C  +H C   A   + V +   V SLAD+
Subjt:  DICRAIHACSAGAGGDNAVSSVVTVSSLADA
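
In the alignments above, the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following Python sketch shows how a percent identity can be derived from such a row. It assumes identity is counted over all alignment columns, and that the row prefix (`Query:  ` or the equivalent indentation) has already been removed; `percent_identity` is a hypothetical helper, and the rows below are toy data, not taken from this record.

```python
def percent_identity(query_row, midline):
    """Percent of alignment columns whose midline character is a letter."""
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment")
    # Pad the midline in case its trailing spaces were stripped on export.
    midline = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / columns

# Toy rows for illustration only.
query = "MDLRFGLV"
mid = "MD RF +V"
print(percent_identity(query, mid))
```

For an alignment printed in several blocks, the query rows and the midline rows would each be concatenated before calling the function.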


Sequences
CDS sequence
ATGTTAAAAGTCCAACCCAACACTCATTTTTCCAACAGCAGAGGAATCAGTTCCATCGTCTCCTTCGGACGCCTCGGAAGAAGGGAACCAGAGCCTGGCTGTCTAAGACT
CTCATCGAAGGCATCAGGCGCAATGGATTTGAGGTTTGGACTTGTTTTCCTTCTTGTAGTGGGTGCTGCTTGGAATTGTGGTGCTAGAAAATTGGCATCCTCTGATTCTG
AGTTAAGCTACCTGGAGCAAGAGAAGGACGTTGAAGCTTTATCTGAAGCTTCCAGCAATCCAAAGACATGTAAACTTTGTGAGAGCTTGGTCAGTCAGGCAGTTGAATAC
CTTGCAGATAACAAAACCCAAAATGAGATCACTGGTATTCTCCGGCAAACCTGCGCTGTGTTGGGCGTGTTCAAGGAGGAGTGCATAAGTCTGGTGGACAGCTATGTTCC
TCTCTTCTTCTCAGAGATTTCCTCAATTGAACCTTCTAGCATCTGCCAATCAGCCCACTTCTGCGAGCAAGTTACTATAATCTCCTCGCAGATTCAGGATAATAGCTGTG
GATTCTGTCACGAGACCATTTCAAAAATATTGGATAAGTTGAAAGATCCTGACACACAGATAGAGATACTCCAGACACTTCTGAATATGTGCGACTCTTTGGGGTACCGT
GTGAAACAGTGCAAGAAATTGGTATTTGAATATGGGCCTCTGATCCTTGCCAATTCGGAGAAAATTCTAGAACAAACAGATATTTGCAGAGCAATACATGCTTGTTCAGC
GGGAGCTGGTGGTGACAACGCCGTTTCATCTGTTGTAACTGTGTCTTCGCTTGCCGACGCTTGA
mRNA sequence
AAGAAGGGAAAAATTCCCTTTAATTAATGTTAAAAGTCCAACCCAACACTCATTTTTCCAACAGCAGAGGAATCAGTTCCATCGTCTCCTTCGGACGCCTCGGAAGAAGG
GAACCAGAGCCTGGCTGTCTAAGACTCTCATCGAAGGCATCAGGCGCAATGGATTTGAGGTTTGGACTTGTTTTCCTTCTTGTAGTGGGTGCTGCTTGGAATTGTGGTGC
TAGAAAATTGGCATCCTCTGATTCTGAGTTAAGCTACCTGGAGCAAGAGAAGGACGTTGAAGCTTTATCTGAAGCTTCCAGCAATCCAAAGACATGTAAACTTTGTGAGA
GCTTGGTCAGTCAGGCAGTTGAATACCTTGCAGATAACAAAACCCAAAATGAGATCACTGGTATTCTCCGGCAAACCTGCGCTGTGTTGGGCGTGTTCAAGGAGGAGTGC
ATAAGTCTGGTGGACAGCTATGTTCCTCTCTTCTTCTCAGAGATTTCCTCAATTGAACCTTCTAGCATCTGCCAATCAGCCCACTTCTGCGAGCAAGTTACTATAATCTC
CTCGCAGATTCAGGATAATAGCTGTGGATTCTGTCACGAGACCATTTCAAAAATATTGGATAAGTTGAAAGATCCTGACACACAGATAGAGATACTCCAGACACTTCTGA
ATATGTGCGACTCTTTGGGGTACCGTGTGAAACAGTGCAAGAAATTGGTATTTGAATATGGGCCTCTGATCCTTGCCAATTCGGAGAAAATTCTAGAACAAACAGATATT
TGCAGAGCAATACATGCTTGTTCAGCGGGAGCTGGTGGTGACAACGCCGTTTCATCTGTTGTAACTGTGTCTTCGCTTGCCGACGCTTGAATTCGCACACCAAACACGAA
GGAGGGATCTTCATGTCGTGTCTGAATGAAAATGGGTAAGAAAACTACTTTTTCTTTGCCCTTCTCTGTCTGTTTATGAATCTAATATGATACAATCTTTGTCCATGGAT
AGTCTGTGTAATCATGTACATTGATATGTGGATATATTTGATTGGCTGTTGCTATTTCGAGTAAAATAATTAGTATTGAAGTTGATATGTGGATATATTTGATTGGCTGT
TGCTATTCGAGTAAAATAATTAGTATTCAAG
Protein sequence
MLKVQPNTHFSNSRGISSIVSFGRLGRREPEPGCLRLSSKASGAMDLRFGLVFLLVVGAAWNCGARKLASSDSELSYLEQEKDVEALSEASSNPKTCKLCESLVSQAVEY
LADNKTQNEITGILRQTCAVLGVFKEECISLVDSYVPLFFSEISSIEPSSICQSAHFCEQVTIISSQIQDNSCGFCHETISKILDKLKDPDTQIEILQTLLNMCDSLGYR
VKQCKKLVFEYGPLILANSEKILEQTDICRAIHACSAGAGGDNAVSSVVTVSSLADA
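
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal Python sketch that assumes the standard genetic code (the record does not name a translation table); `translate` and `CODON_TABLE` are illustrative names, not part of any library.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First six and last four codons of the CDS sequence listed above.
print(translate("ATGTTAAAAGTCCAACCC"))
print(translate("GCCGACGCTTGA"))
```

The two fragments translate to `MLKVQP` and `ADA`, matching the start and end of the protein sequence. The full CDS is 834 nt, which corresponds to the 277 residues of the protein plus one stop codon.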