; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018541 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018541
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein stoned-B-like isoform X1
Genome locationtig00153204:1220088..1221464
RNA-Seq ExpressionSgr018541
SyntenySgr018541
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585540.1 hypothetical protein SDJN03_18273, partial [Cucurbita argyrosperma subsp. sororia]8.0e-5953.49Show/hide
Query:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS
        +P   FSP E  V +ILL+     +KS  + G +P W  +RKRSA            L SPP S   +A+VPS        PPSK+VKE SPT+PL LNS
Subjt:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS

Query:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L
        L  S SESDE  N K SKKK SL  K++++EAIDELTK+ Q L+GE E +K+ Y+HLK+INSELKAKKQ+ ILG    KN+SA PE G SSSAME+ + L
Subjt:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L

Query:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP
        TV+S T    H P P  +         SQNFQI IG +P Y+   L P+GIPDLNISLEE+ QRNYS+ MAA+AR+ RIQICK KN N  T+LQ+PP NP
Subjt:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP

Query:  C
        C
Subjt:  C

KAG7020453.1 hypothetical protein SDJN02_17137, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-5753.33Show/hide
Query:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS
        +P   FSP E  V +ILL+     +KS  + G +P W  +RKRSA            L SPP S   +A+VPS        PPSK+VKE SPT+PL LNS
Subjt:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS

Query:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L
        L  S SESDE  N K SKKK SL  K++++EAIDELTK+ Q L+GE E +K+ Y+HLK+INSELKAKKQ+ ILG    KN+SA PE G SSSAME+ + L
Subjt:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L

Query:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP
        TV+S T    H P P  +         SQNFQI IG +P Y+   L P+GIPDLNISLEE+ QRNYS+ MAA+AR+ RIQICK KN N  T+LQ+PP NP
Subjt:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP

XP_022951578.1 uncharacterized protein LOC111454352 [Cucurbita moschata]7.2e-6054.15Show/hide
Query:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS
        S DF FSP E  V +ILL+     +KS  + G +P W  +RKRSA            L SPP S   +A+VPS        PPSK+VKE SPT+PL LNS
Subjt:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS

Query:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L
        L  S SESDE  N K SKKK SL  K++++EAIDELTK+ Q L+GE E +K+ Y+HLK+INSELKAKKQ+ ILG    KN+SA PE G SSSAME+ + L
Subjt:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L

Query:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP
        TV+S T    H P P  +         SQNFQI IG +P Y+   L P+GIPDLNISLEE+ QRNYS+ MAA+AR+ RIQICK KN N  T+LQ+PP NP
Subjt:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP

Query:  C
        C
Subjt:  C

XP_023002465.1 uncharacterized protein LOC111496295 [Cucurbita maxima]3.2e-6052.72Show/hide
Query:  PECCASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELS
        P+C      S DF F+P E  V +ILL+     +KS  + G +P W  +RKRSA            L SPP S   +A+VPS        PPSK+VKE S
Subjt:  PECCASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELS

Query:  PTTPLSLNSLTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSS
        PT+PL LNSL  S SESDE  N K SKKK SL  K++++EAIDELTK+ Q L+GE E +K+ Y+HLK+INSELKAKKQ+ ILG    KN+SA PE G SS
Subjt:  PTTPLSLNSLTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSS

Query:  SAMEIARLTVKSPTLQNHHHPQPSIKNQTAPTPEQ----SQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSN
        SAME+ +L     T+++ +H QP      AP  EQ    SQNFQI IG +P Y+   L P+GIPDLNISLEE+ QRNYS+ MAA+AR+ RIQICK KN N
Subjt:  SAMEIARLTVKSPTLQNHHHPQPSIKNQTAPTPEQ----SQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSN

Query:  AATRLQSPPPNPC
          T+LQ+PP NPC
Subjt:  AATRLQSPPPNPC

XP_023537124.1 uncharacterized protein LOC111798295 [Cucurbita pepo subsp. pepo]3.0e-5853.16Show/hide
Query:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS
        S DF F+P E  V +ILL+     +KS  + G +P W  +RKRSA            L SPP S   +A+VPS        PPSK+VKE SPT+PL LNS
Subjt:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS

Query:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L
        L  S SESDE  N K +KKK S   K++++EAIDELTK+ Q L+GE E +K+ Y+HLK+INSELKAKKQ+ ILG    KN+SA PE G SSSAME+ + L
Subjt:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L

Query:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP
        TV+S T    H P P  +         SQNFQI IG +P Y+   L P+GIPDLNISLEE+ QRNYS+ MAA+AR+ RIQICK KN N  T+LQ+PP NP
Subjt:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP

Query:  C
        C
Subjt:  C

TrEMBL top hitse value%identityAlignment
A0A0A0LRP1 Uncharacterized protein1.5e-5549.68Show/hide
Query:  CASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTT
        C+ +  S D  FSP+E  V +IL QLP+LI++S    GL P+W  +RKRSA++  + P  +  ++ PP  LP    +PS          S+  KE SPTT
Subjt:  CASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTT

Query:  PLSLNSLTFSPSESDENT---KRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILGLKNQSAQPERGNSSS-AME
        PLSL+SL  S SESDENT   K SKKK  +  K+++LE I++LT +KQ LEG++E +KR + +LK+INSELKAKKQ+ + G  N S  P+ G S+S AME
Subjt:  PLSLNSLTFSPSESDENT---KRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILGLKNQSAQPERGNSSS-AME

Query:  IARLTVKSP---TLQNHHHPQPSIKNQTAPTPEQS---QNFQIAIGKMPSYE-HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTK----NS
        IA+LTVKS       NH   +PS+KNQT P  EQS   QN+QI IG +P Y+  LGP+GIPDLN+SLE++  +NY+K +AA+ARQ RIQI K K    N+
Subjt:  IARLTVKSP---TLQNHHHPQPSIKNQTAPTPEQS---QNFQIAIGKMPSYE-HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTK----NS

Query:  NAATRLQS
        N A +LQS
Subjt:  NAATRLQS

A0A1S3BAR4 uncharacterized protein LOC1034880491.5e-5550.33Show/hide
Query:  CASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTT
        C+ +  S D  FSP+E  V +IL QLP+LI+KS    GL P+W  +RKRSA++            SPP + P     P   P +   P S+  KE SPTT
Subjt:  CASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTT

Query:  PLSLNSLTFSPSESDEN---TKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILGLKNQSAQPERGNSSS-AME
        PLSLNSL  S SESDEN    K SKKK  +  K+++LE ID+LT +KQ LEG++E +KR + +LK+INSELKAKKQ+ + G  N S  PE G SSS AME
Subjt:  PLSLNSLTFSPSESDEN---TKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILGLKNQSAQPERGNSSS-AME

Query:  IARLTVKSP---TLQNHHHPQPSIKNQTAPTPEQ---SQNFQIAIGKMPSYE-HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTK--NSNA
        +A+LTVKS       NH   +PS+KNQT PT EQ   ++N+QI IG +P Y+  LGP+GIPDLN+SLE++  ++Y+K +AA+ARQ RIQI K K  N+N 
Subjt:  IARLTVKSP---TLQNHHHPQPSIKNQTAPTPEQ---SQNFQIAIGKMPSYE-HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTK--NSNA

Query:  ATRLQS
        A +LQS
Subjt:  ATRLQS

A0A5A7VHE1 Uncharacterized protein1.5e-5550.33Show/hide
Query:  CASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTT
        C+ +  S D  FSP+E  V +IL QLP+LI+KS    GL P+W  +RKRSA++            SPP + P     P   P +   P S+  KE SPTT
Subjt:  CASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTT

Query:  PLSLNSLTFSPSESDEN---TKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILGLKNQSAQPERGNSSS-AME
        PLSLNSL  S SESDEN    K SKKK  +  K+++LE ID+LT +KQ LEG++E +KR + +LK+INSELKAKKQ+ + G  N S  PE G SSS AME
Subjt:  PLSLNSLTFSPSESDEN---TKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILGLKNQSAQPERGNSSS-AME

Query:  IARLTVKSP---TLQNHHHPQPSIKNQTAPTPEQ---SQNFQIAIGKMPSYE-HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTK--NSNA
        +A+LTVKS       NH   +PS+KNQT PT EQ   ++N+QI IG +P Y+  LGP+GIPDLN+SLE++  ++Y+K +AA+ARQ RIQI K K  N+N 
Subjt:  IARLTVKSP---TLQNHHHPQPSIKNQTAPTPEQ---SQNFQIAIGKMPSYE-HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTK--NSNA

Query:  ATRLQS
        A +LQS
Subjt:  ATRLQS

A0A6J1GI34 uncharacterized protein LOC1114543523.5e-6054.15Show/hide
Query:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS
        S DF FSP E  V +ILL+     +KS  + G +P W  +RKRSA            L SPP S   +A+VPS        PPSK+VKE SPT+PL LNS
Subjt:  SPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLNS

Query:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L
        L  S SESDE  N K SKKK SL  K++++EAIDELTK+ Q L+GE E +K+ Y+HLK+INSELKAKKQ+ ILG    KN+SA PE G SSSAME+ + L
Subjt:  LTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSSSAMEIAR-L

Query:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP
        TV+S T    H P P  +         SQNFQI IG +P Y+   L P+GIPDLNISLEE+ QRNYS+ MAA+AR+ RIQICK KN N  T+LQ+PP NP
Subjt:  TVKSPTLQNHHHPQPSIKNQTAPTPEQSQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNP

Query:  C
        C
Subjt:  C

A0A6J1KP15 uncharacterized protein LOC1114962951.6e-6052.72Show/hide
Query:  PECCASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELS
        P+C      S DF F+P E  V +ILL+     +KS  + G +P W  +RKRSA            L SPP S   +A+VPS        PPSK+VKE S
Subjt:  PECCASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELS

Query:  PTTPLSLNSLTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSS
        PT+PL LNSL  S SESDE  N K SKKK SL  K++++EAIDELTK+ Q L+GE E +K+ Y+HLK+INSELKAKKQ+ ILG    KN+SA PE G SS
Subjt:  PTTPLSLNSLTFSPSESDE--NTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILG---LKNQSAQPERGNSS

Query:  SAMEIARLTVKSPTLQNHHHPQPSIKNQTAPTPEQ----SQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSN
        SAME+ +L     T+++ +H QP      AP  EQ    SQNFQI IG +P Y+   L P+GIPDLNISLEE+ QRNYS+ MAA+AR+ RIQICK KN N
Subjt:  SAMEIARLTVKSPTLQNHHHPQPSIKNQTAPTPEQ----SQNFQIAIGKMPSYE--HLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSN

Query:  AATRLQSPPPNPC
          T+LQ+PP NPC
Subjt:  AATRLQSPPPNPC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCCGAATGCTGCGCTTCCACTCTTCCCTCTCCTGATTTCGGCTTCTCTCCCGACGAATTTACCGTCTGTGAAATCCTGCTTCAGCTCCCCGTTCTGATCGAGAA
ATCCAAGTCTTTTTTCGGATTATTGCCCGCCTGGGGTCGTAAGCGGAAGAGATCCGCCTTGAATTATTCCACTTTGCCGTCTGGTGCTCTTTCTTTATCGTCTCCGCCTT
CGTCACTTCCTCACGCCGCTGCCGTGCCGTCGTCTACTCCTGCTGAGACCGAAGAGCCGCCGTCGAAGGAGGTCAAGGAGTTGAGCCCTACTACTCCTCTCTCCCTCAAC
TCTTTGACCTTTTCGCCGAGCGAATCTGATGAGAATACCAAACGGTCAAAGAAGAAAGTCTCTCTCAAGACGAAAACTGAGTGGTTGGAAGCCATTGACGAATTGACCAA
GCGCAAGCAAGTATTGGAAGGGGAGATGGAGGTTGTGAAGCGCCGTTACGATCATCTGAAATCCATCAATTCGGAGTTGAAGGCGAAAAAGCAAAAGACGATTCTGGGAC
TTAAGAACCAATCGGCGCAGCCAGAAAGGGGAAACTCAAGTTCCGCCATGGAAATCGCTCGGCTTACTGTCAAATCCCCCACTCTGCAGAATCATCATCATCCTCAACCA
TCCATCAAGAATCAGACGGCTCCCACGCCAGAACAGAGTCAGAATTTCCAGATTGCAATAGGAAAAATGCCTTCCTATGAACACTTGGGCCCAATCGGCATCCCTGACCT
GAACATCTCTCTGGAAGAACTCACTCAGAGGAATTACAGCAAAATCATGGCCGCTCAAGCACGACAGAGAAGGATTCAGATCTGCAAGACCAAGAACTCCAACGCGGCCA
CCAGATTGCAGAGTCCTCCTCCTAATCCATGTAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCCGAATGCTGCGCTTCCACTCTTCCCTCTCCTGATTTCGGCTTCTCTCCCGACGAATTTACCGTCTGTGAAATCCTGCTTCAGCTCCCCGTTCTGATCGAGAA
ATCCAAGTCTTTTTTCGGATTATTGCCCGCCTGGGGTCGTAAGCGGAAGAGATCCGCCTTGAATTATTCCACTTTGCCGTCTGGTGCTCTTTCTTTATCGTCTCCGCCTT
CGTCACTTCCTCACGCCGCTGCCGTGCCGTCGTCTACTCCTGCTGAGACCGAAGAGCCGCCGTCGAAGGAGGTCAAGGAGTTGAGCCCTACTACTCCTCTCTCCCTCAAC
TCTTTGACCTTTTCGCCGAGCGAATCTGATGAGAATACCAAACGGTCAAAGAAGAAAGTCTCTCTCAAGACGAAAACTGAGTGGTTGGAAGCCATTGACGAATTGACCAA
GCGCAAGCAAGTATTGGAAGGGGAGATGGAGGTTGTGAAGCGCCGTTACGATCATCTGAAATCCATCAATTCGGAGTTGAAGGCGAAAAAGCAAAAGACGATTCTGGGAC
TTAAGAACCAATCGGCGCAGCCAGAAAGGGGAAACTCAAGTTCCGCCATGGAAATCGCTCGGCTTACTGTCAAATCCCCCACTCTGCAGAATCATCATCATCCTCAACCA
TCCATCAAGAATCAGACGGCTCCCACGCCAGAACAGAGTCAGAATTTCCAGATTGCAATAGGAAAAATGCCTTCCTATGAACACTTGGGCCCAATCGGCATCCCTGACCT
GAACATCTCTCTGGAAGAACTCACTCAGAGGAATTACAGCAAAATCATGGCCGCTCAAGCACGACAGAGAAGGATTCAGATCTGCAAGACCAAGAACTCCAACGCGGCCA
CCAGATTGCAGAGTCCTCCTCCTAATCCATGTAGGTGA
Protein sequenceShow/hide protein sequence
MAPECCASTLPSPDFGFSPDEFTVCEILLQLPVLIEKSKSFFGLLPAWGRKRKRSALNYSTLPSGALSLSSPPSSLPHAAAVPSSTPAETEEPPSKEVKELSPTTPLSLN
SLTFSPSESDENTKRSKKKVSLKTKTEWLEAIDELTKRKQVLEGEMEVVKRRYDHLKSINSELKAKKQKTILGLKNQSAQPERGNSSSAMEIARLTVKSPTLQNHHHPQP
SIKNQTAPTPEQSQNFQIAIGKMPSYEHLGPIGIPDLNISLEELTQRNYSKIMAAQARQRRIQICKTKNSNAATRLQSPPPNPCR