; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020847 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020847
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMitochondrial pyruvate carrier 2-like
Genome locationtig00153574:590686..593829
RNA-Seq ExpressionSgr020847
SyntenySgr020847
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019402.1 hypothetical protein SDJN02_18363, partial [Cucurbita argyrosperma subsp. argyrosperma]8.5e-6775.25Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        MKKLYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS  SNDFTT +    HRGK AHQKPAA+K   SDHPPVF+CDCFRCYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV
        VRWDSSPNRQ+IHEIIDAYEE LA+SK  KNN+KERKKR+TGSGSGS    GD KGS+++ + EES   EME A     DGG   E+E EKGSVR IV
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV

XP_022927395.1 uncharacterized protein LOC111434229 [Cucurbita moschata]1.7e-6775.76Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        MKKLYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS  SNDFTT +   GHRGK AHQKPAA+K   SDHPPVF+CDCFRCYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV
        VRWDSSPNRQ+IHEIIDAYEE LA+SK  KNN+KERKKR+TGSGSGS    GD KGS+++ + EES   EME A     DGG   E+E EKGSVR IV
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV

XP_023001610.1 uncharacterized protein LOC111495687 [Cucurbita maxima]5.9e-6874.75Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        M KLYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS  SNDFTT +   GHRGK A QKPAA+K   SDHPP F+CDCFRCYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV
        VRWDSSPNRQ+IHEIIDAYEE LA+SK  KNN+KERKKR+TGSGSGS    GD KGS+++ + EES   EME A     DGGGG E+E EKG+VR IV
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV

XP_023520004.1 uncharacterized protein LOC111783314 [Cucurbita pepo subsp. pepo]4.1e-6975.76Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        MKKLYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS  SNDFTT +   GHRGK AHQKPAA++   SDHPPVF+C+CFRCYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV
        VRWDSSPNRQ+IHEIIDAYEE LA+SK  KN++KERKKR+TGSGSGS    GD KGS+V+ + EES   EME A     DGGGG E+E EKGSVR IV
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV

XP_038894832.1 uncharacterized protein LOC120083238 [Benincasa hispida]6.7e-7261.05Show/hide
Query:  LCTQIPAFHVNISRRHALQS--------------------------CSNRLVP----TPLAHSLRRPPDIFSPMKKLYRRRGTVHPSPPIISDHLSFLPA
        +CTQIPAFHVNIS  HA QS                            N++ P    + L+HSL+ P   +SPMKKLYR+RGTVHPSPPIISDHLSFLP 
Subjt:  LCTQIPAFHVNISRRHALQS--------------------------CSNRLVP----TPLAHSLRRPPDIFSPMKKLYRRRGTVHPSPPIISDHLSFLPA

Query:  AILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQSK
        AILTLAAALS EDREVLAYLISSC SNDFT  +    HRGK AHQKP A+    SDHPP F+CDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEE LA+SK
Subjt:  AILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQSK

Query:  TSKNNRKERKKRSTGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV
          KNN+KERKKR++G  SG G+ K S+ ++  EE    E E A         G E+ETEKGSVRRIV
Subjt:  TSKNNRKERKKRSTGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV

TrEMBL top hitse value%identityAlignment
A0A0A0LVV3 Uncharacterized protein2.3e-5764.15Show/hide
Query:  PLAHSLRRPPDIFSPMKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT--TSFGHRGKTAHQKPAASKGCSSDHP
        P+  SL      +SPMKKLYR+RGTVHPSP IISDHLSFLP  ILTLAAALS  DREVLAYLISSC SNDFT    S  HRGK  HQK AA+ G   DHP
Subjt:  PLAHSLRRPPDIFSPMKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT--TSFGHRGKTAHQKPAASKGCSSDHP

Query:  PVFNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRST-GSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDE
        P F+C CF+CYTSYWVRWDSSPNRQLIHEIIDAYEE LA+SK  KNN+KERKKR+  G  SG G+ KGS+ +++ EE    E E A         G E+ 
Subjt:  PVFNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRST-GSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDE

Query:  TEKGSVRRIVEV
         EKG VRRIV +
Subjt:  TEKGSVRRIVEV

A0A1S3C9P0 uncharacterized protein LOC1034982281.0e-5463.96Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT--TSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        MKKLYR+ GTVHPSPP+ISDHLSFLP AILTL++ALS +DREVLAYLISSC SNDFT    S  HRGK AH K AA     SDHPP F+C CF+CYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT--TSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKR-STGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIVEV
        VRWDSSPNRQLIHEIIDAYE+ LA++K  KNN+KERKKR S+G+ SG G+ KG++ +++ EE    E             G E+E EKG VRRIV +
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKR-STGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIVEV

A0A5D3CKA3 Uncharacterized protein1.0e-5463.96Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT--TSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        MKKLYR+ GTVHPSPP+ISDHLSFLP AILTL++ALS +DREVLAYLISSC SNDFT    S  HRGK AH K AA     SDHPP F+C CF+CYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT--TSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKR-STGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIVEV
        VRWDSSPNRQLIHEIIDAYE+ LA++K  KNN+KERKKR S+G+ SG G+ KG++ +++ EE    E             G E+E EKG VRRIV +
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKR-STGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIVEV

A0A6J1ENT0 uncharacterized protein LOC1114342298.2e-6875.76Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        MKKLYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS  SNDFTT +   GHRGK AHQKPAA+K   SDHPPVF+CDCFRCYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV
        VRWDSSPNRQ+IHEIIDAYEE LA+SK  KNN+KERKKR+TGSGSGS    GD KGS+++ + EES   EME A     DGG   E+E EKGSVR IV
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV

A0A6J1KLN6 uncharacterized protein LOC1114956872.8e-6874.75Show/hide
Query:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW
        M KLYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS  SNDFTT +   GHRGK A QKPAA+K   SDHPP F+CDCFRCYTSYW
Subjt:  MKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTS--FGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYW

Query:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV
        VRWDSSPNRQ+IHEIIDAYEE LA+SK  KNN+KERKKR+TGSGSGS    GD KGS+++ + EES   EME A     DGGGG E+E EKG+VR IV
Subjt:  VRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGS----GDAKGSDVSSRREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein6.3e-3648.65Show/hide
Query:  MKKLYRRRGTVHPSPPII--SDH-LSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTSFGHRGKTA--HQKPAASKGCSSDHPPVFNCDCFRCYT
        MKKLY R+GTVHPSPP I  +DH L+ LP AI +LAA LSPEDREVLAYLIS       T +  G R  T+  ++  A  K    +H P+F+CDCF CYT
Subjt:  MKKLYRRRGTVHPSPPII--SDH-LSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTSFGHRGKTA--HQKPAASKGCSSDHPPVFNCDCFRCYT

Query:  SYWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNN---RKERKKRSTGS----GSGSGDAKGSDVSSRREES--GPREMEKAGESSDDGG---GGAED--
        SYWVRWDSSP+RQLIHEIIDA+E++L ++K  K N   +K+R+KRS  S     S S     S++ SR  ES         + E + DGG   GG E   
Subjt:  SYWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNN---RKERKKRSTGS----GSGSGDAKGSDVSSRREES--GPREMEKAGESSDDGG---GGAED--

Query:  -----------ETEKGSVRRIV
                   E EKG+VRR V
Subjt:  -----------ETEKGSVRRIV

AT1G24270.1 unknown protein4.4e-2139.34Show/hide
Query:  ISRRHALQSCSNRLVPTPLAHSLRRPPDIFSPMKKLYRRRGTVHPSPPIIS-------DHLS---FLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT
        I  R+  +S  N L+   L+  L +     S MK +  ++G VHPSPP+ S       D LS    L +AIL L + LS ED EVLAYLI+   +   TT
Subjt:  ISRRHALQSCSNRLVPTPLAHSLRRPPDIFSPMKKLYRRRGTVHPSPPIIS-------DHLS---FLPAAILTLAAALSPEDREVLAYLISSCGSNDFTT

Query:  TSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQ-----SKTSKNNRKERKK
             + K +H+             P+ +C CF CYTSYW +WDSS NR+LI++II+A+E+ L +     S TSK N+K  KK
Subjt:  TSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQ-----SKTSKNNRKERKK

AT1G62422.1 unknown protein2.9e-3351.52Show/hide
Query:  MKKLYRRRGTVHPSPP--IISDH--LSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTS
        MKKL  R+GTVHPSPP  I +D   LS LP AIL+L AALS EDREVLAYLIS+ G  D    S   + K             + H P+F CDCF CYTS
Subjt:  MKKLYRRRGTVHPSPP--IISDH--LSFLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTS

Query:  YWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGG--GGAEDETEKGSVRRIV
        YWVRWD+SP RQLIHEIIDAYE++L      K  +K+R+KRS G  SG  ++ G   +SR  E G    E AG  S+  G  GG E E EKGSV +++
Subjt:  YWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDDGG--GGAEDETEKGSVRRIV

AT5G13090.1 unknown protein2.0e-2134.92Show/hide
Query:  RRRGTVHPSPP---------IISDHLS-----------FLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTSFGHRGKTAHQKPAASKGCSSDHPPV
        +++G V+PSPP           S+HL+            LPA IL L + LS E+REVLAYLI    +   T +  G+       K  ++K   +  PPV
Subjt:  RRRGTVHPSPP---------IISDHLS-----------FLPAAILTLAAALSPEDREVLAYLISSCGSNDFTTTSFGHRGKTAHQKPAASKGCSSDHPPV

Query:  FNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDD
        F+C+CF CYT+YW RWDSSPNR+LIHEII+A+E    +  ++  ++ +R K+    G    D+  S  + R  ++G ++ +   E  ++
Subjt:  FNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGSGDAKGSDVSSRREESGPREMEKAGESSDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGGCCCCATCCCCATTAAAACAAGAATCTCCACCGTACGATGATTTAGCAATTTGGGTTGTTCTGTGTACTCAAATTCCCGCGTTTCACGTGAATATTTCCCGTCG
CCATGCATTGCAGAGTTGCAGTAATCGTTTGGTTCCAACCCCACTCGCACATTCTCTGCGCCGTCCACCTGACATTTTTTCTCCCATGAAGAAGCTCTACCGGAGGAGAG
GAACGGTGCACCCCTCGCCGCCGATCATCTCCGACCACCTCTCGTTTCTCCCCGCCGCCATCCTCACCCTAGCGGCGGCTCTCTCGCCGGAGGACAGAGAGGTCTTGGCT
TACCTCATCTCCTCCTGCGGCTCCAACGACTTCACCACCACCAGCTTCGGCCACCGCGGCAAGACCGCCCACCAGAAGCCCGCCGCTTCGAAGGGCTGTTCCTCCGACCA
CCCCCCGGTGTTCAACTGCGACTGTTTCCGGTGCTACACCAGCTACTGGGTCAGATGGGACTCGTCCCCGAATCGCCAACTCATACACGAAATCATCGATGCTTATGAAG
AAGCGCTGGCTCAGAGCAAAACCAGCAAGAACAACAGGAAAGAGAGGAAGAAGAGAAGTACCGGATCCGGGTCCGGGTCGGGCGACGCGAAGGGATCCGATGTGAGTTCG
AGACGAGAGGAGTCGGGGCCGAGGGAGATGGAGAAGGCGGGAGAGAGCAGTGACGACGGCGGCGGCGGCGCTGAGGATGAGACGGAGAAAGGATCGGTGAGGAGGATTGT
TGAAGTAGAGCAGATCGCAGATTTGGAGCGAGTTCATCTGAGCGCGGGTTTGGGCTTTGGATCTAAGAAAGAAGGAGAAGCAAGGAAGAAAGGAGGAAGAAGAATAAGGG
AGACGTATGAGGAAGAAGAAGAAAAAGTTGATTTTGTTGATTGCTTTTCGAAGATCTGTAGAAGAAGACGCAGAAGAGCCAAGGATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCGGGCCCCATCCCCATTAAAACAAGAATCTCCACCGTACGATGATTTAGCAATTTGGGTTGTTCTGTGTACTCAAATTCCCGCGTTTCACGTGAATATTTCCCGTCG
CCATGCATTGCAGAGTTGCAGTAATCGTTTGGTTCCAACCCCACTCGCACATTCTCTGCGCCGTCCACCTGACATTTTTTCTCCCATGAAGAAGCTCTACCGGAGGAGAG
GAACGGTGCACCCCTCGCCGCCGATCATCTCCGACCACCTCTCGTTTCTCCCCGCCGCCATCCTCACCCTAGCGGCGGCTCTCTCGCCGGAGGACAGAGAGGTCTTGGCT
TACCTCATCTCCTCCTGCGGCTCCAACGACTTCACCACCACCAGCTTCGGCCACCGCGGCAAGACCGCCCACCAGAAGCCCGCCGCTTCGAAGGGCTGTTCCTCCGACCA
CCCCCCGGTGTTCAACTGCGACTGTTTCCGGTGCTACACCAGCTACTGGGTCAGATGGGACTCGTCCCCGAATCGCCAACTCATACACGAAATCATCGATGCTTATGAAG
AAGCGCTGGCTCAGAGCAAAACCAGCAAGAACAACAGGAAAGAGAGGAAGAAGAGAAGTACCGGATCCGGGTCCGGGTCGGGCGACGCGAAGGGATCCGATGTGAGTTCG
AGACGAGAGGAGTCGGGGCCGAGGGAGATGGAGAAGGCGGGAGAGAGCAGTGACGACGGCGGCGGCGGCGCTGAGGATGAGACGGAGAAAGGATCGGTGAGGAGGATTGT
TGAAGTAGAGCAGATCGCAGATTTGGAGCGAGTTCATCTGAGCGCGGGTTTGGGCTTTGGATCTAAGAAAGAAGGAGAAGCAAGGAAGAAAGGAGGAAGAAGAATAAGGG
AGACGTATGAGGAAGAAGAAGAAAAAGTTGATTTTGTTGATTGCTTTTCGAAGATCTGTAGAAGAAGACGCAGAAGAGCCAAGGATGAATAG
Protein sequenceShow/hide protein sequence
MRAPSPLKQESPPYDDLAIWVVLCTQIPAFHVNISRRHALQSCSNRLVPTPLAHSLRRPPDIFSPMKKLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLA
YLISSCGSNDFTTTSFGHRGKTAHQKPAASKGCSSDHPPVFNCDCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEALAQSKTSKNNRKERKKRSTGSGSGSGDAKGSDVSS
RREESGPREMEKAGESSDDGGGGAEDETEKGSVRRIVEVEQIADLERVHLSAGLGFGSKKEGEARKKGGRRIRETYEEEEEKVDFVDCFSKICRRRRRRAKDE