CuGenDBv2

Sgr023587 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023587
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic
Genome location: tig00000892:4762504..4771705
RNA-Seq Expression: Sgr023587
Synteny: Sgr023587
Gene Ontology terms: NA
InterPro domains: NA
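
The genome location above follows a `contig:start..end` pattern. A minimal sketch of how such a string could be split into its parts is shown here; the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str):
    """Split a 'contig:start..end' string into (contig, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The source page does not confirm this convention.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return contig, start, end, end - start + 1


contig, start, end, span = parse_location("tig00000892:4762504..4771705")
print(contig, start, end, span)
```

Under the inclusive-coordinate assumption the gene region spans 9202 bp, which is longer than the CDS listed further down, consistent with the presence of introns.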


Homology
GenBank top hits | e value | % identity
KAG6604770.1 Protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 3.5e-90 | 75.2
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SLEVDLSKELMAPPMP  E +VE+ IQ+DNR SPRW+LAPTRREQEKW+RA KAATGGSDVMFRELRRP+GDPEVLA+LSREQYFKLKKKLQ LTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G +S     ++SY   +       +ASFGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVY
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV KNEPQA
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

XP_022149755.1 uncharacterized protein LOC111018112 [Momordica charantia] | 3.5e-90 | 76.1
Query:  NSLEVDLSKELMAPP-MPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAI
        +SLEVDLSKELMAPP MPRSE+LVE+NIQID   SPRWKLAPTRREQEKW+RANKAATGGSDVMFRELRRPRGDPEVLASL REQYFKLK K++ILTLAI
Subjt:  NSLEVDLSKELMAPP-MPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAI

Query:  GVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMV
        G                      G G  S     ++SY   +       +ASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGA+AQPRLLVPVILVMV
Subjt:  GVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMV

Query:  YNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        YNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV K+EPQ+
Subjt:  YNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

XP_022947232.1 uncharacterized protein LOC111451157 [Cucurbita moschata] | 1.3e-89 | 74.8
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SLEVDLSKELMAPPMP  E +VE+ IQ+DNR SPRW+LAPTRREQEKW+RA KAATGGSDVMFRELRRP+GDPEVLA+LSREQYFKLKKKLQ LTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G +S     ++SY   +       +ASFGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVY
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV KNEP A
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

XP_022970893.1 uncharacterized protein LOC111469729 [Cucurbita maxima] | 3.5e-90 | 75.2
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SLEVDLSKELMAPPMP  E +VE+ IQ+DNR SPRW+LAPTRREQEKW+RA KAATGGSDVMFRELRRP+GDPEVLA+LSREQYFKLKKKLQ LTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G +S     ++SY   +       +ASFGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVY
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV KNEPQA
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

XP_038901394.1 protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic [Benincasa hispida] | 4.2e-91 | 76.4
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SLEVDLSKEL APP+PRSE+LVEKNI ID+R SPRWKLAPTRREQEKW+RA KAATGGSDVMFRELRRP+GDPEVLA+LSREQYFKLKKK+QILTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G  S     ++SY   +       +ASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVM+Y
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV KN+PQA
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

TrEMBL top hits | e value | % identity
A0A0A0KC26 Uncharacterized protein | 6.8e-87 | 74
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SL+VDLSKEL APPMPRSE+LVEKNI ID+R SPRWKLAPTRREQEKW+RA +AATGGSDVMFRELRRP+G+PEVLA+LS EQY KLKKK+QILTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G +S     ++SY   +       SASFGAGLIGSLVYIRMLG+SVDSLADGAKGLVKGAVAQPRLLVPVILVM+Y
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGV+QLQLIPMLVGFFTYKVATFVQA+EEALTV K EPQA
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

A0A1S3BK31 uncharacterized protein LOC103490489 isoform X3 | 1.0e-87 | 73.2
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SL+VDLSKEL  PPMPRSE+LVEKNI I +R SPRWKLAPTR EQEKW+RA KAATGGSDVMF+ELRRP+GDPE LA+LS EQYFKLKKK+QILTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G +S     ++SY   +       +ASFGAGLIGSLVYIRMLG+SVDSLADGAKGLVKGAVAQPRLLVPVILVM+Y
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQA+EEALTV KN+PQA
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

A0A6J1D6M3 uncharacterized protein LOC111018112 | 1.7e-90 | 76.1
Query:  NSLEVDLSKELMAPP-MPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAI
        +SLEVDLSKELMAPP MPRSE+LVE+NIQID   SPRWKLAPTRREQEKW+RANKAATGGSDVMFRELRRPRGDPEVLASL REQYFKLK K++ILTLAI
Subjt:  NSLEVDLSKELMAPP-MPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAI

Query:  GVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMV
        G                      G G  S     ++SY   +       +ASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGA+AQPRLLVPVILVMV
Subjt:  GVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMV

Query:  YNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        YNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV K+EPQ+
Subjt:  YNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

A0A6J1G5W4 uncharacterized protein LOC111451157 | 6.5e-90 | 74.8
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SLEVDLSKELMAPPMP  E +VE+ IQ+DNR SPRW+LAPTRREQEKW+RA KAATGGSDVMFRELRRP+GDPEVLA+LSREQYFKLKKKLQ LTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G +S     ++SY   +       +ASFGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVY
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV KNEP A
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

A0A6J1I585 uncharacterized protein LOC111469729 | 1.7e-90 | 75.2
Query:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG
        +SLEVDLSKELMAPPMP  E +VE+ IQ+DNR SPRW+LAPTRREQEKW+RA KAATGGSDVMFRELRRP+GDPEVLA+LSREQYFKLKKKLQ LTLAIG
Subjt:  NSLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIG

Query:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY
                              G G +S     ++SY   +       +ASFGAGLIGSLVY+RMLGSSVDSLADGA+GLVKGAVAQPRLLVPVILVMVY
Subjt:  VLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVY

Query:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV KNEPQA
Subjt:  NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

SwissProt top hits | e value | % identity
O82279 Protein CONSERVED ONLY IN THE GREEN LINEAGE 160, chloroplastic | 3.7e-66 | 54.2
Query:  SSNSLEVDLSKELMAPPMPRSEELVEKNIQIDNRN----------SPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKL
        S +S++VDLSKEL +     S+ +V+  +                SP+WKLAPTRREQEKW+RA KAATGGSDVMFRELRRPRGDPEV A+  REQYFKL
Subjt:  SSNSLEVDLSKELMAPPMPRSEELVEKNIQIDNRN----------SPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKL

Query:  KKKLQILTLAIGVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQP
        K K+Q+LTL IG                      G G +S     +ISY   I       + SFGAGL+GSL Y+RMLG+SVD++ADGA+G+ KGA  QP
Subjt:  KKKLQILTLAIGVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQP

Query:  RLLVPVILVMVYNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        RLLVPV+LVM++NRWN ILV +YG M L+LIPMLVGFFTYK+ATF QA+EEA+++   +P++
Subjt:  RLLVPVILVMVYNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA

P08443 ATP synthase protein I | 5.3e-04 | 30.61
Query:  SASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKN
        +AS+  G +G L+Y+RMLG +V+ + +  +   K      RL + V+L+++  RW            L+L+P+ +GF TYK A     L   +  A+N
Subjt:  SASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKN

Arabidopsis top hits | e value | % identity
AT1G06320.1 unknown protein | 4.3e-17 | 40.98
Query:  KISLEDYLDFFFSNKQLVRTVNYLHQILRMHGYRRI-KAPKKALTDAVSTIDLVNPSRSTLK---ESVSSSASIPLEDVISDLKDLDWQECCVTSVLTFS
        KI++E+Y++F  S   +  T+ YL+QIL +HG+R++ K  KK + +AV ++DL++ SRSTLK   +S  SS+S+ L++VISD++ L WQECC TS+   +
Subjt:  KISLEDYLDFFFSNKQLVRTVNYLHQILRMHGYRRI-KAPKKALTDAVSTIDLVNPSRSTLK---ESVSSSASIPLEDVISDLKDLDWQECCVTSVLTFS

Query:  SWKQNNSGPSPGHQEVKSKQNA
        S +   S  S   Q+   ++ A
Subjt:  SWKQNNSGPSPGHQEVKSKQNA

AT2G31040.1 ATP synthase protein I -related | 2.6e-67 | 54.2
Query:  SSNSLEVDLSKELMAPPMPRSEELVEKNIQIDNRN----------SPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKL
        S +S++VDLSKEL +     S+ +V+  +                SP+WKLAPTRREQEKW+RA KAATGGSDVMFRELRRPRGDPEV A+  REQYFKL
Subjt:  SSNSLEVDLSKELMAPPMPRSEELVEKNIQIDNRN----------SPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKL

Query:  KKKLQILTLAIGVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQP
        K K+Q+LTL IG                      G G +S     +ISY   I       + SFGAGL+GSL Y+RMLG+SVD++ADGA+G+ KGA  QP
Subjt:  KKKLQILTLAIGVLVCSRLMFLIPQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQP

Query:  RLLVPVILVMVYNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
        RLLVPV+LVM++NRWN ILV +YG M L+LIPMLVGFFTYK+ATF QA+EEA+++   +P++
Subjt:  RLLVPVILVMVYNRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA
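
In the alignment blocks above, the unlabeled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. A rough sketch of how one block could be tallied from that middle line follows; `block_identity` is a hypothetical helper, and the example uses the final block of the first GenBank hit (KAG6604770.1). The figure it yields is for that single block only and is not the whole-alignment % identity reported in the table.

```python
def block_identity(query: str, midline: str):
    """Count identities and positives in one BLAST-style alignment block.

    Assumptions: the midline uses a letter for an identical residue, '+'
    for a conservative substitution, and a space otherwise. Trailing
    spaces may have been stripped during extraction, so the midline is
    padded back to the query segment length.
    """
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTVAKNEPQA"
midline = "NRWNGILVEDYGVMQLQLIPMLVGFFTYKVATFVQALEEALTV KNEPQA"
identities, positives, length = block_identity(query, midline)
print(f"{identities}/{length} identical ({100 * identities / length:.1f}%)")
```

Summing identities and column counts over every block of a hit would give a whole-alignment estimate, provided the extracted middle lines are complete.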


Sequences
CDS sequence
ATGGAGGAGTGCCGAAAACAGAACGCGGAAAGGCTCCGGAAGAAAGATAAGATCTCACTCGAAGACTACCTCGATTTCTTCTTCTCTAACAAGCAACTCGTCCGCACCGT
CAACTATCTTCATCAGATCCTTCGGATGCACGGCTACAGGAGAATCAAAGCTCCGAAGAAAGCATTGACCGATGCCGTAAGCACAATTGATCTGGTCAATCCCTCTCGTT
CCACACTCAAAGAGAGCGTCTCATCCTCAGCGTCGATTCCGCTCGAGGACGTAATATCGGACCTCAAAGACCTCGATTGGCAGGAATGTTGTGTCACATCGGTTCTTACA
TTCAGTTCGTGGAAGCAGAATAACTCCGGTCCTAGTCCGGGCCACCAGGAGGTGAAATCAAAGCAAAATGCTCGAGAAGCAGGATGTCTTGGTGAAGCTGACGCCATTCA
TGGAGTTTCCTCGTCGTCTGCCTCGAAGAAGCCGGGAGGTAAATTAGGACCCAAGATTAAATCATATAGGGTAGCTGAACGTGTTTACATCTTTCTGGAACTGTTTCAGT
TTGATCATTTTTTTCTCCGCAACAGTGGTGTTTCTGTTATTCTACATTCGCAGCATTGGGTTGGTGTATCCTTCACTCTTGTGTCAAATCCTTCGAGGCAGAGCAGCAAC
AGTTTGGAAGTTGATTTGAGCAAAGAACTAATGGCTCCTCCAATGCCTCGATCAGAAGAGTTAGTCGAAAAAAATATTCAGATTGATAACCGCAATTCACCCAGATGGAA
GTTAGCACCAACAAGGCGTGAGCAAGAGAAGTGGGAAAGGGCAAATAAGGCCGCTACTGGAGGCAGTGATGTGATGTTTCGAGAATTGAGACGGCCTCGAGGGGATCCAG
AAGTATTGGCTTCCTTATCCAGGGAACAGTATTTTAAGTTAAAGAAGAAGTTGCAAATCTTAACACTGGCAATAGGGGTGTTGGTTTGTTCTCGGCTTATGTTTCTTATT
CCCCAGAAGTTGCTGCTAGAGATTGTCGAGCCCGGTCAAGGATTTATGAGCTGTTGGAGGCACAAGCACATTAGCTATAAGCATTCAATTGATGATGTATGTGCTTTAAG
GAGCGCGAGTTTTGGTGCTGGGTTAATTGGATCTCTTGTGTACATACGAATGCTGGGAAGTAGTGTGGATTCTCTGGCAGATGGAGCAAAGGGACTTGTCAAGGGAGCTG
TTGCACAACCACGGTTATTAGTTCCAGTAATACTGGTGATGGTATATAACCGCTGGAATGGGATTCTTGTTGAAGATTATGGAGTTATGCAGTTACAGTTGATACCAATG
TTAGTTGGATTCTTCACATACAAGGTTGCTACTTTTGTTCAAGCTTTAGAGGAGGCACTTACTGTGGCGAAGAACGAGCCACAAGCCTAA
mRNA sequence
ATGGAGGAGTGCCGAAAACAGAACGCGGAAAGGCTCCGGAAGAAAGATAAGATCTCACTCGAAGACTACCTCGATTTCTTCTTCTCTAACAAGCAACTCGTCCGCACCGT
CAACTATCTTCATCAGATCCTTCGGATGCACGGCTACAGGAGAATCAAAGCTCCGAAGAAAGCATTGACCGATGCCGTAAGCACAATTGATCTGGTCAATCCCTCTCGTT
CCACACTCAAAGAGAGCGTCTCATCCTCAGCGTCGATTCCGCTCGAGGACGTAATATCGGACCTCAAAGACCTCGATTGGCAGGAATGTTGTGTCACATCGGTTCTTACA
TTCAGTTCGTGGAAGCAGAATAACTCCGGTCCTAGTCCGGGCCACCAGGAGGTGAAATCAAAGCAAAATGCTCGAGAAGCAGGATGTCTTGGTGAAGCTGACGCCATTCA
TGGAGTTTCCTCGTCGTCTGCCTCGAAGAAGCCGGGAGGTAAATTAGGACCCAAGATTAAATCATATAGGGTAGCTGAACGTGTTTACATCTTTCTGGAACTGTTTCAGT
TTGATCATTTTTTTCTCCGCAACAGTGGTGTTTCTGTTATTCTACATTCGCAGCATTGGGTTGGTGTATCCTTCACTCTTGTGTCAAATCCTTCGAGGCAGAGCAGCAAC
AGTTTGGAAGTTGATTTGAGCAAAGAACTAATGGCTCCTCCAATGCCTCGATCAGAAGAGTTAGTCGAAAAAAATATTCAGATTGATAACCGCAATTCACCCAGATGGAA
GTTAGCACCAACAAGGCGTGAGCAAGAGAAGTGGGAAAGGGCAAATAAGGCCGCTACTGGAGGCAGTGATGTGATGTTTCGAGAATTGAGACGGCCTCGAGGGGATCCAG
AAGTATTGGCTTCCTTATCCAGGGAACAGTATTTTAAGTTAAAGAAGAAGTTGCAAATCTTAACACTGGCAATAGGGGTGTTGGTTTGTTCTCGGCTTATGTTTCTTATT
CCCCAGAAGTTGCTGCTAGAGATTGTCGAGCCCGGTCAAGGATTTATGAGCTGTTGGAGGCACAAGCACATTAGCTATAAGCATTCAATTGATGATGTATGTGCTTTAAG
GAGCGCGAGTTTTGGTGCTGGGTTAATTGGATCTCTTGTGTACATACGAATGCTGGGAAGTAGTGTGGATTCTCTGGCAGATGGAGCAAAGGGACTTGTCAAGGGAGCTG
TTGCACAACCACGGTTATTAGTTCCAGTAATACTGGTGATGGTATATAACCGCTGGAATGGGATTCTTGTTGAAGATTATGGAGTTATGCAGTTACAGTTGATACCAATG
TTAGTTGGATTCTTCACATACAAGGTTGCTACTTTTGTTCAAGCTTTAGAGGAGGCACTTACTGTGGCGAAGAACGAGCCACAAGCCTAA
Protein sequence
MEECRKQNAERLRKKDKISLEDYLDFFFSNKQLVRTVNYLHQILRMHGYRRIKAPKKALTDAVSTIDLVNPSRSTLKESVSSSASIPLEDVISDLKDLDWQECCVTSVLT
FSSWKQNNSGPSPGHQEVKSKQNAREAGCLGEADAIHGVSSSSASKKPGGKLGPKIKSYRVAERVYIFLELFQFDHFFLRNSGVSVILHSQHWVGVSFTLVSNPSRQSSN
SLEVDLSKELMAPPMPRSEELVEKNIQIDNRNSPRWKLAPTRREQEKWERANKAATGGSDVMFRELRRPRGDPEVLASLSREQYFKLKKKLQILTLAIGVLVCSRLMFLI
PQKLLLEIVEPGQGFMSCWRHKHISYKHSIDDVCALRSASFGAGLIGSLVYIRMLGSSVDSLADGAKGLVKGAVAQPRLLVPVILVMVYNRWNGILVEDYGVMQLQLIPM
LVGFFTYKVATFVQALEEALTVAKNEPQA
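
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates that check on the first 33 and last 15 nucleotides of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 33 nt and last 15 nt of the CDS sequence listed above.
cds_start = "ATGGAGGAGTGCCGAAAACAGAACGCGGAAAGG"
cds_end = "GAGCCACAAGCCTAA"
print(translate(cds_start))
print(translate(cds_end))
```

Running the same function over the full CDS, with the line breaks removed, and dropping the trailing `*` should reproduce the listed protein sequence if the standard-code assumption holds.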