CuGenDBv2

Sgr023353 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023353
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function, DUF584
Genome location: tig00000892:2566838..2572454
RNA-Seq Expression: Sgr023353
Synteny: Sgr023353
Gene Ontology terms: NA
InterPro domains: IPR007608 - Senescence regulator S40
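
The genome location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the code assumes the `contig:start..end` layout seen in this record and 1-based inclusive coordinates, which the page does not state explicitly.

```python
import re

# Assumed layout: <contig>:<start>..<end>, as in the "Genome location" field.
LOCATION_RE = re.compile(r"([^:]+):(\d+)\.\.(\d+)")


def parse_location(location):
    """Split a location string into (contig, start, end)."""
    match = LOCATION_RE.fullmatch(location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00000892:2566838..2572454")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 5,617 bp on contig tig00000892.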


Homology
GenBank top hits (e value, % identity, alignment)
KAG6605549.1 hypothetical protein SDJN03_02866, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.1e-61; % identity: 76.3)
Query:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE
        MATGKSCYGR+NYRFL GD+   HHHSF+S+  FELNESDIYNSGS  SPE+RRS  P  R+SKKP  S+  VE     RGG+SS+PVNIPDWSKILKEE
Subjt:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE

Query:  YRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        YRE RS EYDED+EED +  E MRVPPHEFLARQ ARTR ASFSVHEGIGRTLKGRDLSRVRNAIWEK GFED
Subjt:  YRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

XP_008463944.1 PREDICTED: uncharacterized protein LOC103501949 [Cucumis melo] (e value: 7.9e-62; % identity: 76.44)
Query:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE
        MATGKSCYGR+NYRFL GD+ GHHHHHSF+S+ SFELNESDIYNSG+++SP         TR+ KK   S+KRVE    G GG+SS+PVNIPDWSKILKE
Subjt:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE

Query:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        EYRE RS EY +D+E+D D EE MRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
Subjt:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

XP_022140623.1 uncharacterized protein LOC111011233 [Momordica charantia] (e value: 4.5e-65; % identity: 77.84)
Query:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWG---RGGSSSVPVNIPDWSKIL
        MATGKSCYGR+NYRFLPG    H HHSF+S+SSFEL+ESDIYNS SSKSPEMRRS APG R+SKKP +  + VESGDWG    GG+SS+PVNIPDWSKIL
Subjt:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWG---RGGSSSVPVNIPDWSKIL

Query:  KEEYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        KEEYRE   +EYDED E DGD E+GMRVPPHEFL    ARTR+ASFSVHEGIGRTLKGRDLSRVRNAIWEKTGF+D
Subjt:  KEEYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

XP_022996303.1 uncharacterized protein LOC111491571 [Cucurbita maxima] (e value: 1.0e-61; % identity: 76.88)
Query:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE
        MA+GKSCYGR+NYRFL GD+   HHHSF+S+  FELNESDIYNSGS  SPE+RRS  P  R+SKKP  S+  VE     RGG+SS+PVNIPDWSKILKEE
Subjt:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE

Query:  YRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        YRE RS EYDED+EED D  E MRVPPHEFLARQMARTR ASFSVHEGIGRTLKGRDLSRVRNAIWEK GFED
Subjt:  YRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

XP_038901889.1 uncharacterized protein LOC120088567 [Benincasa hispida] (e value: 2.0e-65; % identity: 81.03)
Query:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE
        MATGKSCYGR+NYRFL GD+ GHHHHHSF+S+ SFELNESDIYNSGSSKSP   RS A  TR+SKKP  S+KRVE+G    GG+SS PVNIPDWSKILKE
Subjt:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE

Query:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        EYRE R  EY++DLEED D +E MRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
Subjt:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KBD5 Uncharacterized protein (e value: 1.3e-57; % identity: 73.3)
Query:  MATGKSCYG-RTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGS-SKSPEMRRSTAPGTRMSKKPGASNKRVE-SGDWGRGGSSSVPVNIPDWSKIL
        MATGKSCYG R++YRFLPGD+    HHSF+S+ SFELNESDIYNSGS ++SP         TR++KK  +S + VE  G  G GG+SS+PVNIPDWSKIL
Subjt:  MATGKSCYG-RTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGS-SKSPEMRRSTAPGTRMSKKPGASNKRVE-SGDWGRGGSSSVPVNIPDWSKIL

Query:  KEEYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        KEEYRE RS EY +D+EED + EE MRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
Subjt:  KEEYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

A0A1S3CKD1 uncharacterized protein LOC103501949 (e value: 3.8e-62; % identity: 76.44)
Query:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE
        MATGKSCYGR+NYRFL GD+ GHHHHHSF+S+ SFELNESDIYNSG+++SP         TR+ KK   S+KRVE    G GG+SS+PVNIPDWSKILKE
Subjt:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE

Query:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        EYRE RS EY +D+E+D D EE MRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
Subjt:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

A0A5A7SXR0 Uncharacterized protein (e value: 3.8e-62; % identity: 76.44)
Query:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE
        MATGKSCYGR+NYRFL GD+ GHHHHHSF+S+ SFELNESDIYNSG+++SP         TR+ KK   S+KRVE    G GG+SS+PVNIPDWSKILKE
Subjt:  MATGKSCYGRTNYRFLPGDL-GHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKE

Query:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        EYRE RS EY +D+E+D D EE MRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
Subjt:  EYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

A0A6J1CIE6 uncharacterized protein LOC111011233 (e value: 2.2e-65; % identity: 77.84)
Query:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWG---RGGSSSVPVNIPDWSKIL
        MATGKSCYGR+NYRFLPG    H HHSF+S+SSFEL+ESDIYNS SSKSPEMRRS APG R+SKKP +  + VESGDWG    GG+SS+PVNIPDWSKIL
Subjt:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWG---RGGSSSVPVNIPDWSKIL

Query:  KEEYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        KEEYRE   +EYDED E DGD E+GMRVPPHEFL    ARTR+ASFSVHEGIGRTLKGRDLSRVRNAIWEKTGF+D
Subjt:  KEEYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

A0A6J1KAF0 uncharacterized protein LOC111491571 (e value: 5.0e-62; % identity: 76.88)
Query:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE
        MA+GKSCYGR+NYRFL GD+   HHHSF+S+  FELNESDIYNSGS  SPE+RRS  P  R+SKKP  S+  VE     RGG+SS+PVNIPDWSKILKEE
Subjt:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE

Query:  YRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        YRE RS EYDED+EED D  E MRVPPHEFLARQMARTR ASFSVHEGIGRTLKGRDLSRVRNAIWEK GFED
Subjt:  YRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

SwissProt top hits (e value, % identity, alignment)
A0A023PXC2 Putative uncharacterized membrane protein YEL053W-A (e value: 3.4e-07; % identity: 61.7)
Query:  FTVTRRPFQSFAVSLAMSSPIFFGERPRGPIFGASELAAPTSPPVTL
        +TVT  P  S  V+LA+SSP FFG+RP+GPIFGA    APTSPP  L
Subjt:  FTVTRRPFQSFAVSLAMSSPIFFGERPRGPIFGASELAAPTSPPVTL

P87267 Putative uncharacterized protein YDR417C (e value: 1.3e-06; % identity: 59.57)
Query:  FTVTRRPFQSFAVSLAMSSPIFFGERPRGPIFGASELAAPTSPPVTL
        +TVT  P  S  V+LA+SSP FFG++P GPIFGA    APTSPP  L
Subjt:  FTVTRRPFQSFAVSLAMSSPIFFGERPRGPIFGASELAAPTSPPVTL
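
The % identity values reported for the two SwissProt hits above can be recomputed from the alignment midlines. The sketch below is illustrative only: the function name `percent_identity` is hypothetical, and it assumes the usual BLAST midline convention, in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. It also assumes identity is the number of identical positions divided by the alignment length.

```python
def percent_identity(query, midline):
    """Return (identical, length, percent) for one alignment block.

    Assumes BLAST-style midlines: letters are identities, '+' is a
    conservative substitution, and spaces are mismatches or gaps.
    """
    length = len(query)
    # Trailing spaces on the midline are easily lost in copied text; pad them back.
    midline = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in midline)
    return identical, length, 100.0 * identical / length


# Query and midlines copied from the two SwissProt alignments in this record.
query = "FTVTRRPFQSFAVSLAMSSPIFFGERPRGPIFGASELAAPTSPPVTL"
mid_yel053w_a = "+TVT  P  S  V+LA+SSP FFG+RP+GPIFGA    APTSPP  L"
mid_ydr417c = "+TVT  P  S  V+LA+SSP FFG++P GPIFGA    APTSPP  L"

for name, midline in [("A0A023PXC2", mid_yel053w_a), ("P87267", mid_ydr417c)]:
    identical, length, pct = percent_identity(query, midline)
    print(f"{name}: {identical}/{length} identical = {pct:.2f}%")
```

For these two hits the counts come to 29/47 and 28/47 identical positions, which agrees with the reported 61.7 and 59.57. Alignments printed in several blocks, as for the longer hits, would need their blocks concatenated before counting.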

Arabidopsis top hits (e value, % identity, alignment)
AT2G28400.1 Protein of unknown function, DUF584 (e value: 3.0e-27; % identity: 46.33)
Query:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE
        MAT K  Y R ++RF   D         ++ S FEL+E D++N+GS  S     S +  T  S + G +N+++  G      +SS+PVN+PDWSKIL +E
Subjt:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE

Query:  YRETRSFEYDEDLEEDGD----GEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
         R  R    +E  E DGD    GE   RVPPHE LA +    R+ASFSVHEG GRTLKGRDLSRVRN I++  G ED
Subjt:  YRETRSFEYDEDLEEDGD----GEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

AT3G45210.1 Protein of unknown function, DUF584 (e value: 2.5e-29; % identity: 48.26)
Query:  ATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEEY
        AT KS Y R ++RFLP D      ++ + +S FE +ESD+Y S  S SPE RR      R S     +   V         +SS+P+N+ +WSKIL +E 
Subjt:  ATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEEY

Query:  RETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        R++        +E D DG EG ++PPHE+L    A+TR+ASFSVHEGIGRTLKGRD+SRVRNAI EKTGF D
Subjt:  RETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

AT4G04630.1 Protein of unknown function, DUF584 (e value: 2.5e-21; % identity: 39.6)
Query:  SFSSESSFELNESDIYN---SGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEEYRETRSFE-YDEDLEEDGDGEEG
        S   E   E  E D+++    G + SPEM+   +  +  S    +      S +      SS P+N+PDWSK+  +     RS   +    ++D + ++G
Subjt:  SFSSESSFELNESDIYN---SGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEEYRETRSFE-YDEDLEEDGDGEEG

Query:  MRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGF
          VPPHE++AR++ART+I+SFS+ EG+GRTLKGRDLS+VRNA+  KTGF
Subjt:  MRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGF

AT5G03230.1 Protein of unknown function, DUF584 (e value: 2.0e-26; % identity: 49.34)
Query:  SSESSFELNESDIYNSGSSKSP---EMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEEYR--ETRSFEYDEDLEEDGDGEEGM
        S E+ FE +ESDI+N G  + P   + +RS +  +R+ +KP    K  +SG+     + S+PVNIPDWSKILK EYR       + D+D E+D D  +G 
Subjt:  SSESSFELNESDIYNSGSSKSP---EMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEEYR--ETRSFEYDEDLEEDGDGEEGM

Query:  R--VPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        R  +PPHE+LAR+    R +SF+VHEGIG T KGRDL R+RNAIWEK GF+D
Subjt:  R--VPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED

AT5G60680.1 Protein of unknown function, DUF584 (e value: 2.0e-34; % identity: 50.57)
Query:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE
        MATGKS Y R +YRFL  D      +  +S+S  E +ESD++N   S SP+  R  +   R  KK  +SN+   S       +SS+PVN+PDWSKIL+ E
Subjt:  MATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKILKEE

Query:  YRETRSFEYDEDLEEDGDGEEGMR-VPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
        YR+ R    +++ ++D D E+G   +PPHEFL    A+TR+ASFSVHEG+GRTLKGRDLSRVRNAI+EK GF+D
Subjt:  YRETRSFEYDEDLEEDGDGEEGMR-VPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED


Sequences
CDS sequence
ATGCACCCAACTGATACACAAGTTCCTAATATCTCCTTCACCGTTCCTGACAAATCTTTCGCCATCGATCTCAGCCGCATGATCTTCGCAATCTCGATCACCTTCACCGT
CACTCGGAGGCCCTTCCAGTCCTTTGCAGTCTCTTTGGCTATGTCTTCTCCGATTTTCTTCGGGGAGAGACCGAGAGGTCCAATCTTCGGAGCGAGTGAACTCGCCGCTC
CGACCTCGCCTCCGGTGACTCTAACGAAGAGCGATTTATCGCGAGATAATACGTGCAGGGGATATTGGAAATTGCAAGGAGATTTTGGGGTGTCGCTTGTGAGAGGACAG
TGGGGTAACAAGGCCAGCTGGCAAGCAAACGCACGAGATCCAAGCGTCTTTAAAACCCCACCCACGCCGCGACAAAAATTGACCGTAGCCATTTTTGTCTTCCCAAAATG
CCCCTCCTGCGCACTAGACAATGGTCAGGCTTCATCTTCAACACGTACGTGGCAGTCATCGGCACCATTCACAGAGATGTCGGTTGATACCGTCGCAGGGAGTTTTCGCT
TTATAAAAGAGCTTAACGAAATCGACTCAAACTCAACAATTTCTCCTCAGATTAGAGGAACTGTGAGAGTCCAAACGATCGGAGGGATCGATATTGTTTTTTGCTCTATT
CGAAGTTTCTTTGAAAAGCCAAACAGCACGCTGCTATCTTCCATGGCGACCGGAAAAAGCTGTTACGGTCGCACCAACTATCGCTTCCTTCCCGGCGACCTCGGCCACCA
CCACCACCACTCCTTCAGTTCTGAGTCGTCGTTCGAACTCAATGAATCAGACATTTACAACTCCGGTAGTTCAAAATCGCCGGAGATGCGGAGATCTACGGCTCCGGGAA
CGCGTATGTCGAAGAAGCCAGGTGCTTCCAACAAACGAGTGGAGAGCGGCGATTGGGGGCGGGGTGGTTCTTCTTCGGTGCCGGTGAATATCCCGGATTGGTCGAAGATT
CTGAAGGAGGAGTACAGAGAGACACGTAGCTTTGAATACGATGAAGACTTGGAGGAAGATGGCGATGGTGAGGAGGGCATGAGAGTGCCGCCGCACGAGTTTTTGGCGAG
GCAGATGGCGAGGACGAGGATCGCTTCTTTCTCGGTGCACGAAGGTATTGGAAGGACGTTGAAAGGGAGAGATCTGAGTAGGGTAAGAAATGCAATATGGGAAAAAACTG
GGTTTGAGGATTAA
mRNA sequence
ATGCACCCAACTGATACACAAGTTCCTAATATCTCCTTCACCGTTCCTGACAAATCTTTCGCCATCGATCTCAGCCGCATGATCTTCGCAATCTCGATCACCTTCACCGT
CACTCGGAGGCCCTTCCAGTCCTTTGCAGTCTCTTTGGCTATGTCTTCTCCGATTTTCTTCGGGGAGAGACCGAGAGGTCCAATCTTCGGAGCGAGTGAACTCGCCGCTC
CGACCTCGCCTCCGGTGACTCTAACGAAGAGCGATTTATCGCGAGATAATACGTGCAGGGGATATTGGAAATTGCAAGGAGATTTTGGGGTGTCGCTTGTGAGAGGACAG
TGGGGTAACAAGGCCAGCTGGCAAGCAAACGCACGAGATCCAAGCGTCTTTAAAACCCCACCCACGCCGCGACAAAAATTGACCGTAGCCATTTTTGTCTTCCCAAAATG
CCCCTCCTGCGCACTAGACAATGGTCAGGCTTCATCTTCAACACGTACGTGGCAGTCATCGGCACCATTCACAGAGATGTCGGTTGATACCGTCGCAGGGAGTTTTCGCT
TTATAAAAGAGCTTAACGAAATCGACTCAAACTCAACAATTTCTCCTCAGATTAGAGGAACTGTGAGAGTCCAAACGATCGGAGGGATCGATATTGTTTTTTGCTCTATT
CGAAGTTTCTTTGAAAAGCCAAACAGCACGCTGCTATCTTCCATGGCGACCGGAAAAAGCTGTTACGGTCGCACCAACTATCGCTTCCTTCCCGGCGACCTCGGCCACCA
CCACCACCACTCCTTCAGTTCTGAGTCGTCGTTCGAACTCAATGAATCAGACATTTACAACTCCGGTAGTTCAAAATCGCCGGAGATGCGGAGATCTACGGCTCCGGGAA
CGCGTATGTCGAAGAAGCCAGGTGCTTCCAACAAACGAGTGGAGAGCGGCGATTGGGGGCGGGGTGGTTCTTCTTCGGTGCCGGTGAATATCCCGGATTGGTCGAAGATT
CTGAAGGAGGAGTACAGAGAGACACGTAGCTTTGAATACGATGAAGACTTGGAGGAAGATGGCGATGGTGAGGAGGGCATGAGAGTGCCGCCGCACGAGTTTTTGGCGAG
GCAGATGGCGAGGACGAGGATCGCTTCTTTCTCGGTGCACGAAGGTATTGGAAGGACGTTGAAAGGGAGAGATCTGAGTAGGGTAAGAAATGCAATATGGGAAAAAACTG
GGTTTGAGGATTAA
Protein sequence
MHPTDTQVPNISFTVPDKSFAIDLSRMIFAISITFTVTRRPFQSFAVSLAMSSPIFFGERPRGPIFGASELAAPTSPPVTLTKSDLSRDNTCRGYWKLQGDFGVSLVRGQ
WGNKASWQANARDPSVFKTPPTPRQKLTVAIFVFPKCPSCALDNGQASSSTRTWQSSAPFTEMSVDTVAGSFRFIKELNEIDSNSTISPQIRGTVRVQTIGGIDIVFCSI
RSFFEKPNSTLLSSMATGKSCYGRTNYRFLPGDLGHHHHHSFSSESSFELNESDIYNSGSSKSPEMRRSTAPGTRMSKKPGASNKRVESGDWGRGGSSSVPVNIPDWSKI
LKEEYRETRSFEYDEDLEEDGDGEEGMRVPPHEFLARQMARTRIASFSVHEGIGRTLKGRDLSRVRNAIWEKTGFED
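
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the page does not say which translation table was used, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 30 and last 27 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (an assumption), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 27 nt of the CDS sequence in this record.
cds_start = "ATGCACCCAACTGATACACAAGTTCCTAAT"
cds_end = "TGGGAAAAAACTGGGTTTGAGGATTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MHPTDTQVPN` and `WEKTGFED*`, matching the start and end of the listed protein sequence, with the final `TAA` read as the stop codon.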