; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015139 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015139
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationtig00002956:1374075..1388274
RNA-Seq ExpressionSgr015139
SyntenySgr015139
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583445.1 hypothetical protein SDJN03_19377, partial [Cucurbita argyrosperma subsp. sororia]2.7e-8069.62Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F+  E+DN V + N  WRGRSYA SVASD+P PE D K V  EARRAMVESFVDKYK +N GKFPSIS T+KQVGGSFY++RKILQELQN+STMSSL+S+
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER
        S+   QE  IKE PNV GK L+AASDWQ  SSCAEKILSA+DDV+ A+LVSH  +P+R N+L+DSEEV S+S+KKPD+DN + D SEHV T+SH LKNER
Subjt:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER

Query:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
        DVVSDV +E   SSEELKHE P+CKEQQ+HSS E+DR
Subjt:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

KAG7019205.1 hypothetical protein SDJN02_18163 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-7563.42Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F+  E+DN V + N  WRGRSYA SVASD+P PE D K V  + RRAMVESFVDKYK +N GKFPSIS T+KQVGGSFY++RKILQELQN+STMSSL+S+
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIK--------------------ETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN
        S+   QE  IK                    E PNV GK L+AASDWQ  SSCAEKILSA+DDV+ A+LVSH  +P+R N+L+DSEEV S+S+KKPD+DN
Subjt:  SQNPSQEKAIK--------------------ETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN

Query:  -DVDSSEHVYTESHRLKNERDVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
         + D SEHV T+SH LKNERDVVSDV +E   SSEELKHE P+CKEQQ+HSS E+DR
Subjt:  -DVDSSEHVYTESHRLKNERDVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

XP_022156275.1 uncharacterized protein LOC111023208 [Momordica charantia]6.5e-8774.9Show/hide
Query:  FSWNEIDNVV--ASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLR
        F   E++NVV  +SSNI WRGRSYA SVA  IP+P+ DRK+VPIEARRAMVESFVDKYK+ NAGK PSIS+TQKQVGGSFYVVRKILQELQN+STM SL+
Subjt:  FSWNEIDNVV--ASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLR

Query:  SRSQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKN
        SRS+   +EKA KETPNVG K L+A SDW+MSSSCAEK LSADDDVEL+S VSH VLPMRRN+LED EEVSS S+KK DD+N D+D+SEHVYTES  LK+
Subjt:  SRSQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKN

Query:  ERDVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
        E DVVSDV LE  F SE+LKHE  +CKEQQ+HSSLELDR
Subjt:  ERDVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

XP_022964986.1 uncharacterized protein LOC111464933 [Cucurbita moschata]8.6e-7968.35Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F+  E+DN V + N  WRGRSYA SVASD+P PE D K V  + RRAMVESFVDKYK +N GKFPSIS T+KQVGGSFY++RKILQELQN+STMSSL+S+
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER
        S+   +E  IKE PNV GK L+AASDWQ  SSCAEKILSA+DDV+ A+LVSH  +P+R N+L+DSEEV S+S+KKPD+DN + D SEHV T+SH LKNER
Subjt:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER

Query:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
        DVVSDV +E   SSEELKHE P+CKEQQ+HSS E+DR
Subjt:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

XP_022970550.1 uncharacterized protein LOC111469494 [Cucurbita maxima]5.5e-7868.35Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F+  ++DN V + N  WRGRSYA SVASD+P PE DRK V  E RRAMVESFVDKYK +N GKFPSI+ T KQVGGSFY +RKILQELQN+STMSSL S+
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER
        S+   +E  IKE PNV GK L+AASDWQ  S CAEKILSA+DDV+ A+LVSH  +P+R N+L DSEEV S+S+KKPD+DN ++D SEHV T+SH LKNER
Subjt:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER

Query:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
        DVVSDV LE   SSEELKHE P+CKEQQ+HSS E+DR
Subjt:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

TrEMBL top hitse value%identityAlignment
A0A1S3C6C2 uncharacterized protein LOC103497179 isoform X11.2e-6562.04Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F W E+DN V +SN  WRGRSY  SVASDIP P  DRK VPIE RRAM+ESFV KYK +N GKFPS++ T K+VGGS+YVVRKI+QELQN+S++S L+ R
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIK-------ETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDNDVDSSEHVYTESH
        S+   QE  IK       E+ NV GKHL+AAS+ Q  SSCAE  LSA DD      VSH VLPMR N+LEDSE++ S+  K  DDD   D S+ V TESH
Subjt:  SQNPSQEKAIK-------ETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDNDVDSSEHVYTESH

Query:  RLKNERDVVSDVHLEGRFSSEELKH-EGPHCKEQQIHSSLELDRM
         LKNERDVVSDVHLE R +SEELKH EGP+ KEQQ+ SS EL R+
Subjt:  RLKNERDVVSDVHLEGRFSSEELKH-EGPHCKEQQIHSSLELDRM

A0A5A7SRN1 Uncharacterized protein1.2e-6562.04Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F W E+DN V +SN  WRGRSY  SVASDIP P  DRK VPIE RRAM+ESFV KYK +N GKFPS++ T K+VGGS+YVVRKI+QELQN+S++S L+ R
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIK-------ETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDNDVDSSEHVYTESH
        S+   QE  IK       E+ NV GKHL+AAS+ Q  SSCAE  LSA DD      VSH VLPMR N+LEDSE++ S+  K  DDD   D S+ V TESH
Subjt:  SQNPSQEKAIK-------ETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDNDVDSSEHVYTESH

Query:  RLKNERDVVSDVHLEGRFSSEELKH-EGPHCKEQQIHSSLELDRM
         LKNERDVVSDVHLE R +SEELKH EGP+ KEQQ+ SS EL R+
Subjt:  RLKNERDVVSDVHLEGRFSSEELKH-EGPHCKEQQIHSSLELDRM

A0A6J1DRM1 uncharacterized protein LOC1110232083.2e-8774.9Show/hide
Query:  FSWNEIDNVV--ASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLR
        F   E++NVV  +SSNI WRGRSYA SVA  IP+P+ DRK+VPIEARRAMVESFVDKYK+ NAGK PSIS+TQKQVGGSFYVVRKILQELQN+STM SL+
Subjt:  FSWNEIDNVV--ASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLR

Query:  SRSQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKN
        SRS+   +EKA KETPNVG K L+A SDW+MSSSCAEK LSADDDVEL+S VSH VLPMRRN+LED EEVSS S+KK DD+N D+D+SEHVYTES  LK+
Subjt:  SRSQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKN

Query:  ERDVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
        E DVVSDV LE  F SE+LKHE  +CKEQQ+HSSLELDR
Subjt:  ERDVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

A0A6J1HKG5 uncharacterized protein LOC1114649334.1e-7968.35Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F+  E+DN V + N  WRGRSYA SVASD+P PE D K V  + RRAMVESFVDKYK +N GKFPSIS T+KQVGGSFY++RKILQELQN+STMSSL+S+
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER
        S+   +E  IKE PNV GK L+AASDWQ  SSCAEKILSA+DDV+ A+LVSH  +P+R N+L+DSEEV S+S+KKPD+DN + D SEHV T+SH LKNER
Subjt:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER

Query:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
        DVVSDV +E   SSEELKHE P+CKEQQ+HSS E+DR
Subjt:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

A0A6J1I365 uncharacterized protein LOC1114694942.7e-7868.35Show/hide
Query:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR
        F+  ++DN V + N  WRGRSYA SVASD+P PE DRK V  E RRAMVESFVDKYK +N GKFPSI+ T KQVGGSFY +RKILQELQN+STMSSL S+
Subjt:  FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSR

Query:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER
        S+   +E  IKE PNV GK L+AASDWQ  S CAEKILSA+DDV+ A+LVSH  +P+R N+L DSEEV S+S+KKPD+DN ++D SEHV T+SH LKNER
Subjt:  SQNPSQEKAIKETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDN-DVDSSEHVYTESHRLKNER

Query:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR
        DVVSDV LE   SSEELKHE P+CKEQQ+HSS E+DR
Subjt:  DVVSDVHLEGRFSSEELKHEGPHCKEQQIHSSLELDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding1.5e-0950Show/hide
Query:  RKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQEL
        R  +P E R+ +VESF+ K++  N G FPS+S T K+VGGSFY +R+I++E+
Subjt:  RKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQEL

AT5G58210.1 hydroxyproline-rich glycoprotein family protein4.3e-1238.1Show/hide
Query:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE
        N V  +++    R Y      +        K +  + RRA+VESFV++Y+ TNAG+FPS+  T KQVGGS+Y+VR I QEL+       L+ ++  P   
Subjt:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE

Query:  KAIKE
        KA+ E
Subjt:  KAIKE

AT5G58210.2 hydroxyproline-rich glycoprotein family protein4.3e-1238.1Show/hide
Query:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE
        N V  +++    R Y      +        K +  + RRA+VESFV++Y+ TNAG+FPS+  T KQVGGS+Y+VR I QEL+       L+ ++  P   
Subjt:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE

Query:  KAIKE
        KA+ E
Subjt:  KAIKE

AT5G58210.3 hydroxyproline-rich glycoprotein family protein4.3e-1238.1Show/hide
Query:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE
        N V  +++    R Y      +        K +  + RRA+VESFV++Y+ TNAG+FPS+  T KQVGGS+Y+VR I QEL+       L+ ++  P   
Subjt:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE

Query:  KAIKE
        KA+ E
Subjt:  KAIKE

AT5G58210.4 hydroxyproline-rich glycoprotein family protein4.3e-1238.1Show/hide
Query:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE
        N V  +++    R Y      +        K +  + RRA+VESFV++Y+ TNAG+FPS+  T KQVGGS+Y+VR I QEL+       L+ ++  P   
Subjt:  NVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQE

Query:  KAIKE
        KA+ E
Subjt:  KAIKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGTTCAGTTGGAATGAGATCGATAATGTTGTAGCCAGCTCAAATATTTTGTGGCGTGGAAGATCATATGCGGTGTCTGTTGCTTCTGATATACCTAATCCCGAGAACGAT
CGTAAAAGTGTTCCTATAGAAGCTCGGCGAGCAATGGTCGAATCTTTTGTGGACAAGTACAAGACAACAAATGCTGGGAAGTTCCCATCGATATCGCATACTCAGAAACA
AGTGGGTGGCTCTTTTTATGTGGTTAGGAAAATCCTTCAGGAGCTGCAGAATAAATCTACAATGTCGTCCTTAAGGAGTAGAAGTCAAAATCCATCTCAAGAAAAGGCAA
TCAAGGAGACCCCTAATGTTGGTGGCAAACATTTGAAAGCAGCATCTGATTGGCAAATGTCATCCTCTTGTGCTGAGAAGATCTTGTCTGCTGATGATGATGTTGAGCTT
GCAAGTCTTGTTAGCCATTGTGTCCTTCCAATGAGAAGGAATGTACTGGAGGACTCTGAGGAAGTTTCTTCTGCTTCTTATAAGAAACCAGATGATGATAACGACGTGGA
CAGTTCTGAACATGTTTATACTGAAAGCCATAGGCTAAAAAATGAACGAGATGTAGTTTCTGATGTTCACCTGGAAGGTAGATTTTCATCTGAAGAGCTGAAGCATGAAG
GTCCACATTGTAAAGAGCAACAAATCCATAGTTCTCTTGAATTAGACAGGATGCCACCTGCCGCACCAATGGCAACTAAAGAGGCCCAACCGATGCAAGAGGAATTCTGT
ATGGATTTGACCAACATTCTGTCAACTTTGCTACAAAATTGGGTTCTCCCTTTGGATAAAAGATTTAACAAGGGATCTTTTCTTCCAAGAACCTAG
mRNA sequenceShow/hide mRNA sequence
TGTTCAGTTGGAATGAGATCGATAATGTTGTAGCCAGCTCAAATATTTTGTGGCGTGGAAGATCATATGCGGTGTCTGTTGCTTCTGATATACCTAATCCCGAGAACGAT
CGTAAAAGTGTTCCTATAGAAGCTCGGCGAGCAATGGTCGAATCTTTTGTGGACAAGTACAAGACAACAAATGCTGGGAAGTTCCCATCGATATCGCATACTCAGAAACA
AGTGGGTGGCTCTTTTTATGTGGTTAGGAAAATCCTTCAGGAGCTGCAGAATAAATCTACAATGTCGTCCTTAAGGAGTAGAAGTCAAAATCCATCTCAAGAAAAGGCAA
TCAAGGAGACCCCTAATGTTGGTGGCAAACATTTGAAAGCAGCATCTGATTGGCAAATGTCATCCTCTTGTGCTGAGAAGATCTTGTCTGCTGATGATGATGTTGAGCTT
GCAAGTCTTGTTAGCCATTGTGTCCTTCCAATGAGAAGGAATGTACTGGAGGACTCTGAGGAAGTTTCTTCTGCTTCTTATAAGAAACCAGATGATGATAACGACGTGGA
CAGTTCTGAACATGTTTATACTGAAAGCCATAGGCTAAAAAATGAACGAGATGTAGTTTCTGATGTTCACCTGGAAGGTAGATTTTCATCTGAAGAGCTGAAGCATGAAG
GTCCACATTGTAAAGAGCAACAAATCCATAGTTCTCTTGAATTAGACAGGATGCCACCTGCCGCACCAATGGCAACTAAAGAGGCCCAACCGATGCAAGAGGAATTCTGT
ATGGATTTGACCAACATTCTGTCAACTTTGCTACAAAATTGGGTTCTCCCTTTGGATAAAAGATTTAACAAGGGATCTTTTCTTCCAAGAACCTAG
Protein sequenceShow/hide protein sequence
FSWNEIDNVVASSNILWRGRSYAVSVASDIPNPENDRKSVPIEARRAMVESFVDKYKTTNAGKFPSISHTQKQVGGSFYVVRKILQELQNKSTMSSLRSRSQNPSQEKAI
KETPNVGGKHLKAASDWQMSSSCAEKILSADDDVELASLVSHCVLPMRRNVLEDSEEVSSASYKKPDDDNDVDSSEHVYTESHRLKNERDVVSDVHLEGRFSSEELKHEG
PHCKEQQIHSSLELDRMPPAAPMATKEAQPMQEEFCMDLTNILSTLLQNWVLPLDKRFNKGSFLPRT