; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020617 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020617
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionmajor pollen allergen Lol p 11-like
Genome locationtig00153552:479714..484029
RNA-Seq ExpressionSgr020617
SyntenySgr020617
Gene Ontology termsNA
InterPro domainsIPR006041 - Pollen allergen Ole e 1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600638.1 Protein DOWNSTREAM OF FLC, partial [Cucurbita argyrosperma subsp. sororia]6.8e-6685.92Show/hide
Query:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV
        SAQPLQKPF+V+G VYCD+CRCGFETNAS PISGA VRLECRDRAKW+LKFSK+AITNS+G+YTI V+ DHKDESCK+VLVSSPHPTCN+PD GRNSATV
Subjt:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV

Query:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        ILTNNNGLTNN+R+ANAMGF TQRPLALCP+LLKQYLDYDES
Subjt:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

XP_022136800.1 protein DOWNSTREAM OF FLC [Momordica charantia]2.3e-6981.93Show/hide
Query:  MAAN--LSMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESC
        MAA+  + +   VCVLLLPA  L  AQ LQKP+VV+G VYCD+CRCGFET+AS P+SGARVRLECRDRAKW+L FSKEAITNS+G+Y+IVVTEDHKDESC
Subjt:  MAAN--LSMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESC

Query:  KVVLVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        KVVLVSSPHPTC+EPDRGRNSATVILTNNNGLTNNIR+ANAMGFLT+RPLALCPALLKQYLDYD+S
Subjt:  KVVLVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

XP_022941733.1 protein DOWNSTREAM OF FLC-like [Cucurbita moschata]6.8e-6685.92Show/hide
Query:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV
        SAQPLQKPF+V+G VYCD+CRCGFETNAS PISGA VRLECRDRAKW+LKFSK+AITNS+G+YTI V+ DHKDESCK+VLVSSPHPTCN+PD GRNSATV
Subjt:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV

Query:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        ILTNNNGLTNN+R+ANAMGF TQRPLALCP+LLKQYLDYDES
Subjt:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

XP_022983080.1 major pollen allergen Lol p 11-like [Cucurbita maxima]3.4e-6585.21Show/hide
Query:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV
        SAQPLQKPF+V+G VYCD+CRCGFETNAS PISGA VRLECRDR KW+LKFSK+AITNS+G+YTI V+ DHKDESCK+VLVSSPHPTCN+PD GRNSATV
Subjt:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV

Query:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        ILTNNNGLTNN+R+ANAMGF TQRPLALCP+LLKQYLDYDES
Subjt:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

XP_023525170.1 protein DOWNSTREAM OF FLC-like isoform X2 [Cucurbita pepo subsp. pepo]2.6e-6578.05Show/hide
Query:  AANLS-MFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKV
        AANL  +   VC ++       SAQPLQKPF+V+G VYCD+CRCGFETNAS PISGA VRLECRDRAKW+LKFSK+AITNS+G+YTI V+ DHKDESCK+
Subjt:  AANLS-MFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKV

Query:  VLVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        VLVSSPHPTCN+PD GRNSATVILTNNNGLTNN+R+ANAMGF TQRPLALCP+LLKQYLDYDES
Subjt:  VLVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

TrEMBL top hitse value%identityAlignment
A0A0A0KUJ2 Uncharacterized protein2.9e-5472.96Show/hide
Query:  MFFAVCVL-LLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSS
        +   VC L LL  LP  SAQ     FVV+G VYCD+CRCGFETN S PISGA+VRLECRDRA W+LKF+KEAIT+SKG+Y I V EDHKDESCKVVLVSS
Subjt:  MFFAVCVL-LLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSS

Query:  PHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        PH  CN PD GRNSATVILTNNNGLT+  R+ANAMGFL +RPLA CP +LKQYLD+DES
Subjt:  PHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

A0A5A7SZE2 Major pollen allergen Lol p 11-like1.9e-5371.7Show/hide
Query:  MFFAVCVL-LLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSS
        +   VC L LL  LP  SAQ     FVV+G VYCD+CRCGFETN S P+SGA+VRLECRDRA W+LKF+KEA TNSKG+Y I V EDHKDESCKVVLVSS
Subjt:  MFFAVCVL-LLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSS

Query:  PHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        PH  C+ PD GRNSATVILTNNNGLT+  R+ANAMGFL +RPLA CP +LKQYLD+DES
Subjt:  PHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

A0A6J1C4Z4 protein DOWNSTREAM OF FLC1.1e-6981.93Show/hide
Query:  MAAN--LSMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESC
        MAA+  + +   VCVLLLPA  L  AQ LQKP+VV+G VYCD+CRCGFET+AS P+SGARVRLECRDRAKW+L FSKEAITNS+G+Y+IVVTEDHKDESC
Subjt:  MAAN--LSMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESC

Query:  KVVLVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        KVVLVSSPHPTC+EPDRGRNSATVILTNNNGLTNNIR+ANAMGFLT+RPLALCPALLKQYLDYD+S
Subjt:  KVVLVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

A0A6J1FSY1 protein DOWNSTREAM OF FLC-like3.3e-6685.92Show/hide
Query:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV
        SAQPLQKPF+V+G VYCD+CRCGFETNAS PISGA VRLECRDRAKW+LKFSK+AITNS+G+YTI V+ DHKDESCK+VLVSSPHPTCN+PD GRNSATV
Subjt:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV

Query:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        ILTNNNGLTNN+R+ANAMGF TQRPLALCP+LLKQYLDYDES
Subjt:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

A0A6J1J4Q1 major pollen allergen Lol p 11-like1.6e-6585.21Show/hide
Query:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV
        SAQPLQKPF+V+G VYCD+CRCGFETNAS PISGA VRLECRDR KW+LKFSK+AITNS+G+YTI V+ DHKDESCK+VLVSSPHPTCN+PD GRNSATV
Subjt:  SAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATV

Query:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES
        ILTNNNGLTNN+R+ANAMGF TQRPLALCP+LLKQYLDYDES
Subjt:  ILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDES

SwissProt top hitse value%identityAlignment
P33050 Pollen-specific protein C133.2e-2640.51Show/hide
Query:  SMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSS
        ++   +CV+L  A    +  P    +V+ GRVYCD+CR GF TN +  I+GA+VRLEC+      L+ + + +T++ G YTI + + H+++ C+VVLV+S
Subjt:  SMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSS

Query:  PHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDE
        P   C+E    R+ A V+LT N G+++++R AN +G+    PL +C ALLKQ LD D+
Subjt:  PHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDE

Q29W25 Pollen allergen Cro s 11.4e-2441.67Show/hide
Query:  AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPT
        A+CVL L  +    A   +  F V G VYCD+CR  F T  S  + GA V+LECR+       F  EA+T+  G+Y+I V  D +D+ C++ LV SP+  
Subjt:  AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPT

Query:  CNEPDR---GRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYD
        C+E       + SA V LT+NNG  ++IR ANA+GF+ + PL  CP +LK+   YD
Subjt:  CNEPDR---GRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYD

Q8H6L7 Pollen allergen Phl p 111.4e-2445.74Show/hide
Query:  FVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATVILTNNNGL
        FVV+GRVYCD CR GFETN S  + GA V ++CR       K   EA T+  G Y I + +DH++E C+VVL  SP  TC+E +  R+ A V LT+NNG+
Subjt:  FVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTCNEPDRGRNSATVILTNNNGL

Query:  -TNNIRFANAMGFLTQRPLALCPALLKQY
            IR+AN + F  + PL  C  +L+ Y
Subjt:  -TNNIRFANAMGFLTQRPLALCPALLKQY

Q8LGR0 Pollen allergen Che a 11.0e-2441.67Show/hide
Query:  AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPT
        A+CVL L  +    A   +  F V G VYCD+CR  F T  S  + GA V+LECR+       F  EA+T+  G+Y+I V  D +D+ C++ LV SP+  
Subjt:  AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPT

Query:  CNEPDR---GRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYD
        C+E       + SA V LT+NNG  ++IR ANA+GF+ + PL  CP +LK+   YD
Subjt:  CNEPDR---GRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYD

Q9LX15 Protein DOWNSTREAM OF FLC6.9e-2946.31Show/hide
Query:  VCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTC
        +CVL+LP    L+A  +  PF + G VYCD+CR GFET A+  I GARVR+ C+DR     +    A+T   G+Y + V  D +D+ C   LV SP   C
Subjt:  VCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTC

Query:  NEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYL
         E D GR++ATVILT +NG  +   +ANAMGF    PL  C AL K+YL
Subjt:  NEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYL

Arabidopsis top hitse value%identityAlignment
AT1G78040.1 Pollen Ole e 1 allergen and extensin family protein2.0e-2336.81Show/hide
Query:  ANLSMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFET-NASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVV
        A L M   +C+L   A+        +   VV G  YCD C+ GFET  +S  I GA V+L C+DR      ++ +A+++ +G+Y  +V +DH+D+ C V+
Subjt:  ANLSMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFET-NASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVV

Query:  LVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYL-DYDE
        LV S   TC++   GR  + VIL + +G+ + IR AN MGF  +     C AL ++Y+ D DE
Subjt:  LVSSPHPTCNEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYL-DYDE

AT4G08685.1 Pollen Ole e 1 allergen and extensin family protein1.7e-3046.41Show/hide
Query:  VCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTC
        V +  LPAL  ++A+P + PFVV GRVYCD+C  GFET AS  ISGA VRLEC+DR    L +S EA T+S G Y I+V EDH ++ C  +LV S    C
Subjt:  VCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTC

Query:  NEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDE
        +    G + A V LT  NG+ ++ RFAN MGFL    +  C  ++K Y + ++
Subjt:  NEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDE

AT4G18596.1 Pollen Ole e 1 allergen and extensin family protein4.8e-2539.02Show/hide
Query:  MAANLSMFF---AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDES
        MA+    FF   AVC+  L  + +  A    + F + G VYCD+CR  F T  S  + GA+V+LECR R    +  +KEA+T+  G Y + VT DH++E 
Subjt:  MAANLSMFF---AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDES

Query:  CKVVLVSSPHPTCNEPDRG---RNSATVILTNNNGL-TNNIRFANAMGFLTQRPLALCPALLKQ
        C++VLV SP   C++  +    RN+A + LT N+G+ ++  R  N +GF+ Q P A CPA  K+
Subjt:  CKVVLVSSPHPTCNEPDRG---RNSATVILTNNNGL-TNNIRFANAMGFLTQRPLALCPALLKQ

AT5G10130.1 Pollen Ole e 1 allergen and extensin family protein4.9e-3046.31Show/hide
Query:  VCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTC
        +CVL+LP    L+A  +  PF + G VYCD+CR GFET A+  I GARVR+ C+DR     +    A+T   G+Y + V  D +D+ C   LV SP   C
Subjt:  VCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTC

Query:  NEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYL
         E D GR++ATVILT +NG  +   +ANAMGF    PL  C AL K+YL
Subjt:  NEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYL

AT5G45880.1 Pollen Ole e 1 allergen and extensin family protein1.1e-2440.13Show/hide
Query:  AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPT
        AVC+  L       A    + F + G VYCD+CR  F T  S  + GA+V+LECR R    +  +KEA+T+  G Y + VT DH++E C++VLV SP   
Subjt:  AVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPT

Query:  CNEPDRG---RNSATVILTNNNGL-TNNIRFANAMGFLTQRPLALCPALLKQ
        C++       RN+A + LT N+G+ ++  R  N +GF+ Q PLA CPA  K+
Subjt:  CNEPDRG---RNSATVILTNNNGL-TNNIRFANAMGFLTQRPLALCPALLKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCTAATCTTTCCATGTTCTTTGCTGTTTGCGTTCTGCTGCTTCCGGCGCTGCCGCTGCTCTCGGCGCAGCCTCTGCAGAAGCCCTTCGTCGTCAGTGGCCGTGT
CTACTGCGACTCTTGCCGCTGCGGCTTCGAGACCAACGCCTCCATTCCCATTTCTGGTGCAAGAGTGAGACTAGAATGCAGAGACAGAGCCAAGTGGATGTTGAAGTTCA
GCAAAGAAGCCATCACCAACTCCAAAGGAGAATACACCATTGTTGTCACAGAAGACCACAAGGATGAGAGCTGCAAGGTGGTTCTCGTGAGCAGCCCTCACCCCACCTGC
AACGAGCCCGACCGCGGTCGCAATAGCGCCACCGTCATCCTCACCAACAACAACGGCCTGACCAACAACATTCGCTTTGCAAATGCCATGGGCTTCTTAACCCAACGCCC
TCTCGCTCTCTGCCCCGCCCTCCTCAAACAGTATCTTGATTATGACGAATCGTTCCGGAGCCCATTTCTTCCGCCGGACTTGCGTTCAAATCCATGCTCTCTCTGTGCAT
CTCAAACGCTTCGTCCATCGCCCGCTCCCCGACGCCGCCCTTCCAATACCCGCTGCCGTAGTCCTCCACCGTGCTGCTGCCGGACCCATCTCCGGCGTCGTCTATCCGCG
CCTCGTCGCTGTTCGGATCGGCCCTCGATAGCTCCCAGTTCATCCACTCCACCACGCTCTTCAGGTGCGTATCGATTTCATGCTCGTCCGGAAAGTGCCCCACTGTTGTG
TTGGTTTTGTTGTTTCCCGTTTCGTTATCGTTGTTTTGTCGGTCGGTTCTGTAAAGGCACTGTGTTTCGTGTTCTGTTCTGGCGTTCTTGTCGGCGAACCCCAACGGCAG
CTCGCTCTGCGGGCACCATTGATTCTGGCAAGCGTAGAGCGCGTCGGCGGCGGCTTCTCGATCGAACAGACACTTTCGTTTCTCTTTCCCACTTAAACTGGGCGGAACGC
CGTGGCCTCCGGCGGCGCCTTGCTCGCCCCACCAAATCTCCTTTCCTGTCGGCCACCACGGCGGGGCTAAACCCTTATCCAGCGGGAACTTTCTCTGCGGCGGAATACAG
TGCTGCATCAGAGCAGAGAGAATGGACCCTAAAGTCGTGTCCTGCAAATCGTGGAGGAGATGAATTTGGCTATGGCCATGGGAGCATCTTGTTCGAACCGAACGTCGTCC
CTCCACCACTCCCGGAGGCTCTCGGAGGAGCCTGTGACGGGCTTCCCCTTCTCCGGCACAATCCCATAAACAAACCCCTGTGCCTTGCAAGCCTCCATGATCTTATCCAT
GTACTTGAGAATCGAATCTTGAGCTCTCGCCATCTTCTTCCGGCGAGAGGCTTCCTCTCTGGCTGCCGACTCAGGCTCCTCACCATCACGCTCCTTCTTCATCTTCTTCA
ACCGCTGTCTATCTTTCCACATTCGCTTCTTGAGCTCATCGTAGCTAATATCTTTTTCTTCTTCTTCTTCTTCATCTTCTTGTGGATCATCTCCATCGTTGGGACCTCGG
ATTTCTCCATGAAACTTCACCATCATCAGAGACCTTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCTAATCTTTCCATGTTCTTTGCTGTTTGCGTTCTGCTGCTTCCGGCGCTGCCGCTGCTCTCGGCGCAGCCTCTGCAGAAGCCCTTCGTCGTCAGTGGCCGTGT
CTACTGCGACTCTTGCCGCTGCGGCTTCGAGACCAACGCCTCCATTCCCATTTCTGGTGCAAGAGTGAGACTAGAATGCAGAGACAGAGCCAAGTGGATGTTGAAGTTCA
GCAAAGAAGCCATCACCAACTCCAAAGGAGAATACACCATTGTTGTCACAGAAGACCACAAGGATGAGAGCTGCAAGGTGGTTCTCGTGAGCAGCCCTCACCCCACCTGC
AACGAGCCCGACCGCGGTCGCAATAGCGCCACCGTCATCCTCACCAACAACAACGGCCTGACCAACAACATTCGCTTTGCAAATGCCATGGGCTTCTTAACCCAACGCCC
TCTCGCTCTCTGCCCCGCCCTCCTCAAACAGTATCTTGATTATGACGAATCGTTCCGGAGCCCATTTCTTCCGCCGGACTTGCGTTCAAATCCATGCTCTCTCTGTGCAT
CTCAAACGCTTCGTCCATCGCCCGCTCCCCGACGCCGCCCTTCCAATACCCGCTGCCGTAGTCCTCCACCGTGCTGCTGCCGGACCCATCTCCGGCGTCGTCTATCCGCG
CCTCGTCGCTGTTCGGATCGGCCCTCGATAGCTCCCAGTTCATCCACTCCACCACGCTCTTCAGGTGCGTATCGATTTCATGCTCGTCCGGAAAGTGCCCCACTGTTGTG
TTGGTTTTGTTGTTTCCCGTTTCGTTATCGTTGTTTTGTCGGTCGGTTCTGTAAAGGCACTGTGTTTCGTGTTCTGTTCTGGCGTTCTTGTCGGCGAACCCCAACGGCAG
CTCGCTCTGCGGGCACCATTGATTCTGGCAAGCGTAGAGCGCGTCGGCGGCGGCTTCTCGATCGAACAGACACTTTCGTTTCTCTTTCCCACTTAAACTGGGCGGAACGC
CGTGGCCTCCGGCGGCGCCTTGCTCGCCCCACCAAATCTCCTTTCCTGTCGGCCACCACGGCGGGGCTAAACCCTTATCCAGCGGGAACTTTCTCTGCGGCGGAATACAG
TGCTGCATCAGAGCAGAGAGAATGGACCCTAAAGTCGTGTCCTGCAAATCGTGGAGGAGATGAATTTGGCTATGGCCATGGGAGCATCTTGTTCGAACCGAACGTCGTCC
CTCCACCACTCCCGGAGGCTCTCGGAGGAGCCTGTGACGGGCTTCCCCTTCTCCGGCACAATCCCATAAACAAACCCCTGTGCCTTGCAAGCCTCCATGATCTTATCCAT
GTACTTGAGAATCGAATCTTGAGCTCTCGCCATCTTCTTCCGGCGAGAGGCTTCCTCTCTGGCTGCCGACTCAGGCTCCTCACCATCACGCTCCTTCTTCATCTTCTTCA
ACCGCTGTCTATCTTTCCACATTCGCTTCTTGAGCTCATCGTAGCTAATATCTTTTTCTTCTTCTTCTTCTTCATCTTCTTGTGGATCATCTCCATCGTTGGGACCTCGG
ATTTCTCCATGAAACTTCACCATCATCAGAGACCTTGTTAA
Protein sequenceShow/hide protein sequence
MAANLSMFFAVCVLLLPALPLLSAQPLQKPFVVSGRVYCDSCRCGFETNASIPISGARVRLECRDRAKWMLKFSKEAITNSKGEYTIVVTEDHKDESCKVVLVSSPHPTC
NEPDRGRNSATVILTNNNGLTNNIRFANAMGFLTQRPLALCPALLKQYLDYDESFRSPFLPPDLRSNPCSLCASQTLRPSPAPRRRPSNTRCRSPPPCCCRTHLRRRLSA
PRRCSDRPSIAPSSSTPPRSSGAYRFHARPESAPLLCWFCCFPFRYRCFVGRFCKGTVFRVLFWRSCRRTPTAARSAGTIDSGKRRARRRRLLDRTDTFVSLSHLNWAER
RGLRRRLARPTKSPFLSATTAGLNPYPAGTFSAAEYSAASEQREWTLKSCPANRGGDEFGYGHGSILFEPNVVPPPLPEALGGACDGLPLLRHNPINKPLCLASLHDLIH
VLENRILSSRHLLPARGFLSGCRLRLLTITLLLHLLQPLSIFPHSLLELIVANIFFFFFFFIFLWIISIVGTSDFSMKLHHHQRPC