; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020897 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020897
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLEA_2 domain-containing protein
Genome locationtig00153577:120413..125921
RNA-Seq ExpressionSgr020897
SyntenySgr020897
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152938.1 NDR1/HIN1-like protein 10 [Momordica charantia]6.0e-4957.79Show/hide
Query:  TPSSKRSADG---QNRL-AKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIM
        T +SK S++G   QNR   KRTTI+RIIGR+MLG+IIL+GL++VINWLLIIPK P Y +E+  +T  +L DR LNAT++F+I++ NPNRRAAI+IDSM +
Subjt:  TPSSKRSADG---QNRL-AKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIM

Query:  TVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYC--VGVRLQIKTPTAPFENAKC
        TV Y+ QRF+STVPSF   PG QT L   VE +  SP G L  ++   G +++LRL AKIRY+IEKW+SKRR LE+YC  VG  L+I T T P +N KC
Subjt:  TVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYC--VGVRLQIKTPTAPFENAKC

XP_022964992.1 uncharacterized protein At1g08160-like [Cucurbita moschata]1.5e-4752.86Show/hide
Query:  MSSSTQPADHEQAPQQTPSSKRSADGQNR-LAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNP
        M S+TQ            + K S+ GQ    AKRT ++RIIGRS+L V+ LVGLA+VI WL++ PKTP   LE+G +TP+SL DRKLNA+ISF+I+S+NP
Subjt:  MSSSTQPADHEQAPQQTPSSKRSADGQNR-LAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNP

Query:  NRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKT
        N+RA+I++DSM MT+  + Q F +T+P+F QPPGNQT L+  VE  FI P GQ+K  +  +GL+ EL  SA + Y +EKW+SKRR LE+YC  VRL+I  
Subjt:  NRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKT

Query:  PTAPFENAKC
         T PF+N KC
Subjt:  PTAPFENAKC

XP_022970341.1 uncharacterized protein At1g08160-like [Cucurbita maxima]1.6e-4656.28Show/hide
Query:  QNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMTVSYIDQRFQSTVP
        Q+  AKRT ++RIIGRS+L V+ LVGLA+VI WL++ PKTP   LE+G +TP+SL DRKLNA+ISF+I+S+NPN+RA+I++DSM MT+  + Q F +T+P
Subjt:  QNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMTVSYIDQRFQSTVP

Query:  SFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKTPTAPFENAKC
        +F Q PGNQT L   VE  FI P GQ+K  +   GL+ EL  SA + Y +EKW+SKRR LE+YC  VRL+I   T PF+N KC
Subjt:  SFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKTPTAPFENAKC

XP_023520091.1 uncharacterized protein At1g08160-like [Cucurbita pepo subsp. pepo]1.5e-4752.86Show/hide
Query:  MSSSTQPADHEQAPQQTPSSKRSADG-QNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNP
        M S+TQ            + K S+ G Q+  AKRT ++RIIGRS+L V+ LVGLA+VI WL++ PKTP   LE+G +TP+SL DRKLNA+ISF+I+S+NP
Subjt:  MSSSTQPADHEQAPQQTPSSKRSADG-QNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNP

Query:  NRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKT
        N+RA+I++DSM MT+  + Q F +T+P+F QPPGNQT L+  VE  FI P GQ+K  +  +GL+ EL  SA + Y +EKW+SKRR LE+YC  VRL+I  
Subjt:  NRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKT

Query:  PTAPFENAKC
         T PF+N KC
Subjt:  PTAPFENAKC

XP_038895440.1 uncharacterized protein LOC120083674 [Benincasa hispida]6.2e-4648.73Show/hide
Query:  QQTPSSKRSADGQNR-LAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMT
        + + SSK S+  Q    AKRT ++RI GRS+LG++ILVG+ ++I WL++ PKTP  T+ESG + P+ L DRKL ATI+F+++S+NPN+RA I++DSM+M 
Subjt:  QQTPSSKRSADGQNR-LAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMT

Query:  VSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKTPTAPFENAKCT
        V+ + Q F S +P+F QPPGNQT   + ++  FI P G +K  ++ EG+  +LR SAK+ Y +++W+SK R+LE+YC G+RL+    T PF+N KCT
Subjt:  VSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKTPTAPFENAKCT

TrEMBL top hitse value%identityAlignment
A0A6J1DG76 NDR1/HIN1-like protein 102.9e-4957.79Show/hide
Query:  TPSSKRSADG---QNRL-AKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIM
        T +SK S++G   QNR   KRTTI+RIIGR+MLG+IIL+GL++VINWLLIIPK P Y +E+  +T  +L DR LNAT++F+I++ NPNRRAAI+IDSM +
Subjt:  TPSSKRSADG---QNRL-AKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIM

Query:  TVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYC--VGVRLQIKTPTAPFENAKC
        TV Y+ QRF+STVPSF   PG QT L   VE +  SP G L  ++   G +++LRL AKIRY+IEKW+SKRR LE+YC  VG  L+I T T P +N KC
Subjt:  TVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYC--VGVRLQIKTPTAPFENAKC

A0A6J1FBI3 uncharacterized protein LOC1114439282.0e-4246.7Show/hide
Query:  MSSSTQPADHEQAPQQT---PSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSF
        MSS+T   D   +   T   P  + S+ GQ+R   RT ++RIIGRSMLG+++LVGLA+V  WL++ PK P+++LE G +T +SL DRKLNA++SF IRSF
Subjt:  MSSSTQPADHEQAPQQT---PSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSF

Query:  NPNRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQI
        NPN++AAI+ID M+MT++ + ++F+  + +F Q PGN   L   +   F+ PL +L+  ++ +G+  EL LSA IRY I  W SKRRL+E+YC    L+I
Subjt:  NPNRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQI

Query:  KTPTAPFENAKC
           T P +N KC
Subjt:  KTPTAPFENAKC

A0A6J1HJ52 uncharacterized protein At1g08160-like7.1e-4852.86Show/hide
Query:  MSSSTQPADHEQAPQQTPSSKRSADGQNR-LAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNP
        M S+TQ            + K S+ GQ    AKRT ++RIIGRS+L V+ LVGLA+VI WL++ PKTP   LE+G +TP+SL DRKLNA+ISF+I+S+NP
Subjt:  MSSSTQPADHEQAPQQTPSSKRSADGQNR-LAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNP

Query:  NRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKT
        N+RA+I++DSM MT+  + Q F +T+P+F QPPGNQT L+  VE  FI P GQ+K  +  +GL+ EL  SA + Y +EKW+SKRR LE+YC  VRL+I  
Subjt:  NRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKT

Query:  PTAPFENAKC
         T PF+N KC
Subjt:  PTAPFENAKC

A0A6J1I064 uncharacterized protein LOC1114685192.9e-4145.75Show/hide
Query:  MSSSTQPADHEQAPQQT---PSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSF
        MSS+++  D   +   T   P  + S++GQ+R   RT ++RIIGRSMLG+++LVGLA+V  WL++ PK P+++LE G +T + L DRKLNA++SF IRSF
Subjt:  MSSSTQPADHEQAPQQT---PSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSF

Query:  NPNRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQI
        NPN++AAI+ID M+MT++ + ++F+  + +F Q PGN T L   +   F+ PL +L+  ++ +G+  EL LSA IRY I  W SK RL+E+YC    L+I
Subjt:  NPNRRAAINIDSMIMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQI

Query:  KTPTAPFENAKC
           T P +N KC
Subjt:  KTPTAPFENAKC

A0A6J1I0B9 uncharacterized protein At1g08160-like7.9e-4756.28Show/hide
Query:  QNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMTVSYIDQRFQSTVP
        Q+  AKRT ++RIIGRS+L V+ LVGLA+VI WL++ PKTP   LE+G +TP+SL DRKLNA+ISF+I+S+NPN+RA+I++DSM MT+  + Q F +T+P
Subjt:  QNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMTVSYIDQRFQSTVP

Query:  SFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKTPTAPFENAKC
        +F Q PGNQT L   VE  FI P GQ+K  +   GL+ EL  SA + Y +EKW+SKRR LE+YC  VRL+I   T PF+N KC
Subjt:  SFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKTPTAPFENAKC

SwissProt top hitse value%identityAlignment
Q8VZ13 Uncharacterized protein At1g081601.0e-1125.47Show/hide
Query:  QPADHEQAPQQTPSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSL--NDRKLNATISFSIRSFNPNRRA
        QPA   Q PQ  P S+  A           +  I+   +LG  +LVGLA++I +L + PK  IYT+E+ ++  +++  ND  +NA  S+ I+S+NP +  
Subjt:  QPADHEQAPQQTPSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSL--NDRKLNATISFSIRSFNPNRRA

Query:  AINIDSMIMTVSYIDQRF-QSTVPSFLQPPGNQTTLHAHVEAYFIS----PLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIK
        ++   SM ++ ++ +Q      +  F Q P N+T +   + ++ ++        L+    +  +++E+ ++A++ Y    + S+RR L+  C  V + + 
Subjt:  AINIDSMIMTVSYIDQRF-QSTVPSFLQPPGNQTTLHAHVEAYFIS----PLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIK

Query:  TPTAP-FENAKC
        + +   F+   C
Subjt:  TPTAP-FENAKC

Arabidopsis top hitse value%identityAlignment
AT1G08160.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.4e-1325.47Show/hide
Query:  QPADHEQAPQQTPSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSL--NDRKLNATISFSIRSFNPNRRA
        QPA   Q PQ  P S+  A           +  I+   +LG  +LVGLA++I +L + PK  IYT+E+ ++  +++  ND  +NA  S+ I+S+NP +  
Subjt:  QPADHEQAPQQTPSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSL--NDRKLNATISFSIRSFNPNRRA

Query:  AINIDSMIMTVSYIDQRF-QSTVPSFLQPPGNQTTLHAHVEAYFIS----PLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIK
        ++   SM ++ ++ +Q      +  F Q P N+T +   + ++ ++        L+    +  +++E+ ++A++ Y    + S+RR L+  C  V + + 
Subjt:  AINIDSMIMTVSYIDQRF-QSTVPSFLQPPGNQTTLHAHVEAYFIS----PLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIK

Query:  TPTAP-FENAKC
        + +   F+   C
Subjt:  TPTAP-FENAKC

AT1G61760.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.5e-0524.54Show/hide
Query:  TTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMTVSYIDQRFQST--VPSFLQP
        T + ++I    L +++ +G+   I W+ + P  P   +   +I+  S  D    + ISF I + NPN+   I  DSM  +V Y ++R  ST     F Q 
Subjt:  TTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSMIMTVSYIDQRFQST--VPSFLQP

Query:  PGNQTTLHAHVEAYFISPLGQLKGSIERE----GLDLELRLSAKIRYSIEKWSSKRRLLEVYC
        P N +++   +    ++        +ER+     +   L++ + IR+ +  W SK   +   C
Subjt:  PGNQTTLHAHVEAYFISPLGQLKGSIERE----GLDLELRLSAKIRYSIEKWSSKRRLLEVYC

AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.3e-0424.48Show/hide
Query:  MLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLN-DRKLNATISFSIRSFNPNRRAAINI-DSMIMTVSYIDQRFQS-TVPSFLQPPGNQTTLHA
        +L +++ VG ++ I +L+  PK P Y+++   +T ++LN D  L    + +I + NPN +  I   D   +TV Y++ +  + ++P F Q   N T ++ 
Subjt:  MLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLN-DRKLNATISFSIRSFNPNRRAAINI-DSMIMTVSYIDQRFQS-TVPSFLQPPGNQTTLHA

Query:  HVEAYFISPLGQLKGSIERE-----GLDLELRLSAKIRYSIEK
         +     +  G L+ ++E +      + L +R++  +R    K
Subjt:  HVEAYFISPLGQLKGSIERE-----GLDLELRLSAKIRYSIEK

AT4G01410.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.6e-0430.65Show/hide
Query:  ADHEQAP--QQTPSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLN---DRKLNATISFSIRSFNPNRR
        ADH+ AP    TP S     G      R    R I  ++  +++++G+  +I WL+  P  P  T+    I  Y LN      ++ ++ FS+ + NPNRR
Subjt:  ADHEQAP--QQTPSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLN---DRKLNATISFSIRSFNPNRR

Query:  AAINIDSMIMTVSYIDQRFQSTVP
         +I+ D + M V+Y DQ     +P
Subjt:  AAINIDSMIMTVSYIDQRFQSTVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCTCCACTCAACCAGCTGATCATGAACAAGCTCCACAGCAGACCCCATCATCTAAACGAAGCGCCGATGGCCAAAACAGGCTAGCAAAACGCACCACAATCAT
AAGGATCATAGGAAGAAGCATGCTGGGAGTAATAATCCTGGTGGGTCTGGCAGTGGTCATCAATTGGCTTCTCATTATCCCAAAAACTCCCATTTACACTCTTGAAAGCG
GCACCATCACACCCTACAGTTTAAACGACAGAAAACTCAACGCCACCATCAGTTTCAGCATCAGAAGCTTCAACCCCAACAGAAGAGCCGCCATTAACATTGACTCCATG
ATTATGACGGTGAGTTACATCGACCAGAGGTTTCAGTCCACCGTTCCGTCGTTCTTGCAGCCGCCGGGGAACCAGACGACCTTGCACGCCCACGTCGAAGCTTACTTCAT
ATCCCCACTCGGCCAGCTGAAGGGCTCCATAGAAAGGGAAGGCCTGGATCTGGAGCTTCGTCTCTCGGCCAAGATCAGGTACAGTATCGAAAAGTGGTCGTCGAAGCGTC
GGCTGCTGGAGGTTTATTGCGTCGGCGTCAGGCTTCAGATCAAAACTCCTACAGCACCCTTCGAAAATGCCAAATGCACAAAGAAACCCTATAGAATAAGCCAAACAAGG
ACTCTACCTCTCCCCTTCCAGTGGCTTCAAAGAAGCTTTGGTCTGAAAATTTCCTCTTGCTCACAACTTTCTGCTTTCTCGGCGTCGTCGGGCAGCTTCGTGCGGTGGGT
ATTCTGTGTCGGAGAGACGTCGGCGTTCGACAACTCTCTCCGCCGCCGCCATCCGCCACCACCTGAACGGTTTCTTCGCCGTCTTCCTCCTTGGACTGATCACCTTCTTC
CGGTGGCTTTTGAGTTTCTGGTGACTTTTGGTGATCGGGTTTGTGAAATTCTGACATGGCGTTGTGGTGATTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAGCTCCACTCAACCAGCTGATCATGAACAAGCTCCACAGCAGACCCCATCATCTAAACGAAGCGCCGATGGCCAAAACAGGCTAGCAAAACGCACCACAATCAT
AAGGATCATAGGAAGAAGCATGCTGGGAGTAATAATCCTGGTGGGTCTGGCAGTGGTCATCAATTGGCTTCTCATTATCCCAAAAACTCCCATTTACACTCTTGAAAGCG
GCACCATCACACCCTACAGTTTAAACGACAGAAAACTCAACGCCACCATCAGTTTCAGCATCAGAAGCTTCAACCCCAACAGAAGAGCCGCCATTAACATTGACTCCATG
ATTATGACGGTGAGTTACATCGACCAGAGGTTTCAGTCCACCGTTCCGTCGTTCTTGCAGCCGCCGGGGAACCAGACGACCTTGCACGCCCACGTCGAAGCTTACTTCAT
ATCCCCACTCGGCCAGCTGAAGGGCTCCATAGAAAGGGAAGGCCTGGATCTGGAGCTTCGTCTCTCGGCCAAGATCAGGTACAGTATCGAAAAGTGGTCGTCGAAGCGTC
GGCTGCTGGAGGTTTATTGCGTCGGCGTCAGGCTTCAGATCAAAACTCCTACAGCACCCTTCGAAAATGCCAAATGCACAAAGAAACCCTATAGAATAAGCCAAACAAGG
ACTCTACCTCTCCCCTTCCAGTGGCTTCAAAGAAGCTTTGGTCTGAAAATTTCCTCTTGCTCACAACTTTCTGCTTTCTCGGCGTCGTCGGGCAGCTTCGTGCGGTGGGT
ATTCTGTGTCGGAGAGACGTCGGCGTTCGACAACTCTCTCCGCCGCCGCCATCCGCCACCACCTGAACGGTTTCTTCGCCGTCTTCCTCCTTGGACTGATCACCTTCTTC
CGGTGGCTTTTGAGTTTCTGGTGACTTTTGGTGATCGGGTTTGTGAAATTCTGACATGGCGTTGTGGTGATTTATGA
Protein sequenceShow/hide protein sequence
MSSSTQPADHEQAPQQTPSSKRSADGQNRLAKRTTIIRIIGRSMLGVIILVGLAVVINWLLIIPKTPIYTLESGTITPYSLNDRKLNATISFSIRSFNPNRRAAINIDSM
IMTVSYIDQRFQSTVPSFLQPPGNQTTLHAHVEAYFISPLGQLKGSIEREGLDLELRLSAKIRYSIEKWSSKRRLLEVYCVGVRLQIKTPTAPFENAKCTKKPYRISQTR
TLPLPFQWLQRSFGLKISSCSQLSAFSASSGSFVRWVFCVGETSAFDNSLRRRHPPPPERFLRRLPPWTDHLLPVAFEFLVTFGDRVCEILTWRCGDL