; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016844 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016844
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLysM domain-containing protein
Genome locationtig00153010:1864484..1868742
RNA-Seq ExpressionSgr016844
SyntenySgr016844
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR018392 - LysM domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039254.1 uncharacterized protein E6C27_scaffold64G00450 [Cucumis melo var. makuwa]2.7e-10672.5Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT--HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIAN
        MEVK+ QRNRA RF     LLP PTLPSLT   R WA++K+S  NQ R I+LRW FQL D+SK QLSTKHHFVHI+EG+E+  SI  +NG   +SIVI N
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT--HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIAN

Query:  GKIVDTDLEHKGQDITIRNSRVIS---DIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE
         KIVDTDLE KGQ+I I+NSR +S   D +QL+EKLQS+LNGL+ YKKLF LAS  LPPARTTSFIVLVPL++FC RCIIGASYARVFGT +LK +NK E
Subjt:  GKIVDTDLEHKGQDITIRNSRVIS---DIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE

Query:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE
        GERH+FRSGHWRSALRDIRE D LD E  +DS SP+ DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGG+Q PE
Subjt:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE

XP_022141745.1 uncharacterized protein LOC111012032 isoform X1 [Momordica charantia]1.6e-11178.34Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV
        ME+K+SQRNRA+RF LLPKLLPQPTLPS THRTWA +KRS KNQF A+ALRW FQL+DI +DQ  TKHHFV IVEG      ETFTSI KQNGVSTHSIV
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV

Query:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE
        I N KI DTDLEHKGQD  IRN   I D+YQLQEKLQSSLNGL+NYKKLF   SP LPPARTTSFIVLVPLIVFCARCIIGASYARV  T +LKT++KSE
Subjt:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE

Query:  GERHEFRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        GE H+FRSGHWRSALRDIRE D LDSE+S D   S SP+ DEQISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  GERHEFRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

XP_022141746.1 uncharacterized protein LOC111012032 isoform X2 [Momordica charantia]6.6e-11378.83Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV
        ME+K+SQRNRA+RF LLPKLLPQPTLPS THRTWA +KRS KNQF A+ALRW FQL+DI +DQ  TKHHFV IVEG      ETFTSI KQNGVSTHSIV
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV

Query:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE
        I N KI DTDLEHKGQD  IRN   I D+YQLQEKLQSSLNGL+NYKKLF   SP LPPARTTSFIVLVPLIVFCARCIIGASYARV  T +LKT++KSE
Subjt:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE

Query:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        GE H+FRSGHWRSALRDIRE D LDSE+S D +SP+ DEQISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

XP_022141747.1 uncharacterized protein LOC111012032 isoform X3 [Momordica charantia]2.3e-11379.78Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIANGK
        ME+K+SQRNRA+RF LLPKLLPQPTLPS THRTWA +KRS KNQF A+ALRW FQL+DI +DQ  TKHHFV IVEG ETFTSI KQNGVSTHSIVI N K
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIANGK

Query:  IVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHE
        I DTDLEHKGQD  IRN   I D+YQLQEKLQSSLNGL+NYKKLF   SP LPPARTTSFIVLVPLIVFCARCIIGASYARV  T +LKT++KSEGE H+
Subjt:  IVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHE

Query:  FRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        FRSGHWRSALRDIRE D LDSE+S D   S SP+ DEQISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  FRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

XP_038890844.1 uncharacterized protein LOC120080288 isoform X1 [Benincasa hispida]1.5e-11778.99Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT-HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIANG
        MEVKLSQRNRA+RF L PKLLPQPTLPSLT HR WA++++SLKNQFRAI LRW FQL+DISK+QLSTKHH VHIVEGSE+ T  P QNG  THSI +AN 
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT-HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIANG

Query:  KIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERH
        +I DTDLE KGQ+I I+N R I D+YQL+EKLQS+LN LRNYKKLF LAS HLPPARTTSFIVLVPLIVFCARCIIGASYARVFGT RL+TV+K EG+ H
Subjt:  KIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERH

Query:  EFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE
        +FRSGHWRSALRDIRE D LD E+ +DS SP+EDEQIS EDLSH YKKLDQDYEKFLSECGLSKWGYWRGG+Q PE
Subjt:  EFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE

TrEMBL top hitse value%identityAlignment
A0A1S3CBV5 uncharacterized protein LOC103498697 isoform X25.0e-10672.14Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT--HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIAN
        MEVK+ QRNRA RF     LLP PTLPSLT   R WA++K+S  NQ R I+LRW FQL D+SK QLSTKHHFVHI+EG+E+  SI  +NG   +SIVI N
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT--HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIAN

Query:  GKIVDTDLEHKGQDITIRNSRVIS---DIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE
         KIVDTD E KGQ+I I+N RV+S   D +QL+EKLQ++LNGL+ YKKLF LAS  LPPARTTSFIVLVPL++FC RCIIGASYARVFGT +LK +NK E
Subjt:  GKIVDTDLEHKGQDITIRNSRVIS---DIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE

Query:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE
        GERH+FRSGHWRSALRDIRE D LD E  +DS SP+EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGG+Q PE
Subjt:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE

A0A5A7T6Z4 LysM domain-containing protein1.3e-10672.5Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT--HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIAN
        MEVK+ QRNRA RF     LLP PTLPSLT   R WA++K+S  NQ R I+LRW FQL D+SK QLSTKHHFVHI+EG+E+  SI  +NG   +SIVI N
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLT--HRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIAN

Query:  GKIVDTDLEHKGQDITIRNSRVIS---DIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE
         KIVDTDLE KGQ+I I+NSR +S   D +QL+EKLQS+LNGL+ YKKLF LAS  LPPARTTSFIVLVPL++FC RCIIGASYARVFGT +LK +NK E
Subjt:  GKIVDTDLEHKGQDITIRNSRVIS---DIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE

Query:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE
        GERH+FRSGHWRSALRDIRE D LD E  +DS SP+ DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGG+Q PE
Subjt:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE

A0A6J1CIZ2 uncharacterized protein LOC111012032 isoform X17.9e-11278.34Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV
        ME+K+SQRNRA+RF LLPKLLPQPTLPS THRTWA +KRS KNQF A+ALRW FQL+DI +DQ  TKHHFV IVEG      ETFTSI KQNGVSTHSIV
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV

Query:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE
        I N KI DTDLEHKGQD  IRN   I D+YQLQEKLQSSLNGL+NYKKLF   SP LPPARTTSFIVLVPLIVFCARCIIGASYARV  T +LKT++KSE
Subjt:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE

Query:  GERHEFRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        GE H+FRSGHWRSALRDIRE D LDSE+S D   S SP+ DEQISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  GERHEFRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

A0A6J1CK58 uncharacterized protein LOC111012032 isoform X23.2e-11378.83Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV
        ME+K+SQRNRA+RF LLPKLLPQPTLPS THRTWA +KRS KNQF A+ALRW FQL+DI +DQ  TKHHFV IVEG      ETFTSI KQNGVSTHSIV
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEG-----SETFTSIPKQNGVSTHSIV

Query:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE
        I N KI DTDLEHKGQD  IRN   I D+YQLQEKLQSSLNGL+NYKKLF   SP LPPARTTSFIVLVPLIVFCARCIIGASYARV  T +LKT++KSE
Subjt:  IANGKIVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSE

Query:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        GE H+FRSGHWRSALRDIRE D LDSE+S D +SP+ DEQISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  GERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

A0A6J1CKQ6 uncharacterized protein LOC111012032 isoform X31.1e-11379.78Show/hide
Query:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIANGK
        ME+K+SQRNRA+RF LLPKLLPQPTLPS THRTWA +KRS KNQF A+ALRW FQL+DI +DQ  TKHHFV IVEG ETFTSI KQNGVSTHSIVI N K
Subjt:  MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIANGK

Query:  IVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHE
        I DTDLEHKGQD  IRN   I D+YQLQEKLQSSLNGL+NYKKLF   SP LPPARTTSFIVLVPLIVFCARCIIGASYARV  T +LKT++KSEGE H+
Subjt:  IVDTDLEHKGQDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHE

Query:  FRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        FRSGHWRSALRDIRE D LDSE+S D   S SP+ DEQISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  FRSGHWRSALRDIREPDSLDSETSVD---SASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G09970.1 unknown protein1.6e-1125.42Show/hide
Query:  QFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIA--NGKIVDTDLEHKGQDITIRNSRVISDIYQLQ-----------EK
        +F+  + R  F ++ +S+++  TKH      + SE+   I KQ GVS  +   +  + ++ D D E K   +T   S VI D  ++            EK
Subjt:  QFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIA--NGKIVDTDLEHKGQDITIRNSRVISDIYQLQ-----------EK

Query:  LQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHEFRSGHWRSALRDIREP---DSLDSETSVDS
           ++    N         PHL          L+P++ FC  CIIG  +           +++   + H   S  WR+AL D  EP   D  DS +    
Subjt:  LQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHEFRSGHWRSALRDIREP---DSLDSETSVDS

Query:  ASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSK
         + T  E  + ++++ AY +++ +Y++FL ECG+ +
Subjt:  ASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSK

AT4G09970.2 unknown protein2.1e-1126.55Show/hide
Query:  RWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIA--NGKIVDTDLEHKGQDITIRNSRVISDIYQLQ-----------EKLQSSLNG
        +W F ++ +S+++  TKH      + SE+   I KQ GVS  +   +  + ++ D D E K   +T   S VI D  ++            EK   ++  
Subjt:  RWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIA--NGKIVDTDLEHKGQDITIRNSRVISDIYQLQ-----------EKLQSSLNG

Query:  LRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQIS
          N         PHL          L+P++ FC  CIIG  +           +++   + H   S  WR+AL D  EP + D     DS SP   E  +
Subjt:  LRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHEFRSGHWRSALRDIREPDSLDSETSVDSASPTEDEQIS

Query:  VEDLSHAYKKLDQDYEKFLSECGLSK
         ++++ AY +++ +Y++FL ECG+ +
Subjt:  VEDLSHAYKKLDQDYEKFLSECGLSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTGAAGCTGAGCCAGAGAAATAGAGCAGAACGTTTTTGTCTCCTCCCCAAGTTGCTCCCACAACCAACCCTCCCTTCTCTAACTCACAGAACTTGGGCTGACTC
CAAAAGATCATTGAAGAATCAATTCAGAGCCATTGCGCTGAGATGGACGTTTCAACTTCGGGATATATCCAAAGATCAACTCTCCACCAAGCACCACTTTGTTCATATTG
TCGAAGGAAGTGAGACCTTCACTTCGATTCCGAAGCAGAATGGAGTTTCCACACATTCTATTGTCATAGCTAATGGGAAGATAGTGGACACGGATCTAGAACACAAGGGG
CAGGATATCACGATTCGAAACTCTCGAGTGATTAGCGATATATATCAATTGCAAGAAAAGCTTCAAAGTTCTTTAAATGGACTTCGAAATTATAAAAAGCTTTTCACGCT
AGCCTCCCCTCATCTACCTCCTGCTAGAACCACTAGTTTTATAGTTTTGGTTCCTCTCATAGTATTTTGTGCCAGATGCATAATTGGTGCCTCCTATGCTAGAGTTTTCG
GAACATGGAGGCTTAAAACTGTTAATAAATCAGAGGGAGAACGTCACGAGTTCAGAAGTGGGCATTGGAGATCTGCTCTCCGCGATATAAGGGAACCGGACAGTTTGGAT
TCTGAGACATCTGTAGATTCTGCTAGTCCTACAGAAGATGAACAGATTTCAGTTGAAGATTTGTCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATC
AGAATGTGGACTGAGTAAATGGGGCTACTGGCGCGGGGGTTCCCAGAGTCCTGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTGAAGCTGAGCCAGAGAAATAGAGCAGAACGTTTTTGTCTCCTCCCCAAGTTGCTCCCACAACCAACCCTCCCTTCTCTAACTCACAGAACTTGGGCTGACTC
CAAAAGATCATTGAAGAATCAATTCAGAGCCATTGCGCTGAGATGGACGTTTCAACTTCGGGATATATCCAAAGATCAACTCTCCACCAAGCACCACTTTGTTCATATTG
TCGAAGGAAGTGAGACCTTCACTTCGATTCCGAAGCAGAATGGAGTTTCCACACATTCTATTGTCATAGCTAATGGGAAGATAGTGGACACGGATCTAGAACACAAGGGG
CAGGATATCACGATTCGAAACTCTCGAGTGATTAGCGATATATATCAATTGCAAGAAAAGCTTCAAAGTTCTTTAAATGGACTTCGAAATTATAAAAAGCTTTTCACGCT
AGCCTCCCCTCATCTACCTCCTGCTAGAACCACTAGTTTTATAGTTTTGGTTCCTCTCATAGTATTTTGTGCCAGATGCATAATTGGTGCCTCCTATGCTAGAGTTTTCG
GAACATGGAGGCTTAAAACTGTTAATAAATCAGAGGGAGAACGTCACGAGTTCAGAAGTGGGCATTGGAGATCTGCTCTCCGCGATATAAGGGAACCGGACAGTTTGGAT
TCTGAGACATCTGTAGATTCTGCTAGTCCTACAGAAGATGAACAGATTTCAGTTGAAGATTTGTCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATC
AGAATGTGGACTGAGTAAATGGGGCTACTGGCGCGGGGGTTCCCAGAGTCCTGAATAG
Protein sequenceShow/hide protein sequence
MEVKLSQRNRAERFCLLPKLLPQPTLPSLTHRTWADSKRSLKNQFRAIALRWTFQLRDISKDQLSTKHHFVHIVEGSETFTSIPKQNGVSTHSIVIANGKIVDTDLEHKG
QDITIRNSRVISDIYQLQEKLQSSLNGLRNYKKLFTLASPHLPPARTTSFIVLVPLIVFCARCIIGASYARVFGTWRLKTVNKSEGERHEFRSGHWRSALRDIREPDSLD
SETSVDSASPTEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGSQSPE