CuGenDBv2

Spg022273 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022273
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: beta glucosidase 12
Genome location: scaffold2:11805422..11812962
RNA-Seq Expression: Spg022273
Synteny: Spg022273
Gene Ontology terms:
  GO:0005975 - carbohydrate metabolic process (biological process)
  GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domains:
  IPR001360 - Glycoside hydrolase family 1
  IPR017853 - Glycoside hydrolase superfamily
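
The genome location above can be turned into a gene span with a few lines of Python. This is a minimal sketch, assuming the `seqid:start..end` string uses 1-based, inclusive coordinates (the page does not state its convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold2:11805422..11812962")
print(seqid, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 7,541 bp on scaffold2; with a half-open convention the figure would be one lower.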


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia] | e-value: 2.6e-29 | identity: 47.85%
Query:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM
        S GK  P T++E +SRAQ+YMSA E   SK+ E   KR+      S  DK Q +    R R  +  P  KFEKYT T VP E+VLMEI+N  LLK+P RM
Subjt:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM

Query:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE
         ++  +R K +YCLFHWDHGH+T++C  LK+E+E LI  GYLKE++ EPKA    E+D+   R+ RT  G    + R S RK K +
Subjt:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia] | e-value: 1.1e-22 | identity: 37.41%
Query:  STPSYGHTKTDLRNLIEEKRRSARTAESEARAAEAEARAVEAEARAAEAEARLAEARAAEAEARLAEAEAKKDSLPWKTELLNALKELGNPQGDLPKLLN
        ST   G  +   R L   KR S  + +S ARA   +       +R       + +         +A    +K  +   T+ ++ L  +   + +   L  
Subjt:  STPSYGHTKTDLRNLIEEKRRSARTAESEARAAEAEARAVEAEARAAEAEARLAEARAAEAEARLAEAEAKKDSLPWKTELLNALKELGNPQGDLPKLLN

Query:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM
        S GK  P T++E +SRAQ+YMSA E   SK+ E + KR+      S  DK Q +    R R  +  P  KFEKYTPT VP E+VLMEI++  LLK+P RM
Subjt:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM

Query:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE
        K++  +R K +YCLFH DHGH+T++C  LK+E+E LI+ GYLKE++ EPKA    E+D+   R+ RT  G    + R S RK K +
Subjt:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE

XP_022159192.1 uncharacterized protein LOC111025612 [Momordica charantia] | e-value: 1.9e-27 | identity: 46.33%
Query:  SIGKSQPRTYAEFVSRAQKYMSAEEL------LKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLL
        S GK  P T+AE +SRAQKYMSAEE       L+ K+++++ +RS     +S+ +K+ R   G +  P       KFEKYTPT VP E+VLMEI++  LL
Subjt:  SIGKSQPRTYAEFVSRAQKYMSAEEL------LKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLL

Query:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGE----PKAEADQRWP-RQRRTPTG
        K+P  MK+ P++R K +YCLFH DHGH+T +C  LK+E+E LI+ GYLKE++ +    P  E D + P R+ RT  G
Subjt:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGE----PKAEADQRWP-RQRRTPTG

XP_030941654.1 uncharacterized protein LOC115966587 [Quercus lobata] | e-value: 6.3e-23 | identity: 39.33%
Query:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAH--PFGKFEKYTPTAVPQEKVLMEIRNTGLL
        K L S+ K+ P+T A+ + RA KYM+AE+ L +++ E+  KR    D   R+D+ ++    G  R +R    P G+F  +TP   P ++VLM+I++ G L
Subjt:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAH--PFGKFEKYTPTAVPQEKVLMEIRNTGLL

Query:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKAE-ADQRWPRQR----RTPTGD
         FPG++K +P++R + +YC FH DHGH T NC  LK +IEALI+ G L++F+   K +  +++ PR+     R P GD
Subjt:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKAE-ADQRWPRQR----RTPTGD

XP_030959011.1 uncharacterized protein LOC115980955 [Quercus lobata] | e-value: 2.2e-23 | identity: 38.07%
Query:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKF
        K L S+ K+ P+T  + + RA KYM+AE+ L +++ ER  KR    D    + +K       R       P G+F  +TP   P ++VLM+I++ G L F
Subjt:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKF

Query:  PGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGE-----PKAEADQRWPRQRRTPTGD
        PG+++  P++R + +YC FHWDHGH T NC  LK +IEALI+ G L+ F+       P+ +A +R   + R P GD
Subjt:  PGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGE-----PKAEADQRWPRQRRTPTGD

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CNT2 uncharacterized protein LOC111012805 | e-value: 1.3e-29 | identity: 47.85%
Query:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM
        S GK  P T++E +SRAQ+YMSA E   SK+ E   KR+      S  DK Q +    R R  +  P  KFEKYT T VP E+VLMEI+N  LLK+P RM
Subjt:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM

Query:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE
         ++  +R K +YCLFHWDHGH+T++C  LK+E+E LI  GYLKE++ EPKA    E+D+   R+ RT  G    + R S RK K +
Subjt:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE

A0A6J1DWY0 uncharacterized protein LOC111025293 | e-value: 5.2e-23 | identity: 37.41%
Query:  STPSYGHTKTDLRNLIEEKRRSARTAESEARAAEAEARAVEAEARAAEAEARLAEARAAEAEARLAEAEAKKDSLPWKTELLNALKELGNPQGDLPKLLN
        ST   G  +   R L   KR S  + +S ARA   +       +R       + +         +A    +K  +   T+ ++ L  +   + +   L  
Subjt:  STPSYGHTKTDLRNLIEEKRRSARTAESEARAAEAEARAVEAEARAAEAEARLAEARAAEAEARLAEAEAKKDSLPWKTELLNALKELGNPQGDLPKLLN

Query:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM
        S GK  P T++E +SRAQ+YMSA E   SK+ E + KR+      S  DK Q +    R R  +  P  KFEKYTPT VP E+VLMEI++  LLK+P RM
Subjt:  SIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRM

Query:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE
        K++  +R K +YCLFH DHGH+T++C  LK+E+E LI+ GYLKE++ EPKA    E+D+   R+ RT  G    + R S RK K +
Subjt:  KSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKA----EADQRWPRQRRTPTGDQDHLWRTSWRKFKQE

A0A6J1DZ52 uncharacterized protein LOC111025612 | e-value: 9.1e-28 | identity: 46.33%
Query:  SIGKSQPRTYAEFVSRAQKYMSAEEL------LKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLL
        S GK  P T+AE +SRAQKYMSAEE       L+ K+++++ +RS     +S+ +K+ R   G +  P       KFEKYTPT VP E+VLMEI++  LL
Subjt:  SIGKSQPRTYAEFVSRAQKYMSAEEL------LKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLL

Query:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGE----PKAEADQRWP-RQRRTPTG
        K+P  MK+ P++R K +YCLFH DHGH+T +C  LK+E+E LI+ GYLKE++ +    P  E D + P R+ RT  G
Subjt:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGE----PKAEADQRWP-RQRRTPTG

A0A7N2LNH8 Ribonuclease H | e-value: 4.4e-22 | identity: 41.34%
Query:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPD-RAHPF-GKFEKYTPTAVPQEKVLMEIRNTGLL
        K L S+ K+ P+T +E + RA KYM+AE+ L S++ +R  KR    D  SR+D+ ++    G  R D R  P  G+F  +TP   P ++VLM+I++ G L
Subjt:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPD-RAHPF-GKFEKYTPTAVPQEKVLMEIRNTGLL

Query:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIG-----EPKAEADQRWPRQR-RTPTGD
         FPG++KS+P +R + +YC FH DHGH T +C  LK +IEALI+ G L++F+      +P  E   R   +R R P GD
Subjt:  KFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIG-----EPKAEADQRWPRQR-RTPTGD

A0A7N2N9G0 Reverse transcriptase | e-value: 1.2e-22 | identity: 39.33%
Query:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPF-GKFEKYTPTAVPQEKVLMEIRNTGLLK
        K L S+ K+ P+T +E + RA KYM+AE+ L +++ E+  KR    D N +   +++T  G R    R  P  G+F  +TP   P ++VLM+I++   L 
Subjt:  KLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSEREHKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPF-GKFEKYTPTAVPQEKVLMEIRNTGLLK

Query:  FPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKAE--ADQRWPR----QRRTPTGD
        FPG++KS+P++R + +YC FH DHGH T +C  LK +IEALI+ G L+ F+ + +A+  A  + PR    + R P GD
Subjt:  FPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEFIGEPKAE--ADQRWPR----QRRTPTGD

SwissProt top hits (e-value, % identity, alignment)
B8AVF0 Beta-glucosidase 12 | e-value: 6.7e-12 | identity: 68.89%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        E+I D SNGDVA DSYH YKEDV +MK++G D YRFSI+W+R+LP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
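
The % identity reported for a hit can be cross-checked against its alignment. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and an ungapped alignment such as this one; `midline_identity` is a hypothetical helper name used only for illustration.

```python
def midline_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline are counted as identical positions;
    '+' and spaces are counted as non-identical.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length


# Query row and midline row of the B8AVF0 alignment shown above.
query = "ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP"
midline = "E+I D SNGDVA DSYH YKEDV +MK++G D YRFSI+W+R+LP"

print(round(midline_identity(midline, len(query)), 2))
```

For this hit the midline carries 31 identical positions over 45 aligned residues, which gives the listed 68.89.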

P26205 Cyanogenic beta-glucosidase (Fragment) | e-value: 8.8e-12 | identity: 66.67%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        E+I D +NGDVA+D YHRYKED+ IMK++  D YRFSI+W RVLP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

Q5Z9Z0 Beta-glucosidase 24 | e-value: 8.8e-12 | identity: 66.67%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        E+I + SNGD+A+DSYHRYKEDV IMK LG + YRFS++W R+LP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

Q7XKV2 Beta-glucosidase 13 | e-value: 1.2e-11 | identity: 68.89%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        ++I D SNGDVA DSYH YKEDV IMK++G D YRFSI+W+R+LP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

Q7XKV4 Beta-glucosidase 12 | e-value: 6.7e-12 | identity: 68.89%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        E+I D SNGDVA DSYH YKEDV +MK++G D YRFSI+W+R+LP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

Arabidopsis top hits (e-value, % identity, alignment)
AT1G26560.1 beta glucosidase 40 | e-value: 5.3e-12 | identity: 63.64%
Query:  RIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        +I D SN DVA+D YHRY+EDV +MKN+G D YRFSI+W+R+ P
Subjt:  RIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

AT2G25630.1 beta glucosidase 14 | e-value: 3.4e-11 | identity: 62.22%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        E+I D SNG +A DSYH YKEDV ++  +GF+ YRFSI+WSR+LP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

AT2G44450.1 beta glucosidase 15 | e-value: 3.4e-11 | identity: 64.44%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        E+I D SNG VA +SYH YKEDVA++  +GF+ YRFSI+WSR+LP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

AT5G36890.1 beta glucosidase 42 | e-value: 4.5e-11 | identity: 65.91%
Query:  RIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        +I D SNGDVA+D YHRYKEDV ++  LGF  YRFSI+WSR+ P
Subjt:  RIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP

AT5G42260.1 beta glucosidase 12 | e-value: 5.3e-12 | identity: 64.44%
Query:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP
        E+I D SNG +A DSYH YKEDV ++  +GFD YRFSI+WSR+LP
Subjt:  ERIPDHSNGDVALDSYHRYKEDVAIMKNLGFDVYRFSIAWSRVLP


Sequences
CDS sequence
ATGCCTCTAGCCGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCACGAGCAGTTGGCGCCATCTGTGGGGAAGAAAGCTTGCC
AAATCTGCATATCGGTCATTCCATGAGTAAGGGGATGGAGAAGATAAATCAAGACATAAACGCAGAAAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGGGAAGAAG
GCCGAGATCGGCCTCGGATCGAATCTCCTCGTCCTCGGTCTCCACTGCCCTCATCCCGAGAGAAGCAAGCTGATCTAAAATTTGATGCTCTCGAAAACAAAGTAAGTGCG
ATGGATCATAATTTGTCCAGGATACTTCGTATCTTGGATAGAGCTGGTCCTAGCACTAAAACCCCTGATGAGAGGTTGGTTAGGGATCCGAGGAAGGGGAAGGAGCCCAT
GGAGCACACTCCAGAATCGGAGACGAGATCGAAGGGAAAGAAGACTAGCAGCATGACCAGCAAGATCAGGGGGCTCAAGCCTACTGGTCGTACAATCTTGAGGAGTCCAG
AGTCAAGCACATTTAGGGGACGTGACTACACAGTTTCTACCCCAAGCTATGGTCATACTAAGACAGACCTGAGGAATTTGATCGAGGAGAAGCGCAGGAGTGCCAGAACT
GCCGAATCCGAGGCCAGAGCTGCCGAAGCTGAAGCCAGGGCTGTTGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCAGAGCAGCCGAGGCCGA
GGCTAGGTTGGCCGAGGCCGAGGCCAAGAAGGACAGCCTCCCTTGGAAGACTGAGCTTCTTAACGCACTAAAGGAGCTTGGAAATCCACAGGGAGACCTGCCTAAACTGC
TCAATTCAATAGGTAAGAGCCAACCTCGAACCTATGCGGAGTTTGTCTCTCGGGCACAAAAATACATGAGCGCAGAGGAGTTGCTCAAGTCAAAGAAGTCAGAGCGTGAA
CACAAGAGGTCTTCTTCATCTGACCACAACAGTAGGAAGGATAAGAAGCAGCGGACCGACGAGGGAGGCCGAGGCCGACCAGACCGAGCACATCCCTTTGGTAAGTTCGA
GAAATATACGCCAACAGCTGTTCCACAGGAGAAAGTACTGATGGAGATCCGAAATACGGGACTCCTGAAATTCCCGGGGAGGATGAAGTCGAATCCTGATAGAAGAGACA
AGAGCCAGTATTGCCTTTTCCACTGGGACCATGGACATTCAACTAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATCTGAAGGAGTTC
ATCGGTGAGCCCAAGGCCGAGGCCGACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGGACCATCTTTGGAGGACCAGCTGGAGGAAGTTCAAGCAGGA
AGAGGAAAGCTATTGTCAGGGAAGCAATGTTCGAACCGGAGTATCGAGCCCTATTGATTCACCATTAAAGTTCAGTATTCAGGAGTGGCACAAGACAACCGATTCAAGCA
TCACGAAACGCGAGCCTGGACCGCCGAGTCTACCTCGCCCCATCGGCTTGAGACAATGTGAGCCTGGACCTTGGGCGAGCCTGGACTACCGAGTCTACCTCGCCTCATCC
AGATTGAGACTATGTGAGCCTGGACCACGGAGTCTACCTCACCTCATCTGGCTTGAGCATCCTCACATTGGGCGAGCCCGGACTACCGAGTCTACCTCGCCTCATCCAAA
CTTGAGATTGCATGAGCCTGGACCACCGAAGTCTACCTCACCCAAGTCTACCTCACCTCAGAGCTCAGACACTTGCTCAAGCACTTCATGTAAGTCAAGCTCACCTGGCG
GAGGCCGAGACCAAGCACCTCTTGCCAAAGCCGAGCACCTCTTGCCGAGGCCGAGCACAAACTTTAAAGGAAATTTTGGACCACCGGACGCACAAGGAGCTGACGAGGAC
GTCCGGGCGAAAATAGGGCTAGGAGACCGAGCTCGAGGAAGAACCGACCAAAGGGCCGGGCCAACTTGGCCCGACCCATATGGTCGGCCTCGGCCCAAGGCCGAGGCCTA
CCATTCGGCCCGCTTGCGCGGGCCGAGCTCGGTCACCTCCTCTCGGTCCCTGATGCCTCTAGCCGCCCCGGTTTCGCCTGGTTTGTCCCGAAACACCTCCGAATTCCTAA
AAACCCTAGGAGCACGAGCAGAGAGAATCCCAGATCATAGCAATGGAGATGTTGCCCTTGATTCATATCATCGATACAAGGAAGATGTTGCCATTATGAAGAACTTGGGC
TTTGATGTATACAGATTTTCGATAGCTTGGTCAAGAGTTTTGCCA
mRNA sequence
ATGCCTCTAGCCGCCCCGGTTTCGCCTGGTTTGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCACGAGCAGTTGGCGCCATCTGTGGGGAAGAAAGCTTGCC
AAATCTGCATATCGGTCATTCCATGAGTAAGGGGATGGAGAAGATAAATCAAGACATAAACGCAGAAAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGGGAAGAAG
GCCGAGATCGGCCTCGGATCGAATCTCCTCGTCCTCGGTCTCCACTGCCCTCATCCCGAGAGAAGCAAGCTGATCTAAAATTTGATGCTCTCGAAAACAAAGTAAGTGCG
ATGGATCATAATTTGTCCAGGATACTTCGTATCTTGGATAGAGCTGGTCCTAGCACTAAAACCCCTGATGAGAGGTTGGTTAGGGATCCGAGGAAGGGGAAGGAGCCCAT
GGAGCACACTCCAGAATCGGAGACGAGATCGAAGGGAAAGAAGACTAGCAGCATGACCAGCAAGATCAGGGGGCTCAAGCCTACTGGTCGTACAATCTTGAGGAGTCCAG
AGTCAAGCACATTTAGGGGACGTGACTACACAGTTTCTACCCCAAGCTATGGTCATACTAAGACAGACCTGAGGAATTTGATCGAGGAGAAGCGCAGGAGTGCCAGAACT
GCCGAATCCGAGGCCAGAGCTGCCGAAGCTGAAGCCAGGGCTGTTGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCAGAGCAGCCGAGGCCGA
GGCTAGGTTGGCCGAGGCCGAGGCCAAGAAGGACAGCCTCCCTTGGAAGACTGAGCTTCTTAACGCACTAAAGGAGCTTGGAAATCCACAGGGAGACCTGCCTAAACTGC
TCAATTCAATAGGTAAGAGCCAACCTCGAACCTATGCGGAGTTTGTCTCTCGGGCACAAAAATACATGAGCGCAGAGGAGTTGCTCAAGTCAAAGAAGTCAGAGCGTGAA
CACAAGAGGTCTTCTTCATCTGACCACAACAGTAGGAAGGATAAGAAGCAGCGGACCGACGAGGGAGGCCGAGGCCGACCAGACCGAGCACATCCCTTTGGTAAGTTCGA
GAAATATACGCCAACAGCTGTTCCACAGGAGAAAGTACTGATGGAGATCCGAAATACGGGACTCCTGAAATTCCCGGGGAGGATGAAGTCGAATCCTGATAGAAGAGACA
AGAGCCAGTATTGCCTTTTCCACTGGGACCATGGACATTCAACTAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATCTGAAGGAGTTC
ATCGGTGAGCCCAAGGCCGAGGCCGACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGGACCATCTTTGGAGGACCAGCTGGAGGAAGTTCAAGCAGGA
AGAGGAAAGCTATTGTCAGGGAAGCAATGTTCGAACCGGAGTATCGAGCCCTATTGATTCACCATTAAAGTTCAGTATTCAGGAGTGGCACAAGACAACCGATTCAAGCA
TCACGAAACGCGAGCCTGGACCGCCGAGTCTACCTCGCCCCATCGGCTTGAGACAATGTGAGCCTGGACCTTGGGCGAGCCTGGACTACCGAGTCTACCTCGCCTCATCC
AGATTGAGACTATGTGAGCCTGGACCACGGAGTCTACCTCACCTCATCTGGCTTGAGCATCCTCACATTGGGCGAGCCCGGACTACCGAGTCTACCTCGCCTCATCCAAA
CTTGAGATTGCATGAGCCTGGACCACCGAAGTCTACCTCACCCAAGTCTACCTCACCTCAGAGCTCAGACACTTGCTCAAGCACTTCATGTAAGTCAAGCTCACCTGGCG
GAGGCCGAGACCAAGCACCTCTTGCCAAAGCCGAGCACCTCTTGCCGAGGCCGAGCACAAACTTTAAAGGAAATTTTGGACCACCGGACGCACAAGGAGCTGACGAGGAC
GTCCGGGCGAAAATAGGGCTAGGAGACCGAGCTCGAGGAAGAACCGACCAAAGGGCCGGGCCAACTTGGCCCGACCCATATGGTCGGCCTCGGCCCAAGGCCGAGGCCTA
CCATTCGGCCCGCTTGCGCGGGCCGAGCTCGGTCACCTCCTCTCGGTCCCTGATGCCTCTAGCCGCCCCGGTTTCGCCTGGTTTGTCCCGAAACACCTCCGAATTCCTAA
AAACCCTAGGAGCACGAGCAGAGAGAATCCCAGATCATAGCAATGGAGATGTTGCCCTTGATTCATATCATCGATACAAGGAAGATGTTGCCATTATGAAGAACTTGGGC
TTTGATGTATACAGATTTTCGATAGCTTGGTCAAGAGTTTTGCCA
Protein sequence
MPLAAPVSPGLSRNASEFLKTLGARAVGAICGEESLPNLHIGHSMSKGMEKINQDINAENSDGDRHQRRSREEGRDRPRIESPRPRSPLPSSREKQADLKFDALENKVSA
MDHNLSRILRILDRAGPSTKTPDERLVRDPRKGKEPMEHTPESETRSKGKKTSSMTSKIRGLKPTGRTILRSPESSTFRGRDYTVSTPSYGHTKTDLRNLIEEKRRSART
AESEARAAEAEARAVEAEARAAEAEARLAEARAAEAEARLAEAEAKKDSLPWKTELLNALKELGNPQGDLPKLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKKSERE
HKRSSSSDHNSRKDKKQRTDEGGRGRPDRAHPFGKFEKYTPTAVPQEKVLMEIRNTGLLKFPGRMKSNPDRRDKSQYCLFHWDHGHSTRNCIQLKDEIEALIQNGYLKEF
IGEPKAEADQRWPRQRRTPTGDQDHLWRTSWRKFKQEEESYCQGSNVRTGVSSPIDSPLKFSIQEWHKTTDSSITKREPGPPSLPRPIGLRQCEPGPWASLDYRVYLASS
RLRLCEPGPRSLPHLIWLEHPHIGRARTTESTSPHPNLRLHEPGPPKSTSPKSTSPQSSDTCSSTSCKSSSPGGGRDQAPLAKAEHLLPRPSTNFKGNFGPPDAQGADED
VRAKIGLGDRARGRTDQRAGPTWPDPYGRPRPKAEAYHSARLRGPSSVTSSRSLMPLAAPVSPGLSRNTSEFLKTLGARAERIPDHSNGDVALDSYHRYKEDVAIMKNLG
FDVYRFSIAWSRVLP
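
The relationship between the CDS and protein sequences above can be spot-checked by translating the start of the CDS. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for this example, and only the first 30 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon.

    Trailing bases that do not fill a complete codon are ignored;
    stop codons are rendered as '*'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGCCTCTAGCCGCCCCGGTTTCGCCTGGT"
print(translate(cds_start))
```

The result, MPLAAPVSPG, matches the first ten residues of the protein sequence listed on this page.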