; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039720 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039720
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionfructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X1
Genome locationscaffold10:42445209..42461798
RNA-Seq ExpressionSpg039720
SyntenySpg039720
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602353.1 [Fructose-bisphosphate aldolase]-lysine N-methyltransferase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.7e-9069.34Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK
        MLLGTRFTN+WRW TS  IST +AFHFN+   TS QLE   S            TD  FLPWLERKS T+ISS LSIGKS +GR LFAS+TIRAGD ILK
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK

Query:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL
        VPFNVQISPD LPSPIRDLLGDEIGNVAK+A+V+LLEQKLG                     IFW+E ELEMIRKS LYEESLNQRSQIEREFLAI++AL
Subjt:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL

Query:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
        ETFPEI+DSINCD+FMHAYALVTSRAWR TKG SLIPFADFLNHDGAS+++           VIADRDYAPGEH
Subjt:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

XP_008463512.1 PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic isoform X2 [Cucumis melo]1.6e-8869.92Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQIS
        MLLG R  NIWRW+TS  +ST  AF+FNSH ST    +EL+    S D  FLPWLE+K+ TKISS LSIGKSS+GR LFAS+TIRAGDCILKVPFNVQIS
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQIS

Query:  PDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETFPEIID
        PD LP PIRDLLG+EIGNVAKLAVV+LLEQKLG                     IFW ESELEMIRKSFLYEESLNQRSQI+REF AIR+ALE FPEIID
Subjt:  PDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETFPEIID

Query:  SINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
         I+CD+FMHAYALVTSRAWR T+GVSLIPFADFLNH+ ASEAM           V+ADRDYAPGEH
Subjt:  SINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

XP_011655345.1 actin-histidine N-methyltransferase [Cucumis sativus]5.4e-8969.92Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQIS
        M LG R  NIWRW+TS  + T  AF+FNSH STS   +EL+    S D  FLPWLERK+ TKISS LSIGKSS+GR LFAS+TIRAGDCILKVPFNVQIS
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQIS

Query:  PDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETFPEIID
        PD+LP PIRDLLG+EIGNVAKLAVV+LLE KLG                     IFW ESELEMIRKS LYEESLNQRSQI+REFLAIR+ALE FPEIID
Subjt:  PDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETFPEIID

Query:  SINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
         I+CD+FMHAYALVTSRAWR T+GVSLIPFADFLNHDGASEAM           V+ADRD+APGEH
Subjt:  SINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

XP_022953334.1 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X1 [Cucurbita moschata]8.9e-9269.71Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK
        MLLGTRFTN+WRW TS  IST +AFHFN+   TS QLE   S            TD  FLPWLERKS T+ISS LSIGKS +GR LFAS+TIRAGDCILK
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK

Query:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL
        VPFNVQISPD LPSPIRDLLGDEIGNVAK+A+V+LLEQKLG                     IFW+E ELEMIRKS LYEESLNQRSQIEREFLAI++AL
Subjt:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL

Query:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
        ETFPEI+DSINCD+FMHAYALVTSRAWR TKG SLIPFADFLNHDGAS+++           VIADRDYAPGEH
Subjt:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

XP_023534717.1 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]5.7e-9169.34Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLE------------ELHSTDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK
        MLLGTRFTN+WRW TS  IST +AFHFN+   TS QLE                TD  FLPWLERKS T+ISS LSIGKS +GR LFAS+TIRAGDCILK
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLE------------ELHSTDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK

Query:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL
        VPF+VQISPD LPSPIRDLLGDEIGNVAK+A+V+LLEQKLG                     IFW+E ELEMIRKS LYEESLNQRSQIEREFLAI++AL
Subjt:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL

Query:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
        ETFPEIIDSINCD+FMHAYALVTSRAWR TKG SLIPFADFLNHDGAS+++           VIADRDYAPGEH
Subjt:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

TrEMBL top hitse value%identityAlignment
A0A1S3CJF6 histone-lysine N-methyltransferase setd3 isoform X17.1e-8768.38Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNV---
        MLLG R  NIWRW+TS  +ST  AF+FNSH ST    +EL+    S D  FLPWLE+K+ TKISS LSIGKSS+GR LFAS+TIRAGDCILKVPFNV   
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNV---

Query:  ---QISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALET
           QISPD LP PIRDLLG+EIGNVAKLAVV+LLEQKLG                     IFW ESELEMIRKSFLYEESLNQRSQI+REF AIR+ALE 
Subjt:  ---QISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALET

Query:  FPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
        FPEIID I+CD+FMHAYALVTSRAWR T+GVSLIPFADFLNH+ ASEAM           V+ADRDYAPGEH
Subjt:  FPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

A0A1S3CJW1 ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic isoform X27.6e-8969.92Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQIS
        MLLG R  NIWRW+TS  +ST  AF+FNSH ST    +EL+    S D  FLPWLE+K+ TKISS LSIGKSS+GR LFAS+TIRAGDCILKVPFNVQIS
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQIS

Query:  PDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETFPEIID
        PD LP PIRDLLG+EIGNVAKLAVV+LLEQKLG                     IFW ESELEMIRKSFLYEESLNQRSQI+REF AIR+ALE FPEIID
Subjt:  PDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETFPEIID

Query:  SINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
         I+CD+FMHAYALVTSRAWR T+GVSLIPFADFLNH+ ASEAM           V+ADRDYAPGEH
Subjt:  SINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

A0A1S3CL08 histone-lysine N-methyltransferase setd3 isoform X37.1e-8768.38Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNV---
        MLLG R  NIWRW+TS  +ST  AF+FNSH ST    +EL+    S D  FLPWLE+K+ TKISS LSIGKSS+GR LFAS+TIRAGDCILKVPFNV   
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELH----STDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNV---

Query:  ---QISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALET
           QISPD LP PIRDLLG+EIGNVAKLAVV+LLEQKLG                     IFW ESELEMIRKSFLYEESLNQRSQI+REF AIR+ALE 
Subjt:  ---QISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALET

Query:  FPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
        FPEIID I+CD+FMHAYALVTSRAWR T+GVSLIPFADFLNH+ ASEAM           V+ADRDYAPGEH
Subjt:  FPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

A0A6J1GPC6 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X14.3e-9269.71Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK
        MLLGTRFTN+WRW TS  IST +AFHFN+   TS QLE   S            TD  FLPWLERKS T+ISS LSIGKS +GR LFAS+TIRAGDCILK
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK

Query:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL
        VPFNVQISPD LPSPIRDLLGDEIGNVAK+A+V+LLEQKLG                     IFW+E ELEMIRKS LYEESLNQRSQIEREFLAI++AL
Subjt:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL

Query:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
        ETFPEI+DSINCD+FMHAYALVTSRAWR TKG SLIPFADFLNHDGAS+++           VIADRDYAPGEH
Subjt:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

A0A6J1JPU4 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X11.6e-8667.52Show/hide
Query:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK
        MLLGTRFTN+WRW TS  I T ++F  N+   TS QLE   S            TD  FLPWLERKS T ISS LSIGKS +GR LFAS+TIRAGDCILK
Subjt:  MLLGTRFTNIWRWRTSSAIST-YAFHFNSHCSTSLQLEELHS------------TDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILK

Query:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL
        VPFNVQISPD LPSPIRDLLGDEIGNVAK+A+V+LLEQKLG                     IFW+E ELEMIRK  L+EESLNQRSQIEREFLAI++AL
Subjt:  VPFNVQISPDTLPSPIRDLLGDEIGNVAKLAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRAL

Query:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH
        ETFPEIIDSIN D+FMHAYALVTSRAWR TKG SLIPFADFLNHDGAS+++           VIADRDYAPGEH
Subjt:  ETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAM-----------VIADRDYAPGEH

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657508.3e-0834.91Show/hide
Query:  KYGGLGIGSLRQRNNALLLKWLWRFMQEVNALWRKVIASIYGCDPL---GWMTRPPKGALKACPWVEIAKNSALFVNF-IHFKANCGRRIRFWHDRWVGN
        K GGLG+ + +  N AL+ K  WR +QE N+LW  V+   Y    +    W+   PKG+  +  W  IA      V+  + +    G++IRFW DRWV  
Subjt:  KYGGLGIGSLRQRNNALLLKWLWRFMQEVNALWRKVIASIYGCDPL---GWMTRPPKGALKACPWVEIAKNSALFVNF-IHFKANCGRRIRFWHDRWVGN

Query:  SSLEEV
          L E+
Subjt:  SSLEEV

Arabidopsis top hitse value%identityAlignment
AT3G55080.1 SET domain-containing protein7.3e-5246.09Show/hide
Query:  ISTYAFHFNSHCSTSLQLEELHSTDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQISPDTLPSPIRDLLGDEIGNVAKLA
        +S+ A  F+     +L+L+   S D  FLPWLER +G KI++ LSIGKS+ GRSLFAS  I AGDC+LKVPFN QI+PD LPS IR LL +E+GN+  LA
Subjt:  ISTYAFHFNSHCSTSLQLEELHSTDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQISPDTLPSPIRDLLGDEIGNVAKLA

Query:  VVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETF-PEIIDSINCDNFMHAYALVTSRAWRWT
         VL+ E+K+G                     IFW E EL MIR S +++E++ Q++QIE++F  + +A +   P + +  + ++FM+AYALV SRAW  +
Subjt:  VVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETF-PEIIDSINCDNFMHAYALVTSRAWRWT

Query:  KGVSLIPFADFLNHDGASEAMVI-----------ADRDYAPGE
        K +SLIPFADF+NHDG S ++V+           ADR+Y+PG+
Subjt:  KGVSLIPFADFLNHDGASEAMVI-----------ADRDYAPGE

AT3G55080.2 SET domain-containing protein1.1e-2039.57Show/hide
Query:  LAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETF-PEIIDSINCDNFMHAYALVTSRAWR
        LA VL+ E+K+G                     IFW E EL MIR S +++E++ Q++QIE++F  + +A +   P + +  + ++FM+AYALV SRAW 
Subjt:  LAVVLLLEQKLG--------------------PIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETF-PEIIDSINCDNFMHAYALVTSRAWR

Query:  WTKGVSLIPFADFLNHDGASEAMVIADRDYAPGEHEKLE
         +K +SLIPFADF+NHDG S ++V+ D D    E   L+
Subjt:  WTKGVSLIPFADFLNHDGASEAMVIADRDYAPGEHEKLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATTGGGAACTCGATTCACCAATATATGGCGATGGCGGACATCTTCAGCAATTTCTACTTATGCTTTCCACTTCAATAGCCACTGCTCAACCTCTTTACAACTTGA
GGAACTACATTCAACTGACTGTGCATTTTTACCATGGTTGGAGCGAAAATCAGGAACAAAGATCTCATCAGGGCTTTCTATTGGGAAGTCTTCCATGGGCAGGTCTCTGT
TTGCTTCTGACACTATACGCGCTGGGGATTGCATTTTAAAGGTTCCTTTCAATGTGCAAATTTCACCTGATACTCTTCCTTCACCCATTAGAGATCTCTTAGGCGATGAG
ATTGGAAATGTTGCCAAGCTCGCTGTTGTACTTCTTCTTGAACAGAAACTGGGTCCAATATTTTGGAACGAAAGTGAGTTGGAGATGATTCGTAAAAGCTTTTTGTATGA
GGAATCACTTAATCAAAGATCACAAATTGAAAGGGAATTTTTGGCAATCAGGCGAGCTCTGGAAACCTTCCCTGAAATTATTGATAGCATCAATTGCGACAATTTCATGC
ATGCTTATGCCCTTGTTACTTCTAGAGCATGGAGATGGACAAAGGGTGTCTCTCTGATTCCATTTGCAGACTTTTTAAATCATGATGGTGCTTCAGAAGCAATGGTCATT
GCTGATCGTGATTATGCCCCTGGTGAACATGAAAAGCTGGAATCCAGATTTCACATGGGTGCTAATAAGATACGGAAAATATTCAAATGCTTCATTGATGTTGGACTTTG
GGTTTGCGCTTCCATACAACATTCACGATCAGGGCTTTCAGAAAAATCTAGTGGGGGCAGAGTTTCTAGAAGTATGTGGAGGTTTAATAGGCTGATTGATGACCTTGAGC
TCATGGATATTCCGATGAACAATGGTAAATTTACTTGGTGGAATAAGGAAAGCTTCGGGTTTATTGAGGATGAGAAGAGAAAAATTCAAAAAGAGATTGATGACATTGAT
AGCGGTGAAGAAGAGGGAGTGTTGAGCAAGGATTTAATTAATAGGTGCAATGAGCTAAAAGGGAATCTGGCAGAGCTTGTTCGGAAGGAACAACGAATGTGGAACAAAAA
ATGCAAATTTCAATGGCTGAAGGAGGGCGACGAAAATACTAGTTTCTTTCATCGATGGGCCAATAGTAGGAGAAACAGGGCTTTCATTTCTTCCATTGAAGATGATAATG
GGAACTTTTGGACTAATATGAATGATATTGAAGAAGCGGGACTCTCTCTTAATGTTTCAAAGACGGTTCTTGTGGGCTTAAATGTGGATGCAAATGTTCTTAATCAGAAG
GCTAATTTGATTGGTTGTGAGGTTGAATCTTTGCCTATTACCTACTTAGGGATTCCTTTGGGGGGAATTTTCCATTCGACTGTGTTATGGGAACCAATGGTGGATAAATT
TCGTGCTAAGTATGGGGGGTTGGGAATTGGATCCTTAAGACAAAGAAATAATGCCCTCCTTTTGAAATGGTTATGGAGATTTATGCAGGAGGTAAATGCCTTATGGAGGA
AAGTCATAGCTAGTATCTATGGGTGTGACCCTCTCGGTTGGATGACCCGCCCTCCAAAAGGTGCCTTGAAAGCGTGCCCTTGGGTGGAAATTGCTAAAAACAGTGCACTG
TTTGTTAATTTTATTCATTTTAAGGCCAATTGTGGAAGAAGAATCAGGTTTTGGCATGATAGATGGGTGGGAAATTCGTCCTTGGAAGAAGTTTTTCCAAATCTTTTCCT
TATATCCCTTAAAAAAGAAGCTACAGTGGCAGATTGCCGGAATTCCGACCAAAATGATTGGGACCTTGGGTTTCGTAGGGGTATAAGGGATAGAGAATTTGATAGTTGGC
TTGGATTGGTATCATTAATCGACTCGGTGAGATTGGGGGAAGGAATACACAAGGCATATTGGGCCCTCGAAAAATCTGCCAAAGGGTGGAACCAATGCTTTAAAGCCCTT
GGCCTAAGCATGTGTCACCCAGAAAATTTGGAGCAGTGGATTAATGAAGCTCTTGATGGTTGGTCTTTAAAAGGAAAGGCGAGATTTCTGTGGAGATGTGTTGCCCGAGG
CTACTTATTGGAGATTTGGAGGGAACGTAATGGGAGGATTTTTGAAGATAAGTCGAATTCTTTTGAAAACGCTCGATTTTCGGGACGCATTTTTACCATGACAACCAAGA
AAGGCTCGACTAATGTAAGTGGAGATCCATCCTTGGTTGATAAAGAGGCGGAATCTACCCCCATCCTCTCACCACAAGAGACAACTGCACGCTTGCTGTCAGTTGAAAAC
GACGTGCGTGAGATCAAGAAAATCCTAGAACTAATGTGGGAAAAGATTGGCATTCATACCGAACAACTGACATTGAATCCGGAGGCGCAAACAACTAGGGGAAAAGAAAC
ACAAAATCGCCGAGGAGAATGCAGCAAGAAGGCAGAATTAAATTTCAACAAGAAAGGTGTTCTGGTGAACAAAAGACCGCACAAGATTGGGTTGCGGCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTATTGGGAACTCGATTCACCAATATATGGCGATGGCGGACATCTTCAGCAATTTCTACTTATGCTTTCCACTTCAATAGCCACTGCTCAACCTCTTTACAACTTGA
GGAACTACATTCAACTGACTGTGCATTTTTACCATGGTTGGAGCGAAAATCAGGAACAAAGATCTCATCAGGGCTTTCTATTGGGAAGTCTTCCATGGGCAGGTCTCTGT
TTGCTTCTGACACTATACGCGCTGGGGATTGCATTTTAAAGGTTCCTTTCAATGTGCAAATTTCACCTGATACTCTTCCTTCACCCATTAGAGATCTCTTAGGCGATGAG
ATTGGAAATGTTGCCAAGCTCGCTGTTGTACTTCTTCTTGAACAGAAACTGGGTCCAATATTTTGGAACGAAAGTGAGTTGGAGATGATTCGTAAAAGCTTTTTGTATGA
GGAATCACTTAATCAAAGATCACAAATTGAAAGGGAATTTTTGGCAATCAGGCGAGCTCTGGAAACCTTCCCTGAAATTATTGATAGCATCAATTGCGACAATTTCATGC
ATGCTTATGCCCTTGTTACTTCTAGAGCATGGAGATGGACAAAGGGTGTCTCTCTGATTCCATTTGCAGACTTTTTAAATCATGATGGTGCTTCAGAAGCAATGGTCATT
GCTGATCGTGATTATGCCCCTGGTGAACATGAAAAGCTGGAATCCAGATTTCACATGGGTGCTAATAAGATACGGAAAATATTCAAATGCTTCATTGATGTTGGACTTTG
GGTTTGCGCTTCCATACAACATTCACGATCAGGGCTTTCAGAAAAATCTAGTGGGGGCAGAGTTTCTAGAAGTATGTGGAGGTTTAATAGGCTGATTGATGACCTTGAGC
TCATGGATATTCCGATGAACAATGGTAAATTTACTTGGTGGAATAAGGAAAGCTTCGGGTTTATTGAGGATGAGAAGAGAAAAATTCAAAAAGAGATTGATGACATTGAT
AGCGGTGAAGAAGAGGGAGTGTTGAGCAAGGATTTAATTAATAGGTGCAATGAGCTAAAAGGGAATCTGGCAGAGCTTGTTCGGAAGGAACAACGAATGTGGAACAAAAA
ATGCAAATTTCAATGGCTGAAGGAGGGCGACGAAAATACTAGTTTCTTTCATCGATGGGCCAATAGTAGGAGAAACAGGGCTTTCATTTCTTCCATTGAAGATGATAATG
GGAACTTTTGGACTAATATGAATGATATTGAAGAAGCGGGACTCTCTCTTAATGTTTCAAAGACGGTTCTTGTGGGCTTAAATGTGGATGCAAATGTTCTTAATCAGAAG
GCTAATTTGATTGGTTGTGAGGTTGAATCTTTGCCTATTACCTACTTAGGGATTCCTTTGGGGGGAATTTTCCATTCGACTGTGTTATGGGAACCAATGGTGGATAAATT
TCGTGCTAAGTATGGGGGGTTGGGAATTGGATCCTTAAGACAAAGAAATAATGCCCTCCTTTTGAAATGGTTATGGAGATTTATGCAGGAGGTAAATGCCTTATGGAGGA
AAGTCATAGCTAGTATCTATGGGTGTGACCCTCTCGGTTGGATGACCCGCCCTCCAAAAGGTGCCTTGAAAGCGTGCCCTTGGGTGGAAATTGCTAAAAACAGTGCACTG
TTTGTTAATTTTATTCATTTTAAGGCCAATTGTGGAAGAAGAATCAGGTTTTGGCATGATAGATGGGTGGGAAATTCGTCCTTGGAAGAAGTTTTTCCAAATCTTTTCCT
TATATCCCTTAAAAAAGAAGCTACAGTGGCAGATTGCCGGAATTCCGACCAAAATGATTGGGACCTTGGGTTTCGTAGGGGTATAAGGGATAGAGAATTTGATAGTTGGC
TTGGATTGGTATCATTAATCGACTCGGTGAGATTGGGGGAAGGAATACACAAGGCATATTGGGCCCTCGAAAAATCTGCCAAAGGGTGGAACCAATGCTTTAAAGCCCTT
GGCCTAAGCATGTGTCACCCAGAAAATTTGGAGCAGTGGATTAATGAAGCTCTTGATGGTTGGTCTTTAAAAGGAAAGGCGAGATTTCTGTGGAGATGTGTTGCCCGAGG
CTACTTATTGGAGATTTGGAGGGAACGTAATGGGAGGATTTTTGAAGATAAGTCGAATTCTTTTGAAAACGCTCGATTTTCGGGACGCATTTTTACCATGACAACCAAGA
AAGGCTCGACTAATGTAAGTGGAGATCCATCCTTGGTTGATAAAGAGGCGGAATCTACCCCCATCCTCTCACCACAAGAGACAACTGCACGCTTGCTGTCAGTTGAAAAC
GACGTGCGTGAGATCAAGAAAATCCTAGAACTAATGTGGGAAAAGATTGGCATTCATACCGAACAACTGACATTGAATCCGGAGGCGCAAACAACTAGGGGAAAAGAAAC
ACAAAATCGCCGAGGAGAATGCAGCAAGAAGGCAGAATTAAATTTCAACAAGAAAGGTGTTCTGGTGAACAAAAGACCGCACAAGATTGGGTTGCGGCTCTAA
Protein sequenceShow/hide protein sequence
MLLGTRFTNIWRWRTSSAISTYAFHFNSHCSTSLQLEELHSTDCAFLPWLERKSGTKISSGLSIGKSSMGRSLFASDTIRAGDCILKVPFNVQISPDTLPSPIRDLLGDE
IGNVAKLAVVLLLEQKLGPIFWNESELEMIRKSFLYEESLNQRSQIEREFLAIRRALETFPEIIDSINCDNFMHAYALVTSRAWRWTKGVSLIPFADFLNHDGASEAMVI
ADRDYAPGEHEKLESRFHMGANKIRKIFKCFIDVGLWVCASIQHSRSGLSEKSSGGRVSRSMWRFNRLIDDLELMDIPMNNGKFTWWNKESFGFIEDEKRKIQKEIDDID
SGEEEGVLSKDLINRCNELKGNLAELVRKEQRMWNKKCKFQWLKEGDENTSFFHRWANSRRNRAFISSIEDDNGNFWTNMNDIEEAGLSLNVSKTVLVGLNVDANVLNQK
ANLIGCEVESLPITYLGIPLGGIFHSTVLWEPMVDKFRAKYGGLGIGSLRQRNNALLLKWLWRFMQEVNALWRKVIASIYGCDPLGWMTRPPKGALKACPWVEIAKNSAL
FVNFIHFKANCGRRIRFWHDRWVGNSSLEEVFPNLFLISLKKEATVADCRNSDQNDWDLGFRRGIRDREFDSWLGLVSLIDSVRLGEGIHKAYWALEKSAKGWNQCFKAL
GLSMCHPENLEQWINEALDGWSLKGKARFLWRCVARGYLLEIWRERNGRIFEDKSNSFENARFSGRIFTMTTKKGSTNVSGDPSLVDKEAESTPILSPQETTARLLSVEN
DVREIKKILELMWEKIGIHTEQLTLNPEAQTTRGKETQNRRGECSKKAELNFNKKGVLVNKRPHKIGLRL