; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0023141 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0023141
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionCholine monooxygenase, chloroplastic
Genome locationchr02:8568394..8571620
RNA-Seq ExpressionIVF0023141
SyntenyIVF0023141
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016020 - membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa]1.91e-27387.15Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----
        KDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLATGCGKKSCF     GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV    
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----

Query:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
                DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQ        +   +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLHRNLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus]2.11e-25481.54Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----
        KDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLA+GCGKKSCF     GWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFV    
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----

Query:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
                DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        GDD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQ        +   +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLH NLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo]5.48e-27386.92Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----
        KDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLATGCGKKSCF     GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV    
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----

Query:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
                DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQ        +   +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLHRNLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

XP_016899776.1 PREDICTED: choline monooxygenase, chloroplastic isoform X1 [Cucumis melo]5.90e-26883Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQA+      
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNEND
                     GYVEQLKDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLATGCGKKSCF     GWTYGLDGILLKATRINGIQNFNEND
Subjt:  -------------GYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNEND

Query:  FGLVPLPVAMWGPFV------------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
        FGLVPLPVAMWGPFV            DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
Subjt:  FGLVPLPVAMWGPFV------------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLS
        TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQ        +
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLS

Query:  ISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNLTK
           +L   V+   KG    + + GRYAPSVENAMHHFHRLLHRNLTK
Subjt:  ISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNLTK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]1.51e-24778.5Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPSISFN H CNHRSPSRISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSF+ALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----
        KD HDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLA+GCGK+SCF     GWTYGLDG+LLKATRINGIQNF+EN+FGL+PLPVA WGPFV    
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----

Query:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
                DVDEDKV  EWLG+CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL L+SYSTE+FETVSIQSCKGGGE+K
Subjt:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        G+DYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLE+SFKND  FIQ+SLEDSESVQ            +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLH NLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

TrEMBL top hitse value%identityAlignment
A0A0A0LXX3 Choline monooxygenase, chloroplastic6.0e-20181.54Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----
        KDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLA+GCGKKSCF     GWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFV    
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----

Query:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
                DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        GDD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQ        +   +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLH NLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

A0A1S3B9A0 Choline monooxygenase, chloroplastic4.3e-21586.92Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----
        KDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLATGCGKKSCF     GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV    
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----

Query:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
                DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQ        +   +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLHRNLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

A0A1S4DUW5 choline monooxygenase, chloroplastic isoform X11.7e-21183Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQA+      
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNEND
                     GYVEQLKDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLATGCGKKSCF     GWTYGLDGILLKATRINGIQNFNEND
Subjt:  -------------GYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNEND

Query:  FGLVPLPVAMWGPFV------------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
        FGLVPLPVAMWGPFV            DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
Subjt:  FGLVPLPVAMWGPFV------------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLS
        TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQ        +
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLS

Query:  ISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNLTK
           +L   V+   KG    + + GRYAPSVENAMHHFHRLLHRNLTK
Subjt:  ISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNLTK

A0A5A7UY48 Choline monooxygenase, chloroplastic4.3e-21586.92Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----
        KDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLATGCGKKSCF     GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV    
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFV----

Query:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
                DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  --------DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQ        +   +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLHRNLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic1.9e-21587.15Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPF-----
        KDAHDFFTGR+  ++     DNNRKVRAFHNVCRHHASLLATGCGKKSCF     GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPF     
Subjt:  KDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPF-----

Query:  -------VDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
               VDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  -------VDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQ        +   +L   V+   KG    
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVT

Query:  SLQVGRYAPSVENAMHHFHRLLHRNLTK
        + + GRYAPSVENAMHHFHRLLHRNLTK
Subjt:  SLQVGRYAPSVENAMHHFHRLLHRNLTK

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic4.8e-9443.18Show/hide
Query:  RSPSRISA---------SLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ---
        R+P++I+          SL+  +P S     + LV +FDP+IP E A TPPSSWY +P+F++ EL+R+FY+GWQ  G  +Q+K+ + +FTG +  ++   
Subjt:  RSPSRISA---------SLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ---

Query:  --DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDK-------VAREWLGTC
          D   KV AFHNVC H AS+LA G GKKSCF     GW YG+DG L KA++    QN +  + GLVPL VA+WGPFV +  D+       V  EWLGT 
Subjt:  --DNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDK-------VAREWLGTC

Query:  ADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMIN
        A+ ++ +  D SL ++ R E+ ++ NWK+F DNYLD  YHVPYAHK  A+ LN ++Y T++ E V+IQ  + G  +K D + R+G +A YAF YPNF + 
Subjt:  ADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMIN

Query:  RYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLH
        RYGPWM T  + PLGPRKC +V DY++E+S  +D  +I++ +  +++VQ            VL  SV+R   G    + + GRY   +E  +HHFH  L 
Subjt:  RYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLH

Query:  RNL
        + L
Subjt:  RNL

O22553 Choline monooxygenase, chloroplastic7.3e-9545.43Show/hide
Query:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSC
        R LV +FDPEIP E ALTPPS+WY +P+F++ EL+R+FY+GWQ  GY EQ+K+ + +FTG +  ++     D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSC

Query:  F-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDK-------VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFC
        F     GW YGLDG L KA++    QN +  + GL PL VA WGPF+ +  D+       V  EW+G  A+ ++ +  D +L +  R E+ ++CNWKVFC
Subjt:  F-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDK-------VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFC

Query:  DNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSF
        DNYLD  YHVPYAHK  A+ L+ ++Y+TE+ E   IQ   G   +K D + RLG EA YAFIYPNF + RYG WM T  V+P+G RKC +V DY+LE + 
Subjt:  DNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSF

Query:  KNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL
         +D ++I + +  +++VQ            VL  SV+R   G    + + GRY   +E  +HHFH  LH  L
Subjt:  KNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL

Q93XE1 Choline monooxygenase, chloroplastic2.5e-9545.31Show/hide
Query:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSC
        ++++ +FDP++P E   TPPS+WY DPS ++ ELDR+F +GWQ  GY +Q+K+ + +FTG +  ++     D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSC

Query:  F-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDK--------VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVF
        F     GW +GLDG L+KAT+    Q F+  + GLV L VA+WGPFV +  D+        V +EW+G+CA+ ++ +  D SL ++ R E+ ++ NWKVF
Subjt:  F-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDK--------VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVF

Query:  CDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESS
        CDNYLD  YHVPYAHK  A+ L+ ++Y T+L E V IQ       +K + + RLG EA YAFIYPNF + RYGPWM T  + PLGPRKC +V DY+LE++
Subjt:  CDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESS

Query:  FKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL
          ND  +I++S+  +++VQ            VL  SV+R   G    + + GRY   +E  +HHFH  LH+ L
Subjt:  FKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL

Q9LKN0 Choline monooxygenase, chloroplastic1.1e-9545.16Show/hide
Query:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSC
        + LV DFDP +P E ALTPPSSWY +P+F+A ELDR+FY+GWQ  GY +Q+K+A+ +FTG +  ++     D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRVCGMQ-----DNNRKVRAFHNVCRHHASLLATGCGKKSC

Query:  F-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDKVAR-------EWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFC
        F     GW YG++G L KA++    Q+ N ++ GLVPL VA+WGPF+ +  D+ +R       EWLG+CA+ ++ +  D +L ++ R E+ I+ NWK+F 
Subjt:  F-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDKVAR-------EWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFC

Query:  DNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSF
        DNYLD  YHVPYAHK  A+ L+ ++Y T++   V+IQ   G   +  + + RLG +A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY++E S 
Subjt:  DNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSF

Query:  KNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL
         +D  +I++ +  +++VQ            VL  SV+   KG    + + GRY   +E  +HHFH  LH+ L
Subjt:  KNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL

Q9SZR0 Choline monooxygenase, chloroplastic8.3e-13156.78Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV
        M TLT    +  F PPS+     + N  S   +S S      F +P   F   +  KLV +FDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQAV
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV

Query:  GYVEQLKDAHDFFTGR-------VCGMQDNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMW
        GY +Q+K++ DFFTGR       VC  +D N K+ AFHNVC HHAS+LA+G G+KSCF     GWTY L G L+KATR++GIQNF+ ++ GL PL VA+W
Subjt:  GYVEQLKDAHDFFTGR-------VCGMQDNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMW

Query:  GPFV------------DVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQ
        GPFV            +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTIDCNWKVFCDNYLDGGYHVPYAHKGL S L+LE+YST +FE VSIQ
Subjt:  GPFV------------DVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQ

Query:  SCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVR
         C GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI++SLE+S+ VQ+           +L  SV+
Subjt:  SCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVR

Query:  RSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL
        R   G    +   GRYA  VE  MHHFH LLH NL
Subjt:  RSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL

Arabidopsis top hitse value%identityAlignment
AT4G29890.1 choline monooxygenase, putative (CMO-like)5.9e-13256.78Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV
        M TLT    +  F PPS+     + N  S   +S S      F +P   F   +  KLV +FDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQAV
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV

Query:  GYVEQLKDAHDFFTGR-------VCGMQDNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMW
        GY +Q+K++ DFFTGR       VC  +D N K+ AFHNVC HHAS+LA+G G+KSCF     GWTY L G L+KATR++GIQNF+ ++ GL PL VA+W
Subjt:  GYVEQLKDAHDFFTGR-------VCGMQDNNRKVRAFHNVCRHHASLLATGCGKKSCF-----GWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMW

Query:  GPFV------------DVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQ
        GPFV            +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTIDCNWKVFCDNYLDGGYHVPYAHKGL S L+LE+YST +FE VSIQ
Subjt:  GPFV------------DVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQ

Query:  SCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVR
         C GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI++SLE+S+ VQ+           +L  SV+
Subjt:  SCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLESSFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVR

Query:  RSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL
        R   G    +   GRYA  VE  MHHFH LLH NL
Subjt:  RSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACTTTGACGAAGCATATCCAAATTCACTTCTTCCAACCTCCTTCCATTTCTTTCAATTTCCATTTCTGTAATCATCGTTCTCCCTCACGAATCTCCGCGTCTTT
ATCCTTTCGAAGCCCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTGAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTCGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGA
GTATGTGGTATGCAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCTACTGGATGTGGGAAGAAGTCGTGCTTTGGATG
GACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACTTCAATGAAAATGATTTTGGCCTTGTACCATTACCAGTAGCTATGTGGGGGC
CTTTCGTGGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTTGGAACATGTGCAGATGTGCTGAGACTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGT
GAATACACTATTGATTGTAACTGGAAGGTTTTCTGTGACAACTATTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAACCTTGAGTC
TTATTCTACGGAATTATTTGAAACTGTTAGCATTCAAAGTTGTAAGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTA
TATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGACACTAATCTAGTACTCCCACTTGGACCTCGAAAATGTTTGGTGGTTTTCGATTATTTTCTTGAATCT
TCTTTTAAGAATGATGACTCCTTTATACAACAAAGTTTAGAAGACAGTGAAAGTGTGCAGGTATTACAAACAAATCCACCCCTTTCCATTTCATGTGTGCTACATTATTC
TGTGCGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTGGCCGGTATGCTCCTTCGGTTGAGAATGCCATGCACCATTTCCATCGTCTTCTTCATCGTAACC
TCACAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACTTTGACGAAGCATATCCAAATTCACTTCTTCCAACCTCCTTCCATTTCTTTCAATTTCCATTTCTGTAATCATCGTTCTCCCTCACGAATCTCCGCGTCTTT
ATCCTTTCGAAGCCCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTGAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTCGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGA
GTATGTGGTATGCAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCTACTGGATGTGGGAAGAAGTCGTGCTTTGGATG
GACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACTTCAATGAAAATGATTTTGGCCTTGTACCATTACCAGTAGCTATGTGGGGGC
CTTTCGTGGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTTGGAACATGTGCAGATGTGCTGAGACTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGT
GAATACACTATTGATTGTAACTGGAAGGTTTTCTGTGACAACTATTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAACCTTGAGTC
TTATTCTACGGAATTATTTGAAACTGTTAGCATTCAAAGTTGTAAGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTA
TATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGACACTAATCTAGTACTCCCACTTGGACCTCGAAAATGTTTGGTGGTTTTCGATTATTTTCTTGAATCT
TCTTTTAAGAATGATGACTCCTTTATACAACAAAGTTTAGAAGACAGTGAAAGTGTGCAGGTATTACAAACAAATCCACCCCTTTCCATTTCATGTGTGCTACATTATTC
TGTGCGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTGGCCGGTATGCTCCTTCGGTTGAGAATGCCATGCACCATTTCCATCGTCTTCTTCATCGTAACC
TCACAAAATAA
Protein sequenceShow/hide protein sequence
MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGR
VCGMQDNNRKVRAFHNVCRHHASLLATGCGKKSCFGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRR
EYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLES
SFKNDDSFIQQSLEDSESVQVLQTNPPLSISCVLHYSVRRSSKGSRVTSLQVGRYAPSVENAMHHFHRLLHRNLTK