; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002448 (gene) of Snake gourd v1 genome

Gene IDTan0002448
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCholine monooxygenase, chloroplastic
Genome locationLG09:69002970..69007321
RNA-Seq ExpressionTan0002448
SyntenyTan0002448
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa]8.8e-21885.61Show/hide
Query:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQ P I+   H CNHRSPSRISA  SFR+PDS FI+ +KLVD+FDP+IPLE A+TPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D+HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASLL +G GKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGL+PLPVA WGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        D KLSS+ DVDEDKVA EWLG+CA++L LNGVDASLS+VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL L+SYSTE+FETVSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+ FKND+S IQ+SLEDSE VQNEDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLH NL K
Subjt:  EKAMHHFHRLLHLNLAK

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo]3.0e-21885.85Show/hide
Query:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQ P I+   H CNHRSPSRISA  SFR+PDS FI+ +KLVD+FDP+IPLE A+TPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D+HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASLL +G GKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGL+PLPVA WGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        D KLSS+ DVDEDKVA EWLG+CA++L LNGVDASLS+VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL L+SYSTE+FETVSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEA FKND+S IQ+SLEDSE VQNEDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLH NL K
Subjt:  EKAMHHFHRLLHLNLAK

XP_022965039.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita moschata]3.0e-21886.57Show/hide
Query:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHI THFFQ         S ++RSP+RIS+  SFRN DSHFI+  KLVDEFDP+IPLE AVTPPSSWYTDPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D HDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+PLPVATWGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        DEKLSSE  VDED+VA+EWLGSCA++LSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTEIFE VSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE PFKND S IQ+SLEDSE VQ EDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLHLNL K
Subjt:  EKAMHHFHRLLHLNLAK

XP_022987330.1 choline monooxygenase, chloroplastic isoform X2 [Cucurbita maxima]4.7e-21986.81Show/hide
Query:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHI THFFQ         S +HRSP+RIS   SFRN DSHFI+  KLVDEFDP+IPLE AVTPPSSWYTDPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D HDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+PLPVATWGPF+LLN+
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        DEKLSSE  VDEDKVAYEWLGSCA++LSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTEIFE VSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLE PFKND S IQ+SLEDSE VQ EDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLHLNL K
Subjt:  EKAMHHFHRLLHLNLAK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]1.3e-22187.77Show/hide
Query:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI THFFQ P I+   HSCNHRSPSRISA  SFRN DSHFI+ QKLVDEFDP+IPLE AVTPPSSWY DPSF+ALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASLL SG GK+SCFVCPYHGWTYGLDG LLKATRINGIQNFD N+FGL+PLPVATWGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        D KLSSE DVDEDKV  EWLGSCA++LSLNGVDASLS+VCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCK GGE+K
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        G+DYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEA FKND   IQ+SLEDSE VQ EDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLHLNL K
Subjt:  EKAMHHFHRLLHLNLAK

TrEMBL top hitse value%identityAlignment
A0A1S3B9A0 Choline monooxygenase, chloroplastic1.5e-21885.85Show/hide
Query:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQ P I+   H CNHRSPSRISA  SFR+PDS FI+ +KLVD+FDP+IPLE A+TPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D+HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASLL +G GKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGL+PLPVA WGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        D KLSS+ DVDEDKVA EWLG+CA++L LNGVDASLS+VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL L+SYSTE+FETVSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEA FKND+S IQ+SLEDSE VQNEDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLH NL K
Subjt:  EKAMHHFHRLLHLNLAK

A0A5A7UY48 Choline monooxygenase, chloroplastic1.5e-21885.85Show/hide
Query:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQ P I+   H CNHRSPSRISA  SFR+PDS FI+ +KLVD+FDP+IPLE A+TPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D+HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASLL +G GKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGL+PLPVA WGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        D KLSS+ DVDEDKVA EWLG+CA++L LNGVDASLS+VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL L+SYSTE+FETVSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEA FKND+S IQ+SLEDSE VQNEDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLH NL K
Subjt:  EKAMHHFHRLLHLNLAK

A0A5D3BWX2 Choline monooxygenase, chloroplastic4.3e-21885.61Show/hide
Query:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQ P I+   H CNHRSPSRISA  SFR+PDS FI+ +KLVD+FDP+IPLE A+TPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQAPCIT--RHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D+HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASLL +G GKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGL+PLPVA WGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        D KLSS+ DVDEDKVA EWLG+CA++L LNGVDASLS+VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL L+SYSTE+FETVSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+ FKND+S IQ+SLEDSE VQNEDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLH NL K
Subjt:  EKAMHHFHRLLHLNLAK

A0A6J1HPX3 Choline monooxygenase, chloroplastic1.5e-21886.57Show/hide
Query:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHI THFFQ         S ++RSP+RIS+  SFRN DSHFI+  KLVDEFDP+IPLE AVTPPSSWYTDPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D HDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+PLPVATWGPF+LLNL
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        DEKLSSE  VDED+VA+EWLGSCA++LSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTEIFE VSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE PFKND S IQ+SLEDSE VQ EDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLHLNL K
Subjt:  EKAMHHFHRLLHLNLAK

A0A6J1JGJ2 Choline monooxygenase, chloroplastic2.3e-21986.81Show/hide
Query:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHI THFFQ         S +HRSP+RIS   SFRN DSHFI+  KLVDEFDP+IPLE AVTPPSSWYTDPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIPTHFFQA--PCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL
        +D HDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+PLPVATWGPF+LLN+
Subjt:  RDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNL

Query:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK
        DEKLSSE  VDEDKVAYEWLGSCA++LSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTEIFE VSIQSCK GGESK
Subjt:  DEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLE PFKND S IQ+SLEDSE VQ EDII+CE VQKGLESPAYK GRYAPSV
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSV

Query:  EKAMHHFHRLLHLNLAK
        E AMHHFHRLLHLNL K
Subjt:  EKAMHHFHRLLHLNLAK

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic2.1e-12150.5Show/hide
Query:  TRHSCNHRSPSRI--SAVFSFRNPDSHFIQP---QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVE
        T  +   R+P++I  +AV +   P      P   Q LV EFDPQIP E+A TPPSSWYT+P+F++ EL+R+FY+GWQ  G  +Q+++ + +FTG LGNVE
Subjt:  TRHSCNHRSPSRI--SAVFSFRNPDSHFIQP---QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVE

Query:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKV
        ++V +D   KV AFHNVC H AS+L  G+GKKSCFVCPYHGW YG+DG L KA++    QN D  + GL+PL VA WGPF+L++LD  L   GD     V
Subjt:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKV

Query:  AYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESKGDDYGRLGSEALYAF
          EWLG+ AE +  +  D SL F+ R E+ +E NWK+F DNYLD  YHVPYAHK  A+ L  D+Y T++ E V+IQ  + G  +K D + R+G +A YAF
Subjt:  AYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESKGDDYGRLGSEALYAF

Query:  IYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLHLNL
         YPNF + RYGPWM T  + PLGPRKC +V DY++E    +D   I+K +  ++ VQ ED+++CESVQ+GLE+PAY+ GRY   +EK +HHFH  L   L
Subjt:  IYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLHLNL

O22553 Choline monooxygenase, chloroplastic1.2e-12153.68Show/hide
Query:  QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSC
        + LV EFDP+IP E+A+TPPS+WYT+P+F++ EL+R+FY+GWQ  GY EQ+++ + +FTG LGNVE++V +D   ++ AFHNVC H AS+L  G+GKKSC
Subjt:  QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSC

Query:  FVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL PL VA WGPFIL++LD  L +  D     V  EW+G  AE +  +  D +L F  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGES-KGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDY
        WKVFCDNYLD  YHVPYAHK  A+ L  D+Y+TE+ E   IQ  + G  S K D + RLG+EA YAFIYPNF + RYG WM T  V+P+G RKC +V DY
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGES-KGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDY

Query:  FLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLHLNL
        +LE    +D + I K +  ++ VQ ED ++CESVQ+GLE+PAY+ GRY   +EK +HHFH  LH  L
Subjt:  FLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLHLNL

Q93XE1 Choline monooxygenase, chloroplastic2.0e-12454.92Show/hide
Query:  QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSC
        ++++ EFDP++P E+  TPPS+WYTDPS ++ ELDR+F +GWQ  GY +Q+++ + +FTG LGNVE++VC+D   KV AFHNVC H AS+L  GTGKKSC
Subjt:  QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSC

Query:  FVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW +GLDG L+KAT+    Q FD  + GL+ L VA WGPF+L++LD +  SEG  D   V  EW+GSCAE +  +  D SL F+ R E+ +E N
Subjt:  FVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L  D+Y T++ E V IQ       +K + + RLGSEA YAFIYPNF + RYGPWM T  + PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLHLNL
        LE    ND   I+KS+  ++ VQ ED+++CESVQ+GLE+PAY+ GRY   +EK +HHFH  LH  L
Subjt:  LEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLHLNL

Q9LKN0 Choline monooxygenase, chloroplastic1.9e-12253.44Show/hide
Query:  QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSC
        Q LV +FDP +P E+A+TPPSSWYT+P+F+A ELDR+FY+GWQ  GY +Q+++++ +FTG LGNVE++VC+D   KV AFHNVC H AS+L  G+GKKSC
Subjt:  QKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSC

Query:  FVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GL+PL VA WGPFIL++LD      GDV       EWLGSCAE +  +  D +L F+ R E+ IE N
Subjt:  FVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKVAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  D+Y T++   V+IQ  +  G S  + + RLG++A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLH
        +E    +D   I+K +  ++ VQ ED+++CESVQKGLE+PAY+ GRY   +EK +HHFH  LH
Subjt:  LEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic4.5e-15663.57Show/hide
Query:  MATLTKHIPTHFFQAPCITRHSCNHRSPSRIS-AVFS---FRNPDSHFI--QPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGY
        M TLT  +P     +   TR   N  S   +S + FS   F NP   F      KLV EFDP+IPLE A TPPSSWYTDP F++ ELDRVFY GWQAVGY
Subjt:  MATLTKHIPTHFFQAPCITRHSCNHRSPSRIS-AVFS---FRNPDSHFI--QPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGY

Query:  VEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFI
         +Q+++S DFFTGRLG+V+FVVC+D N K+ AFHNVC HHAS+L SG G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL PL VA WGPF+
Subjt:  VEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFI

Query:  LLNLDEKLSSEGDVDEDK-VAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKT
        LL +    S +G+V+ D+ VA EWLG+    LS  GVD+ LS++CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L L++YST IFE VSIQ C  
Subjt:  LLNLDEKLSSEGDVDEDK-VAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKT

Query:  GGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGR
        G +   D + RLGSEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+   K+D + I++SLE+S+ VQ ED+++CESVQ+GLES AY  GR
Subjt:  GGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGR

Query:  YAPSVEKAMHHFHRLLHLNL
        YA  VEK MHHFH LLH NL
Subjt:  YAPSVEKAMHHFHRLLHLNL

Arabidopsis top hitse value%identityAlignment
AT1G44446.1 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain2.1e-0436.54Show/hide
Query:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLK
        +V+ +  + K     N C H A  L  GT  +    CPYHGW Y  DG+  K
Subjt:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLK

AT1G44446.2 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain2.1e-0436.54Show/hide
Query:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLK
        +V+ +  + K     N C H A  L  GT  +    CPYHGW Y  DG+  K
Subjt:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLK

AT1G44446.3 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain2.1e-0436.54Show/hide
Query:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLK
        +V+ +  + K     N C H A  L  GT  +    CPYHGW Y  DG+  K
Subjt:  FVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLK

AT4G29890.1 choline monooxygenase, putative (CMO-like)3.2e-15763.57Show/hide
Query:  MATLTKHIPTHFFQAPCITRHSCNHRSPSRIS-AVFS---FRNPDSHFI--QPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGY
        M TLT  +P     +   TR   N  S   +S + FS   F NP   F      KLV EFDP+IPLE A TPPSSWYTDP F++ ELDRVFY GWQAVGY
Subjt:  MATLTKHIPTHFFQAPCITRHSCNHRSPSRIS-AVFS---FRNPDSHFI--QPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGY

Query:  VEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFI
         +Q+++S DFFTGRLG+V+FVVC+D N K+ AFHNVC HHAS+L SG G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL PL VA WGPF+
Subjt:  VEQLRDSHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFI

Query:  LLNLDEKLSSEGDVDEDK-VAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKT
        LL +    S +G+V+ D+ VA EWLG+    LS  GVD+ LS++CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L L++YST IFE VSIQ C  
Subjt:  LLNLDEKLSSEGDVDEDK-VAYEWLGSCAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKT

Query:  GGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGR
        G +   D + RLGSEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+   K+D + I++SLE+S+ VQ ED+++CESVQ+GLES AY  GR
Subjt:  GGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGR

Query:  YAPSVEKAMHHFHRLLHLNL
        YA  VEK MHHFH LLH NL
Subjt:  YAPSVEKAMHHFHRLLHLNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGTTAACGAAGCATATTCCTACTCACTTCTTTCAAGCTCCTTGCATTACTCGGCATTCGTGTAACCATCGTTCTCCCTCACGAATCTCAGCGGTTTTCTCCTT
CCGAAACCCCGATTCTCACTTCATTCAACCTCAGAAACTCGTCGATGAATTCGACCCTCAAATCCCCCTGGAGAACGCCGTCACTCCACCCAGCTCCTGGTACACTGACC
CTTCATTTTTCGCTCTCGAGCTCGATCGCGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAGAGATTCCCATGACTTTTTCACAGGCAGGTTGGGG
AATGTAGAGTTTGTGGTGTGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTCTGCCGCCATCATGCCTCACTTCTCGTGTCTGGAACTGGGAAAAAGTCGTG
CTTTGTTTGCCCGTACCATGGATGGACATATGGGTTGGATGGAGATCTGCTTAAGGCGACTAGAATAAATGGGATACAAAACTTCGATGTAAATGATTTTGGGCTCCTAC
CATTACCAGTAGCTACATGGGGGCCTTTCATTCTTCTCAATCTAGATGAAAAATTATCATCTGAGGGGGATGTTGATGAAGATAAAGTAGCATATGAATGGCTAGGAAGC
TGTGCAGAGATGCTGAGTTTGAATGGAGTTGATGCTTCATTAAGTTTTGTTTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTACTTAGA
TGGAGGATATCACGTTCCCTATGCACATAAAGGGCTCGCATCAAATCTCAAGCTGGATTCTTATTCTACAGAAATATTTGAAACTGTTAGCATTCAAAGTTGTAAGACTG
GAGGAGAATCAAAAGGCGATGATTATGGTCGACTCGGATCAGAAGCACTATATGCTTTTATATACCCAAACTTCATGATAAATAGATATGGACCTTGGATGGACACTAAT
CTAGTACTGCCACTCGGACCTCGAAAATGTCTAGTGGTTTTTGATTATTTTCTTGAAGCTCCTTTTAAGAATGACAACTCTCTTATACAAAAAAGTTTAGAAGACAGTGA
ATGCGTGCAGAATGAAGACATTATTGTGTGTGAAAGTGTTCAAAAGGGTCTCGAGTCTCCTGCTTACAAGGTTGGCCGATATGCTCCTTCTGTCGAGAAAGCCATGCACC
ATTTCCATCGTCTGCTTCATCTTAACCTCGCCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACGTTAACGAAGCATATTCCTACTCACTTCTTTCAAGCTCCTTGCATTACTCGGCATTCGTGTAACCATCGTTCTCCCTCACGAATCTCAGCGGTTTTCTCCTT
CCGAAACCCCGATTCTCACTTCATTCAACCTCAGAAACTCGTCGATGAATTCGACCCTCAAATCCCCCTGGAGAACGCCGTCACTCCACCCAGCTCCTGGTACACTGACC
CTTCATTTTTCGCTCTCGAGCTCGATCGCGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAGAGATTCCCATGACTTTTTCACAGGCAGGTTGGGG
AATGTAGAGTTTGTGGTGTGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTCTGCCGCCATCATGCCTCACTTCTCGTGTCTGGAACTGGGAAAAAGTCGTG
CTTTGTTTGCCCGTACCATGGATGGACATATGGGTTGGATGGAGATCTGCTTAAGGCGACTAGAATAAATGGGATACAAAACTTCGATGTAAATGATTTTGGGCTCCTAC
CATTACCAGTAGCTACATGGGGGCCTTTCATTCTTCTCAATCTAGATGAAAAATTATCATCTGAGGGGGATGTTGATGAAGATAAAGTAGCATATGAATGGCTAGGAAGC
TGTGCAGAGATGCTGAGTTTGAATGGAGTTGATGCTTCATTAAGTTTTGTTTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTACTTAGA
TGGAGGATATCACGTTCCCTATGCACATAAAGGGCTCGCATCAAATCTCAAGCTGGATTCTTATTCTACAGAAATATTTGAAACTGTTAGCATTCAAAGTTGTAAGACTG
GAGGAGAATCAAAAGGCGATGATTATGGTCGACTCGGATCAGAAGCACTATATGCTTTTATATACCCAAACTTCATGATAAATAGATATGGACCTTGGATGGACACTAAT
CTAGTACTGCCACTCGGACCTCGAAAATGTCTAGTGGTTTTTGATTATTTTCTTGAAGCTCCTTTTAAGAATGACAACTCTCTTATACAAAAAAGTTTAGAAGACAGTGA
ATGCGTGCAGAATGAAGACATTATTGTGTGTGAAAGTGTTCAAAAGGGTCTCGAGTCTCCTGCTTACAAGGTTGGCCGATATGCTCCTTCTGTCGAGAAAGCCATGCACC
ATTTCCATCGTCTGCTTCATCTTAACCTCGCCAAATAATTGTTCTGCAGATTTCCTCTTAGAGATGTGAATATTGTATTAACATTTGTCAGTTAATAGACTTGTTCAATA
TCTTGGCTCTTCAGTGATTGACAACCCAACCACCTAGTGACCACTTTGATGACTCAATCATCCTGATGACTACTTCAGTGATCGACAACCCAATCACCCAAACCATGGTT
GACAACCACTTGGTGACCACCATGGAGACCCAACCATCCTAACGACCACTCCGACAATGACTCAAACAACCCGACCACTCGATAACCACCCAATGACCACTTCAACGACC
TGACCACCGTGCAATCATCACCTTCAAACCAGGATAAACCCATTGAAATTGTCTAACCTTCTTTGATTTAGAGTTAAAAGATCCCCA
Protein sequenceShow/hide protein sequence
MATLTKHIPTHFFQAPCITRHSCNHRSPSRISAVFSFRNPDSHFIQPQKLVDEFDPQIPLENAVTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLRDSHDFFTGRLG
NVEFVVCKDNNRKVRAFHNVCRHHASLLVSGTGKKSCFVCPYHGWTYGLDGDLLKATRINGIQNFDVNDFGLLPLPVATWGPFILLNLDEKLSSEGDVDEDKVAYEWLGS
CAEMLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEIFETVSIQSCKTGGESKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTN
LVLPLGPRKCLVVFDYFLEAPFKNDNSLIQKSLEDSECVQNEDIIVCESVQKGLESPAYKVGRYAPSVEKAMHHFHRLLHLNLAK