; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039874 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039874
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionFAD_binding_3 domain-containing protein
Genome locationchr13:580476..584276
RNA-Seq ExpressionLag0039874
SyntenyLag0039874
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0071949 - FAD binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002938 - FAD-binding domain
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7012746.1 hypothetical protein SDJN02_25499, partial [Cucurbita argyrosperma subsp. argyrosperma]7.7e-20090.5Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+FKRFNGL+KF+A+L AIP+G VQSRGLS+SKLFHGGEET VPVLIVGAGPVGLVLAILLTKLGVKCAI+EKN  FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAEEIQL QPPVDSWRKFIYCTSL GTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDS EG   VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        I  TDD+VTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGE +LQ LVS+HFFSRELGEYLL ERPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDFCPKMCEE+IF LVGLNLCD+DV+DVKPWIMHAEVAEKFI C+ RVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

XP_022151935.1 uncharacterized protein LOC111019789 [Momordica charantia]4.1e-20190.77Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M +LG+  RFNGLRKFDA L A+P+G VQSRGLS+SK+FHGGE+TMVPVLIVGAGPVGLVLAILLTKLGVKCA+VEKNR FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        L GLAEEIQLCQPPVDSWRKFIYCTSLNG ILGSVDHMQPQDF +IISPVSVAHFSQYKLNRLLLK+LQNLGFQVCSPDSLE +  VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        IDATDDSVT TASYLKEGKH ERRNICSNILVG DGAGSTVRRLVGIE+KGEN+LQ LVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQ+IEDF PKMCEELIFKLVGLNLCDIDV+DVKPWIMHAEVAEKFICC NRVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

XP_022945741.1 uncharacterized protein LOC111449881 [Cucurbita moschata]2.2e-19990.5Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+FKRFNGL+KF+A+L AIP+G VQSRGLS+SKLFHGGEET VPVLIVGAGPVGLVLAILLTKLGVKCAI+EKN  FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAEEIQL QPPVDSWRKFIYCTSL GTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNL FQVCSPDS EG   VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        I  TDD+VTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGE +LQ LVS+HFFSRELGEYLL ERPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDFCPKMCEE+IF LVGLNLCD+DV+DVKPWIMHAEVAEKFI CQ RVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

XP_022966729.1 uncharacterized protein LOC111466353 isoform X1 [Cucurbita maxima]2.2e-20291.03Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+FKRFNGLR+F+A+L AIP+G VQSRGLS+SKLFHGGEET VPVLIVGAGPVGLVLAILLTKLGVKCAI+EKN  FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAEEIQLCQPPVDSWRKF+YCTSL GTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDS EG   VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        I  TDD+VTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGE NLQ LVS+HFFSRELGEYLL ERPGMLYFIFN EAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDFCPKMCEE+IF LVGLNLCD+DV+DVKPWIMHAEVAEKFI CQNRVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

XP_023541526.1 uncharacterized protein LOC111801672 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-20090.77Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+FKRFNGL+KFDA+L A+P+G VQSRGLS+SKLFHGGEET VPVLIVGAGPVGLVLAILLTKLGVKCAI+EKN  FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAEEIQL QPPVDSWRKFIYCTSL GTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDS EG   VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        I  TDD+VTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGE +LQ LVS+HFFSRELGEYLL ERPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDFCPKMCEE+IF LVGLNLCD+DV+DVKPWIMHAEVAEKFI CQ RVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

TrEMBL top hitse value%identityAlignment
A0A0A0KLD9 FAD_binding_3 domain-containing protein1.8e-19488.13Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+FKRFNGL+KFDA L   P+  +Q RG S+SK+FHGG+ETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKN+ FS HPQAHFINNR+MEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAE+IQL QPPV+SWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLN LLLKQLQNLGFQVCSPDSLEG   VREK+IL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        IDATD+SV MTASYLKEGKH+ERRNI  NILVGADGAGSTVRRLVGIEMKGEN+LQ LVS+HFFSRELGEYLL +RPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDF P+MCEELIFKLVG NLCDIDVRDVKPWIMHAEVAEKFIC QN VLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

A0A1S3BZD5 putative polyketide hydroxylase3.9e-19788.65Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+F+RFNGL+KFDAT  AIP+G VQ RG S+SKLFHGG+ETMVPVLIVGAGPVGLVLAILLTKLG+KCAIVEKNR FS HPQAHFINNR+MEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAE+IQL QPPV+SWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQV SPDSLEG   VREK+IL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        IDATD+SV MTASYLKEGKH+ERRNI  NILVGADGAGSTVRRLVG+EMKGEN+LQ LVS+HFFSRELGEYLL +RPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDFCP MC ELIFKLVG NLCDIDV+DVKPWIMHAEVAEKFICC+N VLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

A0A6J1DCK1 uncharacterized protein LOC1110197892.0e-20190.77Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M +LG+  RFNGLRKFDA L A+P+G VQSRGLS+SK+FHGGE+TMVPVLIVGAGPVGLVLAILLTKLGVKCA+VEKNR FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        L GLAEEIQLCQPPVDSWRKFIYCTSLNG ILGSVDHMQPQDF +IISPVSVAHFSQYKLNRLLLK+LQNLGFQVCSPDSLE +  VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        IDATDDSVT TASYLKEGKH ERRNICSNILVG DGAGSTVRRLVGIE+KGEN+LQ LVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQ+IEDF PKMCEELIFKLVGLNLCDIDV+DVKPWIMHAEVAEKFICC NRVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

A0A6J1G1R0 uncharacterized protein LOC1114498811.1e-19990.5Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+FKRFNGL+KF+A+L AIP+G VQSRGLS+SKLFHGGEET VPVLIVGAGPVGLVLAILLTKLGVKCAI+EKN  FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAEEIQL QPPVDSWRKFIYCTSL GTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNL FQVCSPDS EG   VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        I  TDD+VTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGE +LQ LVS+HFFSRELGEYLL ERPGMLYFIFNTEAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDFCPKMCEE+IF LVGLNLCD+DV+DVKPWIMHAEVAEKFI CQ RVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

A0A6J1HUM9 uncharacterized protein LOC111466353 isoform X11.0e-20291.03Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        M  LG+FKRFNGLR+F+A+L AIP+G VQSRGLS+SKLFHGGEET VPVLIVGAGPVGLVLAILLTKLGVKCAI+EKN  FSNHPQAHFINNRSMEVFRK
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS
        LDGLAEEIQLCQPPVDSWRKF+YCTSL GTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDS EG   VREKQIL+GHECVS
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVS

Query:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG
        I  TDD+VTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGE NLQ LVS+HFFSRELGEYLL ERPGMLYFIFN EAIGVLVAHDLKQG
Subjt:  IDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQG

Query:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        EFILQVPFYPPQQNIEDFCPKMCEE+IF LVGLNLCD+DV+DVKPWIMHAEVAEKFI CQNRVLLAGDAAHRFPPAGGF
Subjt:  EFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

SwissProt top hitse value%identityAlignment
P27138 2,4-dichlorophenol 6-monooxygenase6.4e-2425.81Show/hide
Query:  VLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSV-----DHMQPQDF
        VL+VG GP G     LL + GV+  ++ K  + +  P+AH  N R+ME+ R L GL  E +L   P D   +   C SL G   G +     D  +  D+
Subjt:  VLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSV-----DHMQPQDF

Query:  EHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRR
        +   SP S+    Q  L  +L+K             +L+G   VR     +GHE              +S L++  + E   + S  L+GADGA S V  
Subjt:  EHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRR

Query:  LVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIF----NTEAIGVLVAHDLKQGEFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDID
         + + ++G       +++  F  +L  Y+ + RP +LY++     +   +G+ V   ++     L +  Y  +Q   +        ++  L+G +   + 
Subjt:  LVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIF----NTEAIGVLVAHDLKQGEFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDID

Query:  VRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF---EVFQEFFQ---------NGIINKRTNETY-ICLIPKKKKVT-----MVKDFRPISLI
        +  +  W ++   A +    Q RV  AGDA HR PP  G       Q+ F          NG  ++   +TY I   P  K+V       ++DF PI++ 
Subjt:  VRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF---EVFQEFFQ---------NGIINKRTNETY-ICLIPKKKKVT-----MVKDFRPISLI

Query:  SSL
          L
Subjt:  SSL

P31020 Phenol 2-monooxygenase1.8e-1323.55Show/hide
Query:  GLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTI
        GLS++   +        VLIVG+GP G   A+ L+  G+   ++ K R+ +N P+AH  N R+ME+ R   G+ +++     P +     +YC S+ G  
Subjt:  GLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTI

Query:  LG-----SVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNI
        +G          +  D+E + SP       Q  L  ++LK                 + T+R  Q     E +S    D  V++       G   +   I
Subjt:  LG-----SVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNI

Query:  CSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLY---------FIFNTEAIGVLVAHDLKQGEFILQVPFYPPQQNIED
         +  L+GADGA S V   +G    G  N+     +  +     + +L   P + Y          +       V+   D+ Q          PP+ N ++
Subjt:  CSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLY---------FIFNTEAIGVLVAHDLKQGEFILQVPFYPPQQNIED

Query:  FCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGG
               +++  LVG+   D+++     W  + + A      + RV  AGDA H+ PP+ G
Subjt:  FCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGG

P39888 Tetracenomycin polyketide synthesis hydroxylase TcmG7.4e-2028.53Show/hide
Query:  VPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFR-------------KLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILG
        VPVLIVG G  GL  A+ L++ GV C +VEK+R  +   +A  I++R+ME+ R             KL   A   +L QP        I    L+  +  
Subjt:  VPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFR-------------KLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILG

Query:  SVDHMQPQ-DFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILV
        +V   +P  D  H +SP       Q +L  +L  +    G ++     +E +FT  E  +      +   AT +  T+ A Y                L+
Subjt:  SVDHMQPQ-DFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILV

Query:  GADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVL-------VAHDLKQGEFILQVPFYPPQQNIEDFCPKMCEE
         ADG  S VR  +GI   G   + N +SV  F  +L + +   R  ++ ++ N +  GVL       V     +  +I    F P + + E F  + C +
Subjt:  GADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVL-------VAHDLKQGEFILQVPFYPPQQNIEDFCPKMCEE

Query:  LIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        +I    GL    ++V+  +PW M    A  +     RV LAGDAAH  PPAG F
Subjt:  LIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

Q54530 Aklavinone 12-hydroxylase RdmE1.8e-2129.62Show/hide
Query:  VPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFI--YCTSLNGTILGSVDHMQPQDFE
        V VL+VGAG  GL  A+ L + GV+  +VE+    S +P+A   N R+ME+ R + G+A+E+        +   F+     S+ G IL +V     + F+
Subjt:  VPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFI--YCTSLNGTILGSVDHMQPQDFE

Query:  HII------SPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAG
         ++      +P   A  SQ KL  +LL Q +  G                   I  G   +S    DD      +    G   E  ++ +  LVGADG  
Subjt:  HII------SPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAG

Query:  STVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQGEFILQVPFYPPQ-QNIEDFCPKMCEELIFKLVGLNLCD
        S VR  +GI   G   L ++V V  F  +L   +     G  Y++ + E  G     D +     L V + P + +  EDF P+ C ELI   +      
Subjt:  STVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQGEFILQVPFYPPQ-QNIEDFCPKMCEELIFKLVGLNLCD

Query:  IDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGG
         ++ D++ W M A +AE++   + RV LAGDAA   PP GG
Subjt:  IDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGG

Q8KN28 2,4-dichlorophenol 6-monooxygenase9.9e-2526.92Show/hide
Query:  VLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSV----DHMQPQDFE
        VL+VG+GP G    +LL   GVK   V K    S  P++H  N R+MEV R L GL  E +    P +   + +YCTSL G  LG V     H Q +   
Subjt:  VLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSV----DHMQPQDFE

Query:  HIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRL
         + SP  +    Q  L  +++      G                   +    E VS+   +  VT T   +++     + +I +  L+GADGA S V   
Subjt:  HIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRL

Query:  VGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIF----NTEAIGVLVAHDLKQGEFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDV
        VG+ M+G+  +   ++V  F  +L +Y +  RP +LY++     +   +G+ V   ++     L +  Y       D       +++  L+G +   + +
Subjt:  VGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIF----NTEAIGVLVAHDLKQGEFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDV

Query:  RDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGG
             W ++   A +     NRV   GDA HR PP  G
Subjt:  RDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGG

Arabidopsis top hitse value%identityAlignment
AT1G24340.1 FAD/NAD(P)-binding oxidoreductase family protein5.4e-15166.67Show/hide
Query:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK
        MA+LG  KR   +   ++ +   PV   Q + LSS+ LF+G +   +PVLIVGAGPVGLVL+ILLTKLGVKCA+V+K   FS HPQAHFINNRSME+FR+
Subjt:  MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRK

Query:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEG--SFTVREKQILVGHEC
        LDGLAEEI+  QPPVD WRKFIYCTSL+G+ LG+VDHMQPQDFE ++SP SVAHFSQYKL  LLLK+L++LGF V      +G  + +V  +QIL+GHEC
Subjt:  LDGLAEEIQLCQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEG--SFTVREKQILVGHEC

Query:  VSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLK
        V IDA  DS+T T S+LK GKH+ +RNI  ++LVGADGAGS VR+L  IEM+GE +LQ LVSVHF SRELGEYL++ RPGML+FIFNT+ IGVLVAHDL 
Subjt:  VSIDATDDSVTMTASYLKEGKHIERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLK

Query:  QGEFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF
        QGEF+LQ+P+YPPQQ++ DF P+MC+ LIF LVG  L D+DV D+KPW+MHAEVAEKF+CC+NRV+LAGDAAHRFPPAGGF
Subjt:  QGEFILQVPFYPPQQNIEDFCPKMCEELIFKLVGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGF

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.8e-0637.04Show/hide
Query:  LAERLKEVLPETINDCQAAFVKGIQILDAILVASEVVE--ERKHNREETFLLKLDFEKAYDKVSWD-------GPNFERVW
        + ERLK ++   I   QA+F+ G    D I+   E V    RK   +   LLKLD EKAYD++ WD          F  VW
Subjt:  LAERLKEVLPETINDCQAAFVKGIQILDAILVASEVVE--ERKHNREETFLLKLDFEKAYDKVSWD-------GPNFERVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTTTTAGGTTATTTCAAGAGGTTTAATGGCCTCCGGAAATTTGATGCCACACTTGGAGCAATACCAGTTGGGTGCGTTCAGAGTAGAGGCTTATCGAGTTCCAA
GCTTTTCCATGGCGGCGAAGAAACAATGGTTCCGGTTTTAATTGTTGGTGCAGGACCTGTAGGTCTCGTCCTTGCTATTCTTCTCACGAAATTAGGGGTCAAATGTGCAA
TTGTGGAGAAGAACAGATTTTTTTCTAATCATCCGCAAGCTCACTTCATAAATAACAGATCTATGGAGGTATTTCGCAAATTGGATGGATTAGCAGAGGAGATACAATTA
TGTCAACCTCCTGTAGACTCATGGAGAAAGTTCATATATTGTACTTCACTGAATGGTACAATTCTTGGATCTGTAGACCATATGCAACCTCAAGATTTTGAGCACATTAT
CAGCCCGGTTTCTGTTGCACATTTCTCCCAATACAAATTAAACAGGTTACTACTTAAGCAACTTCAAAATCTTGGGTTTCAAGTTTGTTCACCGGATAGCTTGGAGGGTT
CCTTTACAGTAAGAGAAAAGCAAATACTTGTGGGGCATGAGTGTGTTTCTATTGATGCTACTGATGACTCTGTAACCATGACTGCATCTTATCTCAAGGAAGGGAAGCAT
ATCGAGAGGAGGAATATATGCAGTAATATCCTTGTTGGTGCAGATGGTGCTGGAAGTACTGTGCGGAGGCTAGTAGGCATAGAAATGAAGGGTGAAAACAACTTACAAAA
TCTTGTAAGCGTCCATTTTTTTAGTAGAGAGCTTGGTGAGTATCTGCTAAATGAGAGACCTGGTATGCTATATTTCATCTTTAACACTGAAGCTATTGGGGTTCTTGTTG
CTCATGATCTCAAGCAAGGGGAATTCATATTGCAGGTACCATTCTATCCTCCTCAACAAAACATTGAAGATTTTTGTCCTAAGATGTGTGAGGAGTTAATCTTCAAATTG
GTTGGTCTAAACCTCTGTGACATAGATGTGCGAGATGTAAAACCTTGGATTATGCATGCTGAAGTTGCTGAGAAGTTCATATGCTGTCAAAATCGTGTATTACTTGCTGG
TGATGCTGCTCATCGATTTCCTCCAGCTGGTGGTTTTGAGGTGTTCCAGGAGTTTTTTCAAAACGGCATCATTAATAAGCGCACAAATGAAACTTACATATGCCTGATCC
CTAAGAAGAAGAAAGTGACAATGGTCAAAGATTTTCGTCCCATTAGCCTCATCTCCTCCCTGTACAAAATCGTAGCGAAAGTCTTAGCTGAAAGGCTTAAGGAAGTGCTC
CCCGAGACCATTAATGATTGCCAAGCTGCCTTCGTTAAAGGTATACAAATTCTGGATGCTATCTTGGTGGCTTCGGAAGTGGTGGAAGAACGAAAACATAATAGGGAAGA
AACCTTTTTGCTAAAGCTGGATTTTGAGAAAGCCTATGACAAAGTAAGTTGGGACGGTCCTAATTTTGAAAGGGTTTGGGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTTTTAGGTTATTTCAAGAGGTTTAATGGCCTCCGGAAATTTGATGCCACACTTGGAGCAATACCAGTTGGGTGCGTTCAGAGTAGAGGCTTATCGAGTTCCAA
GCTTTTCCATGGCGGCGAAGAAACAATGGTTCCGGTTTTAATTGTTGGTGCAGGACCTGTAGGTCTCGTCCTTGCTATTCTTCTCACGAAATTAGGGGTCAAATGTGCAA
TTGTGGAGAAGAACAGATTTTTTTCTAATCATCCGCAAGCTCACTTCATAAATAACAGATCTATGGAGGTATTTCGCAAATTGGATGGATTAGCAGAGGAGATACAATTA
TGTCAACCTCCTGTAGACTCATGGAGAAAGTTCATATATTGTACTTCACTGAATGGTACAATTCTTGGATCTGTAGACCATATGCAACCTCAAGATTTTGAGCACATTAT
CAGCCCGGTTTCTGTTGCACATTTCTCCCAATACAAATTAAACAGGTTACTACTTAAGCAACTTCAAAATCTTGGGTTTCAAGTTTGTTCACCGGATAGCTTGGAGGGTT
CCTTTACAGTAAGAGAAAAGCAAATACTTGTGGGGCATGAGTGTGTTTCTATTGATGCTACTGATGACTCTGTAACCATGACTGCATCTTATCTCAAGGAAGGGAAGCAT
ATCGAGAGGAGGAATATATGCAGTAATATCCTTGTTGGTGCAGATGGTGCTGGAAGTACTGTGCGGAGGCTAGTAGGCATAGAAATGAAGGGTGAAAACAACTTACAAAA
TCTTGTAAGCGTCCATTTTTTTAGTAGAGAGCTTGGTGAGTATCTGCTAAATGAGAGACCTGGTATGCTATATTTCATCTTTAACACTGAAGCTATTGGGGTTCTTGTTG
CTCATGATCTCAAGCAAGGGGAATTCATATTGCAGGTACCATTCTATCCTCCTCAACAAAACATTGAAGATTTTTGTCCTAAGATGTGTGAGGAGTTAATCTTCAAATTG
GTTGGTCTAAACCTCTGTGACATAGATGTGCGAGATGTAAAACCTTGGATTATGCATGCTGAAGTTGCTGAGAAGTTCATATGCTGTCAAAATCGTGTATTACTTGCTGG
TGATGCTGCTCATCGATTTCCTCCAGCTGGTGGTTTTGAGGTGTTCCAGGAGTTTTTTCAAAACGGCATCATTAATAAGCGCACAAATGAAACTTACATATGCCTGATCC
CTAAGAAGAAGAAAGTGACAATGGTCAAAGATTTTCGTCCCATTAGCCTCATCTCCTCCCTGTACAAAATCGTAGCGAAAGTCTTAGCTGAAAGGCTTAAGGAAGTGCTC
CCCGAGACCATTAATGATTGCCAAGCTGCCTTCGTTAAAGGTATACAAATTCTGGATGCTATCTTGGTGGCTTCGGAAGTGGTGGAAGAACGAAAACATAATAGGGAAGA
AACCTTTTTGCTAAAGCTGGATTTTGAGAAAGCCTATGACAAAGTAAGTTGGGACGGTCCTAATTTTGAAAGGGTTTGGGGTTAG
Protein sequenceShow/hide protein sequence
MAVLGYFKRFNGLRKFDATLGAIPVGCVQSRGLSSSKLFHGGEETMVPVLIVGAGPVGLVLAILLTKLGVKCAIVEKNRFFSNHPQAHFINNRSMEVFRKLDGLAEEIQL
CQPPVDSWRKFIYCTSLNGTILGSVDHMQPQDFEHIISPVSVAHFSQYKLNRLLLKQLQNLGFQVCSPDSLEGSFTVREKQILVGHECVSIDATDDSVTMTASYLKEGKH
IERRNICSNILVGADGAGSTVRRLVGIEMKGENNLQNLVSVHFFSRELGEYLLNERPGMLYFIFNTEAIGVLVAHDLKQGEFILQVPFYPPQQNIEDFCPKMCEELIFKL
VGLNLCDIDVRDVKPWIMHAEVAEKFICCQNRVLLAGDAAHRFPPAGGFEVFQEFFQNGIINKRTNETYICLIPKKKKVTMVKDFRPISLISSLYKIVAKVLAERLKEVL
PETINDCQAAFVKGIQILDAILVASEVVEERKHNREETFLLKLDFEKAYDKVSWDGPNFERVWG