CuGenDBv2

Lag0039875 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039875
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: FAD_binding_3 domain-containing protein
Genome location: chr13:584989..589611
RNA-Seq Expression: Lag0039875
Synteny: Lag0039875
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0071949 - FAD binding (molecular function)
InterPro domains:
  IPR002938 - FAD-binding domain
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036188 - FAD/NAD(P)-binding domain superfamily
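
The genome location above uses a `sequence:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, since the record does not state its coordinate convention), the span of the gene model can be derived as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, length).

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1

if __name__ == "__main__":
    # Location string copied from the record above.
    print(parse_location("chr13:584989..589611"))
```

Under that assumption the gene model spans 4,623 bp on chr13.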


Homology
GenBank top hits | e value | % identity
KAG6573661.1 hypothetical protein SDJN03_27548, partial [Cucurbita argyrosperma subsp. sororia] | 1.9e-156 | 87.31
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIA  NTALS+KNFKAAME+PAALGLDPKIANSVH+AVNHG+GSVLSSSLQ  VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        LQLSDTLLNDKNPVGSSRLAKL HIFDEGKSLQLQFPAEDLGFRYSEGAL+PDN    G+EEPTGRRR+YIPSADPGS+LPHMNVR L S ++ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPESYRLA AAL VAEEFKT VRVCI WSADITRIES SKEELTPWE+YIDVQEI Q ST   SWWD+CQMTDKGAILVRPDEH+ W
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GDPNTELRRVF TLLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

KAG7012746.1 hypothetical protein SDJN02_25499, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.9e-156 | 87.31
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIA  NTALS+KNFKAAME+PAALGLDPKIANSVH+AVNHG+GSVLSSSLQ  VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        LQLSDTLLNDKNPVGSSRLAKL HIFDEGKSLQLQFPAEDLGFRYSEGAL+PDN    G+EEPTGRRR+YIPSADPGS+LPHMNVR L S ++ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPESYRLA AAL VAEEFKT VRVCI WSADITRIES SKEELTPWE+YIDVQEI Q ST   SWWD+CQMTDKGAILVRPDEH+ W
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GDPNTELRRVF TLLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

XP_022151935.1 uncharacterized protein LOC111019789 [Momordica charantia] | 2.5e-156 | 85.76
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALS+KNFKAAMEVPAALGLDP+IANSVH+AVNHGLGSVL SSLQS+VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        +QLS++LLNDKNP+GSSRLAKLR IFDEGKSLQLQFPAEDLGFRYSEGALIPDNT +SG EEPTGRRRQY+PSADPGSRLPHMNVR L SED+ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKV-AEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPESY LA +ALK+ AEEFKT V+VC+ WSADI R+ESRS++ELTPWE+Y+DVQEI QSST PSWWD+C+MTD GAILVRPDEHIAW
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKV-AEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GD NT+++RVF  LLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

XP_022945741.1 uncharacterized protein LOC111449881 [Cucurbita moschata] | 1.1e-156 | 87.93
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIA  NTALS+KNFKAAMEVPAALGLDPKIANSVH+AVNHGLGSVLSSSLQ  VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        LQLSDTLLNDKNPVGSSRLAKL HIFDEGKSLQLQFPAEDLGFRYSEGAL+PDN    G+EEPTGRRR+YIPSADPGS+LPHMNVR L S ++ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPESYRLA AAL VAEEFKT VRVCI WSADITRIES SKEELTPWE+YIDVQEI Q ST   SWWD+CQMTDKGAILVRPDEH+ W
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GDPNTELRRVF TLLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

XP_022966729.1 uncharacterized protein LOC111466353 isoform X1 [Cucurbita maxima] | 7.2e-156 | 87.31
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIA  NTALS+KNFKAAMEVPAALGLDPKIANSVH+AVNHGLGSVLSSSLQ  VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        LQLSDTLLN+KNPVGSSRLAKL HIFDEGKSLQLQFPAEDLGFRYSEGALIPDN  + G+EEPTGRRR+YIPSADPGS+LPHMNVR L S ++ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPE YRLA AAL VAEEFKT VRVCI WSADITRIES SKEEL PWE+YIDVQEI Q ST   SWWD+CQMTDKGAILVRPDEH+AW
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GDPNTELRRVF +LLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

TrEMBL top hits | e value | % identity
A0A5A7UMT6 Putative polyketide hydroxylase | 7.8e-156 | 86.38
Query:  EGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIG
        EGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIA FNTALS+KNFKAAMEVPAALGLDPKIANSVH  VN+GLGS+LSSSLQSAVLDGIFKIG
Subjt:  EGMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIG

Query:  RLQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDL
        RLQLSD  LN KNP+GSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYS+GA+IPDNT + G+EEPTGRRRQYIPSADPGSRLPHMNVRVL SEDIISTLDL
Subjt:  RLQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDL

Query:  VSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAW
        VSGDK+EFLLIIAP  ESY LA A  KVAEEFKT V+VCI WSA  T+IES SK+ LTPWE+Y+DV+EI QS+T PSWWDIC+MTDKGAILVRPDEHIAW
Subjt:  VSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSGI GDPNTEL RVF TLLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

A0A6J1DCK1 uncharacterized protein LOC111019789 | 1.2e-156 | 85.76
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALS+KNFKAAMEVPAALGLDP+IANSVH+AVNHGLGSVL SSLQS+VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        +QLS++LLNDKNP+GSSRLAKLR IFDEGKSLQLQFPAEDLGFRYSEGALIPDNT +SG EEPTGRRRQY+PSADPGSRLPHMNVR L SED+ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKV-AEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPESY LA +ALK+ AEEFKT V+VC+ WSADI R+ESRS++ELTPWE+Y+DVQEI QSST PSWWD+C+MTD GAILVRPDEHIAW
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKV-AEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GD NT+++RVF  LLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

A0A6J1G1R0 uncharacterized protein LOC111449881 | 5.4e-157 | 87.93
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIA  NTALS+KNFKAAMEVPAALGLDPKIANSVH+AVNHGLGSVLSSSLQ  VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        LQLSDTLLNDKNPVGSSRLAKL HIFDEGKSLQLQFPAEDLGFRYSEGAL+PDN    G+EEPTGRRR+YIPSADPGS+LPHMNVR L S ++ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPESYRLA AAL VAEEFKT VRVCI WSADITRIES SKEELTPWE+YIDVQEI Q ST   SWWD+CQMTDKGAILVRPDEH+ W
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GDPNTELRRVF TLLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

A0A6J1HSF3 uncharacterized protein LOC111466353 isoform X2 | 3.5e-156 | 87.31
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIA  NTALS+KNFKAAMEVPAALGLDPKIANSVH+AVNHGLGSVLSSSLQ  VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        LQLSDTLLN+KNPVGSSRLAKL HIFDEGKSLQLQFPAEDLGFRYSEGALIPDN  + G+EEPTGRRR+YIPSADPGS+LPHMNVR L S ++ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPE YRLA AAL VAEEFKT VRVCI WSADITRIES SKEEL PWE+YIDVQEI Q ST   SWWD+CQMTDKGAILVRPDEH+AW
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GDPNTELRRVF +LLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

A0A6J1HUM9 uncharacterized protein LOC111466353 isoform X1 | 3.5e-156 | 87.31
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMER+PIA  NTALS+KNFKAAMEVPAALGLDPKIANSVH+AVNHGLGSVLSSSLQ  VLDGIFKIGR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV
        LQLSDTLLN+KNPVGSSRLAKL HIFDEGKSLQLQFPAEDLGFRYSEGALIPDN  + G+EEPTGRRR+YIPSADPGS+LPHMNVR L S ++ISTLDLV
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNT-ISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW
        SGDKVEFLLIIAPLPE YRLA AAL VAEEFKT VRVCI WSADITRIES SKEEL PWE+YIDVQEI Q ST   SWWD+CQMTDKGAILVRPDEH+AW
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSST-LPSWWDICQMTDKGAILVRPDEHIAW

Query:  RKKSGICGDPNTELRRVFATLLK
        R KSG+ GDPNTELRRVF +LLK
Subjt:  RKKSGICGDPNTELRRVFATLLK

SwissProt top hits | e value | % identity
P27138 2,4-dichlorophenol 6-monooxygenase | 2.3e-11 | 26.73
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        G NT IQD  NLAWK+A VL   A  S+L+TY +ER PIA+     + K+ +    +  ALGL    A S  E                           
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLVS
        ++ +     +  P   ++  +LR     G +        ++  RY   A++ DN  S  E        +  S  PG+ +PH+ V        IST DL  
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLVS

Query:  GDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAWRK
          K  F L       +++ A AA  V+ +    V V I                + P ++Y D            +  I ++ D GAILVRPD H+A+R 
Subjt:  GDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAWRK

Query:  KS---GICGDPNTELRRV
         S      GD  + +RR+
Subjt:  KS---GICGDPNTELRRV

P31020 Phenol 2-monooxygenase | 4.9e-14 | 26.67
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        G NT IQD +NL WKLA VL+  A P +L TY  ER PIA+               V  A G     ++S ++ +   LG V  ++     ++ +     
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYI---PSADPGSRLPHMNVRVLVSEDIISTLD
              L  + +P G+ R A LR   D  K  +      ++G  Y   A+I D    G++ P       +    S  PG RLPH    +  +++  ST D
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYI---PSADPGSRLPHMNVRVLVSEDIISTLD

Query:  LVSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIA
        +  G +  F +      +++  A AA++VAE     ++  +                         VQ+++       W    ++ + G ILVRPD+HI 
Subjt:  LVSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIA

Query:  WRKKSGICGDPNTEL
        WR +S +  DP T L
Subjt:  WRKKSGICGDPNTEL

P42534 Putative polyketide hydroxylase | 6.4e-14 | 29.25
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTA----LSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIF
        G NTGIQD HNLAWKLAAVL+  A  ++L+TY+ ERRP+A+  +A     S+++       P   G     A +   A   G G             G  
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTA----LSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIF

Query:  KIGRLQLSDTLLNDKNPVGSSRLAKLRHIFDE----------GKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIP-SADPGSRLPHMNVR
         +G             P G       R               G   Q       LG+RY  GA++      G +  T    + +  +  PGSR PH+ VR
Subjt:  KIGRLQLSDTLLNDKNPVGSSRLAKLRHIFDE----------GKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIP-SADPGSRLPHMNVR

Query:  VLVSEDIISTLDLVSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDK
            +D +STLDL     V  LL  A  P  +  A A   VA   + P++          R+      +L P +   D            W     +T  
Subjt:  VLVSEDIISTLDLVSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDK

Query:  GAILVRPDEHIAWRKKSGICGDPNTELRRVFATLL
        GA+LVRPD  +AWR   G   DP + LR+V  T+L
Subjt:  GAILVRPDEHIAWRKKSGICGDPNTELRRVFATLL

Q05355 Putative polyketide hydroxylase | 2.7e-12 | 26.09
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        G NTGIQD HNLAWKLAAVL   A   +L+TY+ ERRP+A+  TA +                      +   A +   G             GI  +  
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIPSA-DPGSRLPHMNVRVLVSEDIISTLDLV
                                                LG+RY  GA++      G +  T    + +  A +PGSR PH+   +    + +STLDL 
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIPSA-DPGSRLPHMNVRVLVSEDIISTLDLV

Query:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAWR
            V      A  P+++    +A+++AEE   P+           R+   +  +LTP +   DV    +  T P           GA+LVRPD  +AWR
Subjt:  SGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAWR

Query:  KKSGI-CGDPNTELRRVFATLL
         +  +   +    LR V  T+L
Subjt:  KKSGI-CGDPNTELRRVFATLL

Q5ATH0 FAD-dependent monooxygenase apdD | 7.3e-10 | 28.02
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGI D +++ WKLAAV+Q  A P+ L +YE ERRP+ +     S  +    M++ A LGLD  +       +N+  G+ L  ++ S           
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIPSADPGSRLPHM
        LQ  D        +G                        ++G+RY     +P    +    P    R+Y P   PG R PH+
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIPSADPGSRLPHM

Arabidopsis top hits | e value | % identity
AT1G24340.1 FAD/NAD(P)-binding oxidoreductase family protein | 2.9e-110 | 59.2
Query:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR
        GMNTGIQD HNLAWK+AA++Q  A+ SIL TYE ERRPIA  NT+LS++NF+AAM VP+ALGLDP +ANSVH  +N  +GS+L + LQ A+LD +F +GR
Subjt:  GMNTGIQDVHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGR

Query:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDN--TISGKEEPTGRRRQYIPSADPGSRLPHMNVRVL---VSEDIIST
         QLS++LLN+ NP+G+ RL++L+ IF+ GKSLQLQFPAEDLGFRY EGA++PDN       E P+GRRR Y+P A+PGSRLPHM V++L     E I+ST
Subjt:  LQLSDTLLNDKNPVGSSRLAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDN--TISGKEEPTGRRRQYIPSADPGSRLPHMNVRVL---VSEDIIST

Query:  LDLVSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEI-WQSSTLPSWWDICQMTDKGAILVRPDE
        LDLVS +KVEFLLII+PL ESY LA A  KVA+EF   V+VC+ W +    +E +S   L PWE+Y+DV E+  Q+    SWW IC+M+++G+ILVRPD+
Subjt:  LDLVSGDKVEFLLIIAPLPESYRLACAALKVAEEFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEI-WQSSTLPSWWDICQMTDKGAILVRPDE

Query:  HIAWRKKSGICGDPNTELRRVFATLL
        HIAWR KSGI  DP   +R VF  +L
Subjt:  HIAWRKKSGICGDPNTELRRVFATLL


Sequences
CDS sequence
ATGGGATTGGACGGCTCTTCCAACTTCGCTAGGAGGTTTGGGGATTGGATCCCTCTACCACAGGAACACGACCCTTCTTACCAAATGGTTGTGGAGATTTACAATGTTTC
TGTGAACAAAGGCAAAACCATATTCCAATGTTGGAATTCTGAGAACCAAGTCTGGGACCTTGGTCTCAGACGAGGCCTTTTTGATTGGGAGCTCGTTGCTTGGATTGCCT
TGGTGGATAGGCTCTCGGCTGTGCAGAGGGGGGAGGGGATGGACAGGCTGCAATGGTCCATTGAGAAATCTGGTACCTTTTCAAGCAAATCTGTGTTTCTCCAGATGAAT
ATGGGTCAATCAACAGCAAATTTGCCTTTGATTAGTTTAATTTGGAAAGGTTATAGCCCGAAAAAAGTCAAGGTGTTTTTGTGGTCGTTGGCTTTTAGAAGTCTGAACAC
AGATGACAGGTTGCAAAGAAAATTTAGTAGGTGGACCTTGTCTCCCTCGGGGTGCAGATTGTGTCTTAAGGGAGGGGAAAATGTGGACCACTTCTTTCTTCACTGTGACT
TTGCTTTTCAAGCGTGGGGGTGGTTGGCTGAGAGGCTGGGAATACATTTTTGTTTACCTAAGAAGATTGATGATTGGTTGATGGAAGGAATGAATACTGGAATTCAAGAC
GTTCATAATCTTGCCTGGAAATTAGCTGCAGTGCTACAAGATATTGCTTCACCTTCAATACTGAATACTTATGAAATGGAAAGGAGGCCGATAGCACAATTCAACACGGC
CCTTAGCATTAAGAACTTCAAAGCAGCCATGGAAGTACCTGCGGCTCTTGGTCTGGATCCAAAGATTGCAAACTCTGTGCACGAAGCAGTTAACCATGGCCTTGGTTCTG
TTTTATCATCCTCACTACAGAGCGCAGTTCTGGATGGAATTTTTAAGATAGGTCGTTTGCAGCTCTCAGATACTCTTCTAAATGATAAAAACCCCGTTGGTTCTTCAAGG
CTTGCAAAACTAAGACATATATTTGATGAGGGGAAGAGCCTTCAACTTCAGTTCCCTGCAGAGGATCTTGGTTTCAGGTACTCTGAAGGGGCACTTATTCCTGACAATAC
TATCAGTGGTAAAGAAGAACCTACTGGTCGTCGGAGACAGTATATCCCTTCTGCAGATCCAGGATCAAGGCTGCCTCATATGAACGTGAGGGTATTGGTCAGTGAGGACA
TTATTTCTACTCTTGATCTTGTATCTGGGGATAAAGTTGAATTCCTTCTCATAATAGCACCGCTACCGGAGTCCTACCGTCTTGCTTGCGCTGCTCTTAAGGTAGCTGAG
GAATTCAAAACTCCTGTTAGGGTATGCATTTTCTGGTCTGCTGATATCACCAGGATTGAGTCAAGGAGCAAGGAAGAACTAACTCCATGGGAGAGCTACATTGATGTCCA
AGAAATTTGGCAATCATCAACTTTACCATCATGGTGGGATATATGTCAGATGACTGACAAAGGAGCGATCCTAGTTCGGCCCGATGAGCATATTGCTTGGAGGAAGAAGT
CTGGCATTTGTGGTGATCCAAACACAGAACTGAGAAGAGTTTTTGCTACTCTGTTGAAGTGA
mRNA sequence
ATGGGATTGGACGGCTCTTCCAACTTCGCTAGGAGGTTTGGGGATTGGATCCCTCTACCACAGGAACACGACCCTTCTTACCAAATGGTTGTGGAGATTTACAATGTTTC
TGTGAACAAAGGCAAAACCATATTCCAATGTTGGAATTCTGAGAACCAAGTCTGGGACCTTGGTCTCAGACGAGGCCTTTTTGATTGGGAGCTCGTTGCTTGGATTGCCT
TGGTGGATAGGCTCTCGGCTGTGCAGAGGGGGGAGGGGATGGACAGGCTGCAATGGTCCATTGAGAAATCTGGTACCTTTTCAAGCAAATCTGTGTTTCTCCAGATGAAT
ATGGGTCAATCAACAGCAAATTTGCCTTTGATTAGTTTAATTTGGAAAGGTTATAGCCCGAAAAAAGTCAAGGTGTTTTTGTGGTCGTTGGCTTTTAGAAGTCTGAACAC
AGATGACAGGTTGCAAAGAAAATTTAGTAGGTGGACCTTGTCTCCCTCGGGGTGCAGATTGTGTCTTAAGGGAGGGGAAAATGTGGACCACTTCTTTCTTCACTGTGACT
TTGCTTTTCAAGCGTGGGGGTGGTTGGCTGAGAGGCTGGGAATACATTTTTGTTTACCTAAGAAGATTGATGATTGGTTGATGGAAGGAATGAATACTGGAATTCAAGAC
GTTCATAATCTTGCCTGGAAATTAGCTGCAGTGCTACAAGATATTGCTTCACCTTCAATACTGAATACTTATGAAATGGAAAGGAGGCCGATAGCACAATTCAACACGGC
CCTTAGCATTAAGAACTTCAAAGCAGCCATGGAAGTACCTGCGGCTCTTGGTCTGGATCCAAAGATTGCAAACTCTGTGCACGAAGCAGTTAACCATGGCCTTGGTTCTG
TTTTATCATCCTCACTACAGAGCGCAGTTCTGGATGGAATTTTTAAGATAGGTCGTTTGCAGCTCTCAGATACTCTTCTAAATGATAAAAACCCCGTTGGTTCTTCAAGG
CTTGCAAAACTAAGACATATATTTGATGAGGGGAAGAGCCTTCAACTTCAGTTCCCTGCAGAGGATCTTGGTTTCAGGTACTCTGAAGGGGCACTTATTCCTGACAATAC
TATCAGTGGTAAAGAAGAACCTACTGGTCGTCGGAGACAGTATATCCCTTCTGCAGATCCAGGATCAAGGCTGCCTCATATGAACGTGAGGGTATTGGTCAGTGAGGACA
TTATTTCTACTCTTGATCTTGTATCTGGGGATAAAGTTGAATTCCTTCTCATAATAGCACCGCTACCGGAGTCCTACCGTCTTGCTTGCGCTGCTCTTAAGGTAGCTGAG
GAATTCAAAACTCCTGTTAGGGTATGCATTTTCTGGTCTGCTGATATCACCAGGATTGAGTCAAGGAGCAAGGAAGAACTAACTCCATGGGAGAGCTACATTGATGTCCA
AGAAATTTGGCAATCATCAACTTTACCATCATGGTGGGATATATGTCAGATGACTGACAAAGGAGCGATCCTAGTTCGGCCCGATGAGCATATTGCTTGGAGGAAGAAGT
CTGGCATTTGTGGTGATCCAAACACAGAACTGAGAAGAGTTTTTGCTACTCTGTTGAAGTGA
Protein sequence
MGLDGSSNFARRFGDWIPLPQEHDPSYQMVVEIYNVSVNKGKTIFQCWNSENQVWDLGLRRGLFDWELVAWIALVDRLSAVQRGEGMDRLQWSIEKSGTFSSKSVFLQMN
MGQSTANLPLISLIWKGYSPKKVKVFLWSLAFRSLNTDDRLQRKFSRWTLSPSGCRLCLKGGENVDHFFLHCDFAFQAWGWLAERLGIHFCLPKKIDDWLMEGMNTGIQD
VHNLAWKLAAVLQDIASPSILNTYEMERRPIAQFNTALSIKNFKAAMEVPAALGLDPKIANSVHEAVNHGLGSVLSSSLQSAVLDGIFKIGRLQLSDTLLNDKNPVGSSR
LAKLRHIFDEGKSLQLQFPAEDLGFRYSEGALIPDNTISGKEEPTGRRRQYIPSADPGSRLPHMNVRVLVSEDIISTLDLVSGDKVEFLLIIAPLPESYRLACAALKVAE
EFKTPVRVCIFWSADITRIESRSKEELTPWESYIDVQEIWQSSTLPSWWDICQMTDKGAILVRPDEHIAWRKKSGICGDPNTELRRVFATLLK
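
The protein sequence above should correspond to a translation of the CDS. A minimal sketch for checking this, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base, is given below. The function name `translate` is illustrative only and is not taken from any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_at_terminator=True):
    """Translate a DNA coding sequence into a one-letter protein string.

    Unknown codons (for example, those containing N) become 'X'.
    A trailing partial codon is ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_terminator:
            break
        protein.append(aa)
    return "".join(protein)

if __name__ == "__main__":
    # First 51 bases of the CDS listed above.
    print(translate("ATGGGATTGGACGGCTCTTCCAACTTCGCTAGGAGGTTTGGGGATTGGATC"))
```

Applied to the first 51 bases of the CDS, this yields MGLDGSSNFARRFGDWI, which matches the start of the listed protein sequence.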