; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G13470 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G13470
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionMonooxygenase, FAD-binding protein
Genome locationClcChr08:24604264..24622352
RNA-Seq ExpressionClc08G13470
SyntenyClc08G13470
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0071949 - FAD binding (molecular function)
InterPro domainsIPR002938 - FAD-binding domain
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020364.1 2,6-dihydroxypyridine 3-monooxygenase [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0064.51Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND
        M++KKQKPKA+IVGGSIAGISCAHTL+KAGWEVQVL+K+TTPP GCSTGAGL LD LSQ+L+QSWLSRPELL+ESTSPL TEQNRAIDGESK  RILTND
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND

Query:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK
        EN NFRAVHWADLH LLY ELPP IFLWGHLF S+CIS DKTSVKI AKV+++ EIVEIVGDLLVAAD CLSSIRQ FLP+FKLRYSGYY WRGVFDFS+
Subjt:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK

Query:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD
         ENSE+ +NI KAYP++GK LY DLAS +H+ +FEVPKKKINW+W+VN+P+PQ+K RSMTMKVNEEMV++LH+Q ++IW+PE A +VRETKDPFIN IYD
Subjt:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD

Query:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST
        CDPLEQLVW NVVLVGEAAHP TPHC RSTNM++LDA++LG+CL KWG  DL  ALAEYQSLRLP+I  QVLHSR VGRIKQGL     + FDPNVA   
Subjt:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST

Query:  HNMQEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQK
         N+QE+QIRNTPF D++                                                                            +N   + 
Subjt:  HNMQEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQK

Query:  PKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRA
        PKAVI+GGSIAGISCAH L  AGW VQ+LEKS +PPT  STGAGLGLDPLSQ L+QSWLS+P+LLL+ST PLT +QN+A D E+K  RIL  D+N NFRA
Subjt:  PKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRA

Query:  ALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETV
        A WADLH LLY ELPPDIFLWGH F+S C S+DK+SVKI A +++TDEI+E+VGDLLV ADGCLSS+ QTFLPNF+LR    C  R V D+S+NENSET+
Subjt:  ALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETV

Query:  MNIHRAYPEISKC---------------VPKKKINWLWY---------ARSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKL
        ++I +AYP++ KC               +P KKINW+WY          +S+TMKV++ MVRK+HEQ +++WVPEFA+ ++ETK+PFIN IYD +PL++L
Subjt:  MNIHRAYPEISKC---------------VPKKKINWLWY---------ARSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKL

Query:  VWDNVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQ
        VWDNVVLVG+AAHPTTPH  RSTNM+ILDAAVLGKCL+KWG + L++AL EYQS+RLPV S QVLHSR  GR+KQGL LPD EPFDP VA   +N  ELQ
Subjt:  VWDNVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQ

Query:  IRNVPYLNDVPQI
         +N+P+ NDVPQ+
Subjt:  IRNVPYLNDVPQI

KAG8375488.1 hypothetical protein BUALT_Bualt10G0105000 [Buddleja alternifolia]6.8e-27651.54Show/hide
Query:  KKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENF
        K+   KAV+VGGSIAG+S AH L+ AGW+V VLEK+++PP GC+TGAGL LDPL+QKL+ SWL +P +L  +T PL  +QN+A +G++K   +LT DENF
Subjt:  KKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENF

Query:  NFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKEN
        NFRA +WADLHSLLYK LPP I  WGH+F+S  IS+DKT VK++ K++++G+IVEIVGDLL+AAD CLSSIRQ+F+P+ KLRYSGY  WRGV +FS  EN
Subjt:  NFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKEN

Query:  SETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDP
         ET++ ++KA+P++GKCLY DL S +H + +E+  K+INW+W+VN+P+PQLK  S+TMKV+++M++K+H+  + +W+PEL K++RETK PF+NVIYD DP
Subjt:  SETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDP

Query:  LEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATSTHNM
        L+++VW NVVL+G+AAHPTTPH  RSTNM+++D ++LG+CL KWG+ +L  AL EYQS+RLPV   QVL SRR+GRIKQGL     EPFDP +  S  + 
Subjt:  LEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATSTHNM

Query:  QEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQKPKA
        +E+Q +N  FL +                                                                 +  +L+ ++   +   + K KA
Subjt:  QEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQKPKA

Query:  VIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRAALW
        V++GGSIAGISCAH L  AGWDV VLEK+ T P+ C+TGAGLGLD +S +L++ WL +PELL   T PLT +  +A DG+ K    LT D+N NFRA  W
Subjt:  VIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRAALW

Query:  ADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETVMNI
        ADLHSLLY  L  ++ LWGH F+S CISDDK  VK++AK+++TDE +E+ GDLLVAADGCLSSIR+TFLP+ KLR    C  R V D+S N+NSET++ +
Subjt:  ADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETVMNI

Query:  HRAYPEISKC---------------VPKKKINWLWY---------ARSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKLVWD
         +AYP++ +C               +  ++INW+WY           S+T KV+ +M+ K++E  +++W+PE  K +R+T +PF+NAIYD DPL +L WD
Subjt:  HRAYPEISKC---------------VPKKKINWLWY---------ARSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKLVWD

Query:  NVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQIRN
        NVVL+G+AAHP TPH ARSTNM+ILDAAVLGKCL+KWG + L SAL EY+S+R+PV   QVL SRR GRIKQ L L D E FD ++    +    LQ+RN
Subjt:  NVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQIRN

Query:  VPYLNDVPQI
        +PY +D P I
Subjt:  VPYLNDVPQI

OMO99513.1 Monooxygenase, FAD-binding protein [Corchorus capsularis]4.0e-26851.04Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND
        M+EKK K KA+IVGGSIAG+SCAH L  AGWEV VLEK+  PP G  TGAGL LDPLSQ+L+ SWL  P LL ++T PL  +Q++A D  +K +  LT D
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND

Query:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK
        E FNFRA HWADLH LLY  LP  IF WGH F+S  IS DK SVK+KAKV++  EI+EI G+LLVAAD CLS IRQ+FLP+ KLRYSGY  WRGV +FS 
Subjt:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK

Query:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD
        KE+SET+  IRKAYP++GKCLY DL+  +H + +E+  +++NW++++N+P+P +K  S+TMKV+E+M+ ++ Q+ + +WVPEL ++++ETK+PF+N IYD
Subjt:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD

Query:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST
        CDPL QL W NVVL+G+AAHPTTPH  RSTNM++LDA++LG+CL KWG+ DL  AL EYQ++RLPV   QVLHSR +GRIKQ                  
Subjt:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST

Query:  HNMQEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQK
                                                                                    R IG I +             K K
Subjt:  HNMQEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQK

Query:  PKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRA
         KA+I+GGSIAG+SCAH LT  GW V VLEK+ +PPT    GAGLGLDPL+Q L+ SWL  P LL ++T PLT +QN+A D  +     LT D+  NFRA
Subjt:  PKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRA

Query:  ALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETV
        A WADLH LLY  LPPDIF WG+ F+S C S++K SVK+KAK+++TDE++E+ G+LLVAADGCLS IRQ FLP+ KLR    C  R V D++  E+SET+
Subjt:  ALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETV

Query:  MNIHRAYPEISKC---------------VPKKKINWLWYA---------RSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKL
          I +AYPE+ KC               +P K++NW++Y           S+TMKV+E+M+ ++ ++ + +WVPE  + ++ETK+PF+N +YD DPL+++
Subjt:  MNIHRAYPEISKC---------------VPKKKINWLWYA---------RSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKL

Query:  VWDNVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQ
         WDNVVLVG+AAHPTTPH  RSTNM+ILDAAVLGKCL+KWG + L SAL EYQS+RLPV S QVLHSR  GRIKQGL LP+ EPFDP  AT  ++  +LQ
Subjt:  VWDNVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQ

Query:  IRNVPYLNDVPQI
         +N+P+   VP +
Subjt:  IRNVPYLNDVPQI

RDX66069.1 Aurachin C monooxygenase/isomerase, partial [Mucuna pruriens]2.7e-20043.44Show/hide
Query:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN
        K KP+AVIVGGSIAGIS AHTL  AGW V  LEK+  PPAG  TGAGL LDPLS ++++SWL +P+LL   T PL  +QN+A D E K +  L  DE+ N
Subjt:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN

Query:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS
        FRA HW  LH LLY  LP  +FLWGHLFLS  +++DK  V +KAKV+E+G++VEIVGDLLVAAD CLSSIRQ +LP+FKLRYSGY  WRGVFDFS+ E  
Subjt:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS

Query:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL
        ET+ +++KAYP++GKCLY DL S +H   +E+  KK+NW+W+VN P+P++K+R   M                                           
Subjt:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL

Query:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATSTHNMQ
                  +GE                                                                                       
Subjt:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATSTHNMQ

Query:  EIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQKPKAV
                                                                                                      +KPKAV
Subjt:  EIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQKPKAV

Query:  IIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRAALWA
        I+GGSIAGIS AH LT+AGWDV VLEK+ +PP+   TGAGLGL+ LSQ+++ SWL  P+ L   T PLT +QN A D E K    LT D+N NF AA W 
Subjt:  IIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRAALWA

Query:  DLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETVMNIH
        DLH LL   L  ++FLWGH F+S  ++DDK SV +KAK++ET ++VE+VGDLLVAADGCLSSIRQ +LP+FKLR    C  R V D+SK ENSET+  I 
Subjt:  DLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETVMNIH

Query:  RAYPEISKC---------------VPKKKINWLWYAR---------SMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKLVWDN
        +AYP++ KC               +  KK+NW+WY           S+T KVN +M++K+H++ +++W+PE  K ++ET+DPFIN IYD DPL+KL WDN
Subjt:  RAYPEISKC---------------VPKKKINWLWYAR---------SMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKLVWDN

Query:  VVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQIRNV
        VVLVG+AAHPTTPHC RSTNM+ILDAAVLGKCL K+G + L+SAL EYQ +RLP  S QVLH+RR GRIKQGL LPD EPF+P  A    +  EL  RN 
Subjt:  VVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQIRNV

Query:  PYLNDVP
        P+ NDVP
Subjt:  PYLNDVP

XP_038886774.1 aurachin C monooxygenase/isomerase-like [Benincasa hispida]1.9e-20985.38Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQ--------NRAIDGESK
        MVEKK+KPKAVIVGGSIAGISCAHTLLKAGWEVQVLE+S+TPP GCSTGAGLALDPLSQ+LLQSWLSRPELLLE + PLATEQ          +IDGESK
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQ--------NRAIDGESK

Query:  KARILTNDENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGW
        +AR+L+NDENFNFRA HWADLHSLLY ELP  IFLWGHLFLSLCIS+DKTSVKIKA+VVESGEIVEI+GDLLVAAD CLSSIRQTFLPNFKLRYSGYYGW
Subjt:  KARILTNDENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGW

Query:  RGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKD
        RGVFDFS++EN ETVMNIRKAYPEIGKCLYMDLAS +HILLFE+PKKKINWVWFVNEPQPQLKARSM+MKVNE MVKKLHQQVD+IWVPELAK+V+ETKD
Subjt:  RGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKD

Query:  PFINVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPF
        PFINVIYDCDPLEQ+VW NVVLVGEAAHPTTPHCARSTNM LLDA +LGECLLKWGL+DLK ALAEYQSLRLPVIYAQVLHSR VGRIKQG T S  E F
Subjt:  PFINVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPF

Query:  DPNVATSTHNMQEIQIRNTPFLDN
        DP+ AT+T N  E+QIRNTPFLD+
Subjt:  DPNVATSTHNMQEIQIRNTPFLDN

TrEMBL top hitse value%identityAlignment
A0A0A0LLV2 FAD_binding_3 domain-containing protein6.3e-19579.9Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND
        MVEK+QKPKAVIVGGSIAGISCAHTL+KAGWEVQVL+KS +PP GCSTGAGL LDPLSQKLLQSW+SRPELLL+ST P+ TEQNRAI GE K  RILTND
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND

Query:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK
        ENFN+RA HWADLHSLLYKELP  IFLWGH FLSL IS+DKTSVKIKAKV ++ E+VEIVGDLLVAAD CLSSIR+TFLPN KLRYSGYY WRGVFDFSK
Subjt:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK

Query:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD
        KEN E VM ++K YPEIGKCLYMDLA  +HILLFE+P  KINWVWFVNE +P  KARSMTMKVN++MVK+LH++ DD+WVPELAK+++ETKDPFINVIYD
Subjt:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD

Query:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST
        CDPLEQ+VW NVVLVGEAAHPTTPHCARSTNMTL DASILGECL    L +LK ALAEYQSLRLP+++AQV HSR VGRIKQGLT   CEPFDPN+ T+T
Subjt:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST

Query:  HNMQEIQIRNTPF
         N+QE+QIRN PF
Subjt:  HNMQEIQIRNTPF

A0A1R3JXF1 Monooxygenase, FAD-binding protein1.9e-26851.04Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND
        M+EKK K KA+IVGGSIAG+SCAH L  AGWEV VLEK+  PP G  TGAGL LDPLSQ+L+ SWL  P LL ++T PL  +Q++A D  +K +  LT D
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND

Query:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK
        E FNFRA HWADLH LLY  LP  IF WGH F+S  IS DK SVK+KAKV++  EI+EI G+LLVAAD CLS IRQ+FLP+ KLRYSGY  WRGV +FS 
Subjt:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK

Query:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD
        KE+SET+  IRKAYP++GKCLY DL+  +H + +E+  +++NW++++N+P+P +K  S+TMKV+E+M+ ++ Q+ + +WVPEL ++++ETK+PF+N IYD
Subjt:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD

Query:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST
        CDPL QL W NVVL+G+AAHPTTPH  RSTNM++LDA++LG+CL KWG+ DL  AL EYQ++RLPV   QVLHSR +GRIKQ                  
Subjt:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST

Query:  HNMQEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQK
                                                                                    R IG I +             K K
Subjt:  HNMQEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQK

Query:  PKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRA
         KA+I+GGSIAG+SCAH LT  GW V VLEK+ +PPT    GAGLGLDPL+Q L+ SWL  P LL ++T PLT +QN+A D  +     LT D+  NFRA
Subjt:  PKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRA

Query:  ALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETV
        A WADLH LLY  LPPDIF WG+ F+S C S++K SVK+KAK+++TDE++E+ G+LLVAADGCLS IRQ FLP+ KLR    C  R V D++  E+SET+
Subjt:  ALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETV

Query:  MNIHRAYPEISKC---------------VPKKKINWLWYA---------RSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKL
          I +AYPE+ KC               +P K++NW++Y           S+TMKV+E+M+ ++ ++ + +WVPE  + ++ETK+PF+N +YD DPL+++
Subjt:  MNIHRAYPEISKC---------------VPKKKINWLWYA---------RSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKL

Query:  VWDNVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQ
         WDNVVLVG+AAHPTTPH  RSTNM+ILDAAVLGKCL+KWG + L SAL EYQS+RLPV S QVLHSR  GRIKQGL LP+ EPFDP  AT  ++  +LQ
Subjt:  VWDNVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQ

Query:  IRNVPYLNDVPQI
         +N+P+   VP +
Subjt:  IRNVPYLNDVPQI

A0A371EJK7 Aurachin C monooxygenase/isomerase (Fragment)1.3e-20043.44Show/hide
Query:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN
        K KP+AVIVGGSIAGIS AHTL  AGW V  LEK+  PPAG  TGAGL LDPLS ++++SWL +P+LL   T PL  +QN+A D E K +  L  DE+ N
Subjt:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN

Query:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS
        FRA HW  LH LLY  LP  +FLWGHLFLS  +++DK  V +KAKV+E+G++VEIVGDLLVAAD CLSSIRQ +LP+FKLRYSGY  WRGVFDFS+ E  
Subjt:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS

Query:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL
        ET+ +++KAYP++GKCLY DL S +H   +E+  KK+NW+W+VN P+P++K+R   M                                           
Subjt:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL

Query:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATSTHNMQ
                  +GE                                                                                       
Subjt:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATSTHNMQ

Query:  EIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQKPKAV
                                                                                                      +KPKAV
Subjt:  EIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEAILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQKPKAV

Query:  IIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRAALWA
        I+GGSIAGIS AH LT+AGWDV VLEK+ +PP+   TGAGLGL+ LSQ+++ SWL  P+ L   T PLT +QN A D E K    LT D+N NF AA W 
Subjt:  IIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRAALWA

Query:  DLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETVMNIH
        DLH LL   L  ++FLWGH F+S  ++DDK SV +KAK++ET ++VE+VGDLLVAADGCLSSIRQ +LP+FKLR    C  R V D+SK ENSET+  I 
Subjt:  DLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAADGCLSSIRQTFLPNFKLRL--LCLER-VFDYSKNENSETVMNIH

Query:  RAYPEISKC---------------VPKKKINWLWYAR---------SMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKLVWDN
        +AYP++ KC               +  KK+NW+WY           S+T KVN +M++K+H++ +++W+PE  K ++ET+DPFIN IYD DPL+KL WDN
Subjt:  RAYPEISKC---------------VPKKKINWLWYAR---------SMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPLKKLVWDN

Query:  VVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQIRNV
        VVLVG+AAHPTTPHC RSTNM+ILDAAVLGKCL K+G + L+SAL EYQ +RLP  S QVLH+RR GRIKQGL LPD EPF+P  A    +  EL  RN 
Subjt:  VVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQIRNV

Query:  PYLNDVP
        P+ NDVP
Subjt:  PYLNDVP

A0A5A7VGU5 2,6-dihydroxypyridine 3-monooxygenase-like1.0e-18979.28Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND
        MVEKK  PKAVIVGGSIAGISCA TL+KAGWEVQVL+KS +PP GCSTGAGL LDPLSQKL+QSW+SRPELLLEST PL TEQNRAIDGE K  RILT+D
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND

Query:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK
        ENFN+RA HWADLHSLLYKELP  IFLWGH FLSL IS+DKTSVK+KAKV ++ E+VEIVGDLLVAAD CLSSIR+TFLPN KLRYSGYY WRGVFDFSK
Subjt:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK

Query:  KENSETVMNI-RKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIY
        KEN E VM + +KAYPEIGKCLYMDLA  +HILLFE+PK KINWVWFVNE +PQ KARSMTMKVN +MVK+LH+QVDD+WVPEL K+++ETKDPFINVIY
Subjt:  KENSETVMNI-RKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIY

Query:  DCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLK-WGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVAT
        DCDPLEQ+VW NVVLVGEAAHPTTPHCARSTNMTLLDASILG+CL +   L +L+ ALAEYQ+LRLP+++AQVLHSR VG+IKQGLT S  EPFDP+VAT
Subjt:  DCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLK-WGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVAT

Query:  STHNMQEIQIRNTPF
        +T  +Q++QIRN PF
Subjt:  STHNMQEIQIRNTPF

A0A6J1KNQ1 uncharacterized protein LOC1114962242.3e-18975.71Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND
        M++KKQKPKA+IVGGSIAGISCAHTL+KAGWEV VL+K+ TPP GCSTGAGL LD LSQ+L+QSWLSRPELL+ESTSPL TEQNRAIDGESK  RILTND
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND

Query:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK
        EN NFRAVHWADLH LLY ELPP IFLWGHLFLS+CIS+DK SVKI AKV+++ EIVEIVGDLLVAAD CLSSIRQ FLPNFKLRYSGYY WRGVFDFS+
Subjt:  ENFNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSK

Query:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD
         ENSE+ +NI KAYP++GK LY DLAS +H+ +FEVPKKKINW+W+VN+P+PQ+K RSMTMKVNEEM ++LH+Q  +IWVPE A +VRETKDPFIN IYD
Subjt:  KENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYD

Query:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST
        CDPLEQLVW NVVLVGE+AHP TPHC RSTNM++LDA++LG+CL KWG  DL  ALAEYQSLRLP+I  QVLHSR VGRIKQGL     + FDPN+A   
Subjt:  CDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATST

Query:  HNMQEIQIRNTPFLDN--SGMQLI
         N+QE+QIRNTPF D+   G+ LI
Subjt:  HNMQEIQIRNTPFLDN--SGMQLI

SwissProt top hitse value%identityAlignment
H1ZZA4 Aurachin C monooxygenase/isomerase2.0e-1229.07Show/hide
Query:  EIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKAR
        E  GDLLV AD   S++R   L     RYSGY  WRGV D S+         +R+         Y   +    +    VP  +    WF     P+    
Subjt:  EIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKAR

Query:  SMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFI--NVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLA
            +   E++++        W   + +++  T    I    I+D  P+ Q V G  VL+G+AAHP TP+  +     + DA +L  CL      +L  A
Subjt:  SMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFI--NVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLA

Query:  LAEYQSLRLPVIYAQVLHSRRVGRIKQ
        LA YQ++R+      V  S R+G+I Q
Subjt:  LAEYQSLRLPVIYAQVLHSRRVGRIKQ

J4VWM7 FAD-dependent monooxygenase OpS42.1e-0923.88Show/hide
Query:  SNRDPLNLNKQKPKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGR
        S R+PL+L       V+IGG +AG+S A    + G    +LEK+   P     GAGL L P S RLL+ W    +L  ++  P T    R  DG     R
Subjt:  SNRDPLNLNKQKPKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPLSQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGR

Query:  ILTYDDNL--NFRAALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVV--------GDLLVAADGCLSSIRQTFLPNFKLR
           +D+ +   + A  W D+H     +L   +     H       D +T  ++++  ++TD +  ++        GD+++AADG  S  R    P+    
Subjt:  ILTYDDNL--NFRAALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVV--------GDLLVAADGCLSSIRQTFLPNFKLR

Query:  LLCLERVFDYSKNENSETVMNIHRAY--PEISKCVPKKKIN-WLW---YARSMTMKVNEEM-------------VRKLHEQVDEI------WVPEFAKFV
           L      + +     ++ +      PE++  V K  +N W+    +A + +++   E+               +    ++E+      W P   +F+
Subjt:  LLCLERVFDYSKNENSETVMNIHRAY--PEISKCVPKKKIN-WLW---YARSMTMKVNEEM-------------VRKLHEQVDEI------WVPEFAKFV

Query:  RETKDPFINAIYDYDPLKKLVWDN----VVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQK-WGAQGLKSALMEYQSLR
           K      +     L K  W++      + G++ HP  P+ A+  N A+ D AVLG+ L     A  +   L  YQ +R
Subjt:  RETKDPFINAIYDYDPLKKLVWDN----VVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQK-WGAQGLKSALMEYQSLR

Q3S4B7 3-hydroxybenzoate 6-hydroxylase4.6e-0925.98Show/hide
Query:  VIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND---ENFN--F
        ++ GG I G++ A  L++ G+ V+VLE++  P  G   GAG+ L P +     +     +    +        + AIDG S   RI T +   + F   +
Subjt:  VIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTND---ENFN--F

Query:  RAVHWADLH-SLL--YKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKE
          +H  D+H SLL   +E     FL      +L I  D+ SV +  +   + +     G  L+ AD   S +R+ F+ +   R +G+  +R V D     
Subjt:  RAVHWADLH-SLL--YKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKE

Query:  NSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKK---KINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIY
                +K +PE  +     +    +  L   P +   + N V   +  QP+   +    + ++E V+   Q +     P+  +++   K        
Subjt:  NSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKK---KINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIY

Query:  DCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRI
        D +P+ Q  +G V L+G+AAHPTT + A+   M + D   LGE  L+    D   A   YQ  R+      VL SR +GRI
Subjt:  DCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRI

Q88FY2 6-hydroxynicotinate 3-monooxygenase2.4e-1321.95Show/hide
Query:  KPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKK----ARILTNDEN
        + K  IVG  + G + A  L +AG++V+V E++   PA    GAG+ + P   K+ +      +L L  + P          G+             +  
Subjt:  KPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKK----ARILTNDEN

Query:  FNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEI--------VGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRG
          +  +H  DLH+L  + + P    +G               K   K+V+ G+ V +        V D+++ AD   S IR+  L      YSG+   R 
Subjt:  FNFRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEI--------VGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRG

Query:  VFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPF
        +            +N+ +       C+    +   H++++    K+  + +    P      +   +  ++E ++   +     + P + K++  T+   
Subjt:  VFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPF

Query:  INVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLR
           + + +PL     G +VL+G+A HP  PH A+   M + DA++L  CL + GL D + A A Y++ R
Subjt:  INVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLR

Q93NG3 2,6-dihydroxypyridine 3-monooxygenase1.8e-1323.68Show/hide
Query:  KAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDP-LSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFNFRA
        +  +VGGSI+G++ A  L  AG +V V E+S  P +G   G G+ + P L   LL+  +    + + S+S    E   A+ GE             ++R 
Subjt:  KAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDP-LSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFNFRA

Query:  VHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENSETV
          +  ++  LY+   P+ +      + L  S D  +V+++       E   ++G     AD   S +R+  L   +  Y+GY  WRGV      E ++ V
Subjt:  VHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENSETV

Query:  MNIRKAYPEIGKCLYMDLASASHILLFEVPKK------KINWVWFVNEPQ-PQLKARSMTMK------------VNEEMVKKLHQQVDDIWVPELAKIVR
         N         K  Y  L    H++ + +P +      ++N+ W+ N  + P L      ++            +N   +++ H + + ++ P    +V 
Subjt:  MNIRKAYPEIGKCLYMDLASASHILLFEVPKK------KINWVWFVNEPQ-PQLKARSMTMK------------VNEEMVKKLHQQVDDIWVPELAKIVR

Query:  ETKDPFINVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRV-GRIKQGLTFS
            PF+ V+ D   ++++V G V+L+G+AA    PH A        DA  L E   K    DL+ +L  +++ +L   +A +   +++  R++ G +F 
Subjt:  ETKDPFINVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRV-GRIKQGLTFS

Query:  TCEPFDPNVATSTHNMQE
          EP +P  A     + E
Subjt:  TCEPFDPNVATSTHNMQE

Arabidopsis top hitse value%identityAlignment
AT2G29720.1 FAD/NAD(P)-binding oxidoreductase family protein7.8e-1222.17Show/hide
Query:  QKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFNF
        ++ K VIVGG I G++ A  L + G    VLE++ +   G   GA L L     ++L +    P+L  +          +    E +  +   ND++   
Subjt:  QKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFNF

Query:  RAVHWADLHSLLYKELPPKIFLWGHLFLSL-CISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGV------FDF
        R V    L   L  +LPP+   +     S+   +N  T +++K       +    + ++++  D   S +  T++   + +Y GY  +RG+        F
Subjt:  RAVHWADLHSLLYKELPPKIFLWGHLFLSL-CISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGV------FDF

Query:  SKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVI
         +K N      +R  Y                     VP       WF+    P L  + M    +  +++K  +++   W  +L  ++  T D  I+  
Subjt:  SKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVI

Query:  YDCDPL-EQLVW---------GNVVLVGEAAHPTTPHCARSTNMTLLDASILGECL---LKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQ
            PL ++ +W         G VVLVG+A HP TP+  +     L D+ +L   L   +  G   ++ A+  Y+S R   ++   + +  VG++ Q
Subjt:  YDCDPL-EQLVW---------GNVVLVGEAAHPTTPHCARSTNMTLLDASILGECL---LKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQ

AT5G11330.1 FAD/NAD(P)-binding oxidoreductase family protein1.2e-14260.79Show/hide
Query:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN
        K+K KA+IVGGSIAG+SCAH+L  A W+V VLEKS+ PP G  TGAGL LDP ++++++SWL+ P+LL E T PL+ +QN+  D E K  RILT DE+F+
Subjt:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN

Query:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS
        FRA +W+D+ SLLY  LP  +FLWGH FLS  +S D+++VK+K  VVE+ E VEI GDLL+AAD CLSSIR+TFLP+FKLRYSGY  WRGVFDFS  ENS
Subjt:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS

Query:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL
        ETV  I+K YP++GKCLY DL   +H + +E+  KK+NW+W+VN+P+P LK+ S+T+KV++EM+ K+HQ+ + IW+PELA+++ ETKDPF+NVIYD DPL
Subjt:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL

Query:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGL
        E++ WGN+VLVG+AAHPTTPH  RSTNM++LDA +LG+CL   G  ++ L L EYQ +RLPV+  QVL++RR+GRIKQGL
Subjt:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGL

AT5G11330.2 FAD/NAD(P)-binding oxidoreductase family protein1.1e-13061.05Show/hide
Query:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN
        K+K KA+IVGGSIAG+SCAH+L  A W+V VLEKS+ PP G  TGAGL LDP ++++++SWL+ P+LL E T PL+ +QN+  D E K  RILT DE+F+
Subjt:  KQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFN

Query:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS
        FRA +W+D+ SLLY  LP  +FLWGH FLS  +S D+++VK+K  VVE+ E VEI GDLL+AAD CLSSIR+TFLP+FKLRYSGY  WRGVFDFS  ENS
Subjt:  FRAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENS

Query:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL
        ETV  I+K YP++GKCLY DL   +H + +E+  KK+NW+W+VN+P+P LK+ S+T+KV++EM+ K+HQ+ + IW+PELA+++ ETKDPF+NVIYD DPL
Subjt:  ETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPL

Query:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWG
        E++ WGN+VLVG+AAHPTTPH  RSTNM++LDA +LG+CL   G
Subjt:  EQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDASILGECLLKWG

AT5G67030.1 zeaxanthin epoxidase (ZEP) (ABA1)8.1e-0923.1Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTP---------PAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGES
        + EKK+K + ++ GG I G+  A    K G++V V EK  +          P    + A  AL+ +        +   E ++E+        N  +DG S
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTP---------PAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGES

Query:  ----KKARILTNDENFNF---RAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKL
             K    T   +      R +    L  +L + +   +       +    S DK +V     V+E+G+  E  GDLLV AD   S +R       + 
Subjt:  ----KKARILTNDENFNF---RAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKL

Query:  RYSGYYGWRGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELA
         YSGY  + G+ DF           I      +G  ++  L    + +  +V   K+ W  F  EP     A           +KK   ++ D W   + 
Subjt:  RYSGYYGWRGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELA

Query:  KIVRETKDPFI--NVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDA
         ++  T++  I    IYD  P      G V L+G++ H   P+  +   M + D+
Subjt:  KIVRETKDPFI--NVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDA

AT5G67030.2 zeaxanthin epoxidase (ZEP) (ABA1)8.1e-0923.1Show/hide
Query:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTP---------PAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGES
        + EKK+K + ++ GG I G+  A    K G++V V EK  +          P    + A  AL+ +        +   E ++E+        N  +DG S
Subjt:  MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTP---------PAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGES

Query:  ----KKARILTNDENFNF---RAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKL
             K    T   +      R +    L  +L + +   +       +    S DK +V     V+E+G+  E  GDLLV AD   S +R       + 
Subjt:  ----KKARILTNDENFNF---RAVHWADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKL

Query:  RYSGYYGWRGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELA
         YSGY  + G+ DF           I      +G  ++  L    + +  +V   K+ W  F  EP     A           +KK   ++ D W   + 
Subjt:  RYSGYYGWRGVFDFSKKENSETVMNIRKAYPEIGKCLYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELA

Query:  KIVRETKDPFI--NVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDA
         ++  T++  I    IYD  P      G V L+G++ H   P+  +   M + D+
Subjt:  KIVRETKDPFI--NVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARSTNMTLLDA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGAGAAGAAGCAGAAGCCCAAAGCAGTGATTGTAGGAGGGAGCATTGCCGGAATATCTTGCGCTCACACTCTGCTTAAAGCAGGTTGGGAAGTTCAAGTCCTCGA
AAAATCCACCACGCCGCCGGCCGGTTGTTCCACCGGCGCCGGACTCGCCCTTGATCCTCTCTCTCAGAAGCTTCTCCAGTCGTGGCTTTCCCGGCCCGAACTTCTCCTCG
AATCCACTTCGCCTCTTGCGACGGAGCAGAATCGAGCAATCGATGGAGAAAGCAAGAAGGCACGAATTTTGACTAATGATGAGAATTTCAATTTTAGAGCAGTACATTGG
GCTGACCTCCATAGCCTTCTATACAAAGAGCTTCCACCTAAAATATTTCTTTGGGGTCATCTTTTCCTTTCACTTTGCATTTCTAATGACAAAACATCTGTGAAAATCAA
AGCAAAAGTTGTGGAAAGTGGTGAAATTGTTGAAATAGTTGGAGATTTGCTTGTTGCAGCTGATGAGTGTCTCTCTTCCATACGTCAAACATTTCTTCCTAACTTCAAAT
TAAGATATTCAGGATATTATGGTTGGAGAGGGGTTTTTGATTTCTCAAAGAAAGAGAATTCAGAGACAGTGATGAACATTCGTAAGGCATATCCTGAAATTGGTAAATGT
TTGTACATGGATTTGGCTTCTGCTTCTCATATTTTACTGTTTGAGGTTCCAAAGAAGAAAATCAATTGGGTTTGGTTTGTTAATGAACCACAACCCCAACTCAAGGCTAG
ATCAATGACTATGAAAGTAAATGAAGAAATGGTGAAAAAATTGCACCAACAAGTGGATGACATTTGGGTCCCTGAGCTTGCAAAAATCGTTAGAGAAACAAAAGATCCTT
TCATCAATGTCATCTATGATTGTGATCCATTAGAACAACTAGTTTGGGGCAATGTGGTTTTAGTTGGAGAAGCAGCTCACCCAACGACTCCTCATTGTGCAAGAAGCACA
AATATGACATTATTAGATGCATCTATTTTAGGCGAATGTCTCCTAAAGTGGGGACTAGTTGATCTAAAATTAGCCCTTGCAGAGTATCAATCTCTTCGCTTACCTGTCAT
TTATGCACAAGTCCTGCATTCTCGACGCGTCGGTCGAATAAAGCAAGGTTTGACATTTTCCACCTGCGAACCCTTTGACCCAAATGTAGCTACTTCTACACATAATATGC
AAGAGATTCAAATTAGAAACACACCTTTTCTTGATAATTCAGGGATGCAGCTCATCCAACTACTCCACATTGTGCAAGAAGCACGAATATGGCAAATATTAGATGAAGCT
ATTTTGGGCAAATGCCTTCAAAAGTGGGAAGCACAAAATCTAAAATTAGCCCTCGTTGAGTATCAGTCTTTTCGGCTATCTGTCATTTCAGTACAAATCCTGTATTCTCG
ACGTATCGGTCAAATAAGGCAAGTTTTGACACTTTCCAACCGCGATCCTTTGAACCTGAATAAGCAGAAGCCCAAAGCCGTGATTATAGGGGGAAGCATCGCCGGAATAT
CTTGCGCTCACACCCTTACTATAGCCGGCTGGGACGTTCAGGTTCTCGAAAAATCCCCCACGCCGCCGACCGACTGTTCCACCGGCGCTGGACTCGGCCTCGATCCTCTC
TCTCAGAGGCTCCTCCAGTCGTGGCTTTCCCGTCCTGAACTTCTCCTCGAATCCACTTCGCCCCTTACGACGGAGCAGAATCGAGCAATTGATGGAGAAAGCAAGAGGGG
ACGGATCTTGACTTATGATGATAATCTAAATTTCAGAGCAGCACTTTGGGCTGATCTCCATAGCCTTTTATACAAAGAGCTTCCACCTGATATATTTCTTTGGGGTCATC
ATTTTGTTTCTCTTTGCATTTCTGATGACAAAACGTCTGTGAAAATCAAAGCAAAAATTGTAGAAACTGACGAAATCGTTGAAGTAGTTGGGGATTTGCTTGTTGCTGCT
GATGGCTGTCTCTCTTCCATTCGTCAAACATTTCTTCCTAACTTCAAGTTAAGGTTATTGTGCTTGGAGAGGGTTTTTGATTACTCAAAGAATGAGAATTCGGAGACAGT
AATGAACATCCATAGAGCGTATCCTGAAATAAGTAAATGCGTTCCAAAGAAGAAGATCAATTGGCTTTGGTATGCTAGATCAATGACTATGAAAGTAAACGAGGAGATGG
TGAGAAAATTGCACGAACAAGTGGATGAGATTTGGGTCCCTGAATTTGCAAAATTTGTTAGAGAAACAAAAGATCCATTCATCAATGCAATCTATGATTATGATCCATTA
AAAAAACTAGTTTGGGACAATGTCGTATTGGTCGGAGAAGCCGCTCATCCAACGACTCCTCATTGTGCAAGAAGCACGAATATGGCAATATTAGATGCAGCTGTTTTGGG
CAAATGCCTCCAAAAGTGGGGAGCACAAGGTCTAAAATCAGCCCTTATGGAGTATCAATCTCTTAGACTTCCTGTGGTATCTGCTCAAGTATTGCATTCTCGGCGCGCCG
GTCGAATAAAGCAAGGTTTAACACTTCCTGACTGCGAACCCTTCGATCCAAATGTAGCCACTTCGACACAAAACTTTCCAGAGCTTCAAATTAGAAATGTACCTTACCTT
AACGATGTTCCGCAAATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTGAGAAGAAGCAGAAGCCCAAAGCAGTGATTGTAGGAGGGAGCATTGCCGGAATATCTTGCGCTCACACTCTGCTTAAAGCAGGTTGGGAAGTTCAAGTCCTCGA
AAAATCCACCACGCCGCCGGCCGGTTGTTCCACCGGCGCCGGACTCGCCCTTGATCCTCTCTCTCAGAAGCTTCTCCAGTCGTGGCTTTCCCGGCCCGAACTTCTCCTCG
AATCCACTTCGCCTCTTGCGACGGAGCAGAATCGAGCAATCGATGGAGAAAGCAAGAAGGCACGAATTTTGACTAATGATGAGAATTTCAATTTTAGAGCAGTACATTGG
GCTGACCTCCATAGCCTTCTATACAAAGAGCTTCCACCTAAAATATTTCTTTGGGGTCATCTTTTCCTTTCACTTTGCATTTCTAATGACAAAACATCTGTGAAAATCAA
AGCAAAAGTTGTGGAAAGTGGTGAAATTGTTGAAATAGTTGGAGATTTGCTTGTTGCAGCTGATGAGTGTCTCTCTTCCATACGTCAAACATTTCTTCCTAACTTCAAAT
TAAGATATTCAGGATATTATGGTTGGAGAGGGGTTTTTGATTTCTCAAAGAAAGAGAATTCAGAGACAGTGATGAACATTCGTAAGGCATATCCTGAAATTGGTAAATGT
TTGTACATGGATTTGGCTTCTGCTTCTCATATTTTACTGTTTGAGGTTCCAAAGAAGAAAATCAATTGGGTTTGGTTTGTTAATGAACCACAACCCCAACTCAAGGCTAG
ATCAATGACTATGAAAGTAAATGAAGAAATGGTGAAAAAATTGCACCAACAAGTGGATGACATTTGGGTCCCTGAGCTTGCAAAAATCGTTAGAGAAACAAAAGATCCTT
TCATCAATGTCATCTATGATTGTGATCCATTAGAACAACTAGTTTGGGGCAATGTGGTTTTAGTTGGAGAAGCAGCTCACCCAACGACTCCTCATTGTGCAAGAAGCACA
AATATGACATTATTAGATGCATCTATTTTAGGCGAATGTCTCCTAAAGTGGGGACTAGTTGATCTAAAATTAGCCCTTGCAGAGTATCAATCTCTTCGCTTACCTGTCAT
TTATGCACAAGTCCTGCATTCTCGACGCGTCGGTCGAATAAAGCAAGGTTTGACATTTTCCACCTGCGAACCCTTTGACCCAAATGTAGCTACTTCTACACATAATATGC
AAGAGATTCAAATTAGAAACACACCTTTTCTTGATAATTCAGGGATGCAGCTCATCCAACTACTCCACATTGTGCAAGAAGCACGAATATGGCAAATATTAGATGAAGCT
ATTTTGGGCAAATGCCTTCAAAAGTGGGAAGCACAAAATCTAAAATTAGCCCTCGTTGAGTATCAGTCTTTTCGGCTATCTGTCATTTCAGTACAAATCCTGTATTCTCG
ACGTATCGGTCAAATAAGGCAAGTTTTGACACTTTCCAACCGCGATCCTTTGAACCTGAATAAGCAGAAGCCCAAAGCCGTGATTATAGGGGGAAGCATCGCCGGAATAT
CTTGCGCTCACACCCTTACTATAGCCGGCTGGGACGTTCAGGTTCTCGAAAAATCCCCCACGCCGCCGACCGACTGTTCCACCGGCGCTGGACTCGGCCTCGATCCTCTC
TCTCAGAGGCTCCTCCAGTCGTGGCTTTCCCGTCCTGAACTTCTCCTCGAATCCACTTCGCCCCTTACGACGGAGCAGAATCGAGCAATTGATGGAGAAAGCAAGAGGGG
ACGGATCTTGACTTATGATGATAATCTAAATTTCAGAGCAGCACTTTGGGCTGATCTCCATAGCCTTTTATACAAAGAGCTTCCACCTGATATATTTCTTTGGGGTCATC
ATTTTGTTTCTCTTTGCATTTCTGATGACAAAACGTCTGTGAAAATCAAAGCAAAAATTGTAGAAACTGACGAAATCGTTGAAGTAGTTGGGGATTTGCTTGTTGCTGCT
GATGGCTGTCTCTCTTCCATTCGTCAAACATTTCTTCCTAACTTCAAGTTAAGGTTATTGTGCTTGGAGAGGGTTTTTGATTACTCAAAGAATGAGAATTCGGAGACAGT
AATGAACATCCATAGAGCGTATCCTGAAATAAGTAAATGCGTTCCAAAGAAGAAGATCAATTGGCTTTGGTATGCTAGATCAATGACTATGAAAGTAAACGAGGAGATGG
TGAGAAAATTGCACGAACAAGTGGATGAGATTTGGGTCCCTGAATTTGCAAAATTTGTTAGAGAAACAAAAGATCCATTCATCAATGCAATCTATGATTATGATCCATTA
AAAAAACTAGTTTGGGACAATGTCGTATTGGTCGGAGAAGCCGCTCATCCAACGACTCCTCATTGTGCAAGAAGCACGAATATGGCAATATTAGATGCAGCTGTTTTGGG
CAAATGCCTCCAAAAGTGGGGAGCACAAGGTCTAAAATCAGCCCTTATGGAGTATCAATCTCTTAGACTTCCTGTGGTATCTGCTCAAGTATTGCATTCTCGGCGCGCCG
GTCGAATAAAGCAAGGTTTAACACTTCCTGACTGCGAACCCTTCGATCCAAATGTAGCCACTTCGACACAAAACTTTCCAGAGCTTCAAATTAGAAATGTACCTTACCTT
AACGATGTTCCGCAAATATAG
Protein sequenceShow/hide protein sequence
MVEKKQKPKAVIVGGSIAGISCAHTLLKAGWEVQVLEKSTTPPAGCSTGAGLALDPLSQKLLQSWLSRPELLLESTSPLATEQNRAIDGESKKARILTNDENFNFRAVHW
ADLHSLLYKELPPKIFLWGHLFLSLCISNDKTSVKIKAKVVESGEIVEIVGDLLVAADECLSSIRQTFLPNFKLRYSGYYGWRGVFDFSKKENSETVMNIRKAYPEIGKC
LYMDLASASHILLFEVPKKKINWVWFVNEPQPQLKARSMTMKVNEEMVKKLHQQVDDIWVPELAKIVRETKDPFINVIYDCDPLEQLVWGNVVLVGEAAHPTTPHCARST
NMTLLDASILGECLLKWGLVDLKLALAEYQSLRLPVIYAQVLHSRRVGRIKQGLTFSTCEPFDPNVATSTHNMQEIQIRNTPFLDNSGMQLIQLLHIVQEARIWQILDEA
ILGKCLQKWEAQNLKLALVEYQSFRLSVISVQILYSRRIGQIRQVLTLSNRDPLNLNKQKPKAVIIGGSIAGISCAHTLTIAGWDVQVLEKSPTPPTDCSTGAGLGLDPL
SQRLLQSWLSRPELLLESTSPLTTEQNRAIDGESKRGRILTYDDNLNFRAALWADLHSLLYKELPPDIFLWGHHFVSLCISDDKTSVKIKAKIVETDEIVEVVGDLLVAA
DGCLSSIRQTFLPNFKLRLLCLERVFDYSKNENSETVMNIHRAYPEISKCVPKKKINWLWYARSMTMKVNEEMVRKLHEQVDEIWVPEFAKFVRETKDPFINAIYDYDPL
KKLVWDNVVLVGEAAHPTTPHCARSTNMAILDAAVLGKCLQKWGAQGLKSALMEYQSLRLPVVSAQVLHSRRAGRIKQGLTLPDCEPFDPNVATSTQNFPELQIRNVPYL
NDVPQI