; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034266 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034266
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationchr3:5831932..5836629
RNA-Seq ExpressionLag0034266
SyntenyLag0034266
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050082.1 protein SRG1-like [Cucumis melo var. makuwa]1.7e-14775.21Show/hide
Query:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK
        +A +S   AMA  NPSG VQDVASKGEVPERYIHKESDRGA +APLM A +IDI LLSSSS SGPEL+K RHGLQSWGCF A NHGM+SE L+EVRQ+ K
Subjt:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK

Query:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF
        +FF L MEEKLK  +EE + EGYGN MILSN Q+LDWTDRL LT+YP +SRR KYWPT P+RFREV+ EYT NV+ + EKILKAMARSLDLDE+SF+N++
Subjt:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF

Query:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT
         +  KL A FNFYP C NPDLVLG+KPH+DGS+ITILLQDKEVEGLQ +K NEW+NAPIVPDALLV VGDQ EITSNGIFKS VHRVLTN+ERERISLA 
Subjt:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT

Query:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
         YLP SEKEIEPLE+LINE++PRLYK+VKNF  LY++YYQ+GQRP+EAARI
Subjt:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

XP_022134811.1 uncharacterized protein LOC111006992 [Momordica charantia]3.1e-14978.36Show/hide
Query:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE
        MAG NP+G+VQDVASKGEVPERYIHKE DRGALDAPLMEA +IDI LLSS SN+GPEL+K RHGL SWGCF A NHGMS E LEEVRQV K FFALPME+
Subjt:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE

Query:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY
        KLK SREED  EGYGN MI SN Q+LDWTDRL LT+ PEESRR KYWPT PERFREV++EYT NV+ L EKILKAMARSLDLDENSF+N++    +L A 
Subjt:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY

Query:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE
        FNFY  C NP+LVLG+KPH+DGS+ITILLQDKEVEGLQ LKGNEW+NAPI+PDALLV VGDQGEITSNGIFKS VHRVLTN+ERERISLA  YLP  +KE
Subjt:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE

Query:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        IEPLEKLINET PRLY+TVKNF  L++QYYQ+GQRP EAA+I
Subjt:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

XP_022960723.1 protein SRG1-like isoform X1 [Cucurbita moschata]3.4e-14875.5Show/hide
Query:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK
        ++  SA  AMAG NPSG VQDVASKGEVPERYIHKESDRGA DAPLM A +ID+ LLSSSS SGPEL+K RHGLQSWGCF   NHGMS+E L+E+R++ K
Subjt:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK

Query:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF
        +FF LPMEEK K SREED+ EGYGN MILSN Q+LDWTDRL LT+YP+ES R KYWPT PERFR V++EYT NV+ L EKILKAMA SLDL+E+SF+ ++
Subjt:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF

Query:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT
         +  KL A FNFYP C NPDLVLG+KPH+DGS+ITILLQD+EVEGLQ L GNEWFNAPIVP ALL+ VGDQ EITSNGIFKS VHRVLTN+ERERISLA 
Subjt:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT

Query:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
         YLP  EKEIEPLEKLI+ETRPRLYKTVKNF  LY+QYYQ+GQRP+EAARI
Subjt:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

XP_022987763.1 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X2 [Cucurbita maxima]3.8e-14776.9Show/hide
Query:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE
        MAG NPSG VQDVASKGEVPERYIH ESDRGALDAPLM A +ID+ LLSS S SGPEL+K RHGLQSWGCF   NHGMS+E L+EVR++ K+FF LPMEE
Subjt:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE

Query:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY
        K K SREE++ EGYGN MILSN Q+LDWTDRL LT+YP+ES R KYWPT PERFREV++EYT NV+ L EKILKAMA SLDL+E+SF+ ++ +  KL A 
Subjt:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY

Query:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE
        FNFYP C NPDLVLG+KPH+DGS+ITILLQD+EVEGLQ L GNEWFNAPIVP ALLV VGDQ EITSNGIFKS VHRVLTN+ERERISLA  YLP SEKE
Subjt:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE

Query:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        IEPLEKLI+ETRPRLYK+VKNF  LY+QYYQ+GQRP+EAARI
Subjt:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

XP_023516521.1 protein SRG1-like isoform X1 [Cucurbita pepo subsp. pepo]2.5e-14676.61Show/hide
Query:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE
        MAG NPSG VQ VASKGEVPERYIHKESDRGA DAPLM A +ID+ LLSSSS SGPEL+K RHGLQSWGCF   NHGMS+E L+EVR++ K+FF LPMEE
Subjt:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE

Query:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY
        K K SREE++ EGYGN MILSN Q+LDWTDRL LT+YP+ES R KYWPT PERFREV++EYT NV+ L EKILKAMA SLDL+E+SF+ ++ +  KL A 
Subjt:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY

Query:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE
        FNFYP C NPDLVLG+KPH+DGS+ITILLQD+EVEGLQ L GNEWFNAPIVP ALLV VGDQ EITSNGIFKS VHRVLTN+ERERISLA  YLP  +KE
Subjt:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE

Query:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        IEPLEKLI+ETRPRLYKTVKNF  LY+QYYQ+GQRP+EAARI
Subjt:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

TrEMBL top hitse value%identityAlignment
A0A5A7U2M0 Protein SRG1-like8.2e-14875.21Show/hide
Query:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK
        +A +S   AMA  NPSG VQDVASKGEVPERYIHKESDRGA +APLM A +IDI LLSSSS SGPEL+K RHGLQSWGCF A NHGM+SE L+EVRQ+ K
Subjt:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK

Query:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF
        +FF L MEEKLK  +EE + EGYGN MILSN Q+LDWTDRL LT+YP +SRR KYWPT P+RFREV+ EYT NV+ + EKILKAMARSLDLDE+SF+N++
Subjt:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF

Query:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT
         +  KL A FNFYP C NPDLVLG+KPH+DGS+ITILLQDKEVEGLQ +K NEW+NAPIVPDALLV VGDQ EITSNGIFKS VHRVLTN+ERERISLA 
Subjt:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT

Query:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
         YLP SEKEIEPLE+LINE++PRLYK+VKNF  LY++YYQ+GQRP+EAARI
Subjt:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

A0A5D3CIZ6 Protein SRG1-like7.7e-14674.36Show/hide
Query:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK
        +A +S   AMA  NPSG VQDVASKGEVPERYIHKESDRGA +APLM A +IDI LLSSSS SGPEL+K RHGLQSWGCF A NHGM+SE L+EVRQ+ K
Subjt:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK

Query:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF
        +FF L MEEKLK  +EE + EGYGN MILSN Q+LDWTDRL LT+YP +SRR KYWP  P+RFREV+ EYT NV+ + EKILKAMARSLDLDE+SF+N++
Subjt:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF

Query:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT
         +  KL A FNFYP C NPDLVLG+KPH+DGS+ITILLQDKEVEGLQ +K NEW+NAPIV DALLV VGDQ EITSNGIFKS VHRVLTN+ERERISLA 
Subjt:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT

Query:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
         YLP SEKEIEPLE+LINE++PRLYK+VKNF  LY++YYQ+G+RP+EAARI
Subjt:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

A0A6J1BYU1 uncharacterized protein LOC1110069921.5e-14978.36Show/hide
Query:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE
        MAG NP+G+VQDVASKGEVPERYIHKE DRGALDAPLMEA +IDI LLSS SN+GPEL+K RHGL SWGCF A NHGMS E LEEVRQV K FFALPME+
Subjt:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE

Query:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY
        KLK SREED  EGYGN MI SN Q+LDWTDRL LT+ PEESRR KYWPT PERFREV++EYT NV+ L EKILKAMARSLDLDENSF+N++    +L A 
Subjt:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY

Query:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE
        FNFY  C NP+LVLG+KPH+DGS+ITILLQDKEVEGLQ LKGNEW+NAPI+PDALLV VGDQGEITSNGIFKS VHRVLTN+ERERISLA  YLP  +KE
Subjt:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE

Query:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        IEPLEKLINET PRLY+TVKNF  L++QYYQ+GQRP EAA+I
Subjt:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

A0A6J1H9T6 protein SRG1-like isoform X11.7e-14875.5Show/hide
Query:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK
        ++  SA  AMAG NPSG VQDVASKGEVPERYIHKESDRGA DAPLM A +ID+ LLSSSS SGPEL+K RHGLQSWGCF   NHGMS+E L+E+R++ K
Subjt:  SATISATSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMK

Query:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF
        +FF LPMEEK K SREED+ EGYGN MILSN Q+LDWTDRL LT+YP+ES R KYWPT PERFR V++EYT NV+ L EKILKAMA SLDL+E+SF+ ++
Subjt:  EFFALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF

Query:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT
         +  KL A FNFYP C NPDLVLG+KPH+DGS+ITILLQD+EVEGLQ L GNEWFNAPIVP ALL+ VGDQ EITSNGIFKS VHRVLTN+ERERISLA 
Subjt:  VDGYKL-AYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLAT

Query:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
         YLP  EKEIEPLEKLI+ETRPRLYKTVKNF  LY+QYYQ+GQRP+EAARI
Subjt:  LYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

A0A6J1JKD6 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X21.8e-14776.9Show/hide
Query:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE
        MAG NPSG VQDVASKGEVPERYIH ESDRGALDAPLM A +ID+ LLSS S SGPEL+K RHGLQSWGCF   NHGMS+E L+EVR++ K+FF LPMEE
Subjt:  MAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE

Query:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY
        K K SREE++ EGYGN MILSN Q+LDWTDRL LT+YP+ES R KYWPT PERFREV++EYT NV+ L EKILKAMA SLDL+E+SF+ ++ +  KL A 
Subjt:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AY

Query:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE
        FNFYP C NPDLVLG+KPH+DGS+ITILLQD+EVEGLQ L GNEWFNAPIVP ALLV VGDQ EITSNGIFKS VHRVLTN+ERERISLA  YLP SEKE
Subjt:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE

Query:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        IEPLEKLI+ETRPRLYK+VKNF  LY+QYYQ+GQRP+EAARI
Subjt:  IEPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

SwissProt top hitse value%identityAlignment
D4N501 Probable 2-oxoglutarate/Fe(II)-dependent dioxygenase2.4e-5134.63Show/hide
Query:  AVQDVA--SKGEVPERYI-HKESDRGALDAPLME----ASIIDIDLLSSSS--NSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPME
        +VQ++A  +  E+P RYI   E+ +  + A +++      +IDI+ L SS       EL +     + WG F   NHG+ + L++ V+  ++ FF L M 
Subjt:  AVQDVA--SKGEVPERYI-HKESDRGALDAPLME----ASIIDIDLLSSSS--NSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPME

Query:  EKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLD--ENSFVNE-FVDGYK
        EK+K  +++ D EG+G A + S DQ LDW D   +   P   R+   +   P   RE I  Y+  ++ L   + + M ++L +   E   ++E F D  +
Subjt:  EKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLD--ENSFVNE-FVDGYK

Query:  LAYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHS
        +   N+YPPCP P+L +GL PHSD   +TILLQ  EVEGLQI     W +   +P+A +V VGD  EI +NG+++SV HR + N+ +ER+S+AT + P+ 
Subjt:  LAYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHS

Query:  EKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQK
        E EI P+  LI    P L+++   +  L  +++ +
Subjt:  EKEIEPLEKLINETRPRLYKTVKNFASLYYQYYQK

D4N502 Codeine O-demethylase8.7e-5438.8Show/hide
Query:  AVQDVA--SKGEVPERY-IHKESDRGALDAPLME---ASIIDI-DLLSSSSNSGP-ELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE
        +VQ++A  +  E+P RY    ES    + A + +     +ID+ +LLS     G  EL K     + WG F   NHG+ + L++ ++  +K FF LPM E
Subjt:  AVQDVA--SKGEVPERY-IHKESDRGALDAPLME---ASIIDI-DLLSSSSNSGP-ELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEE

Query:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDE-NSFVNEFVDGYKLAY
        K K  +++ D EG+G   I S DQ LDWT+  ++   P   R+   +P  P  FRE +  Y   ++ L   + + + +SL L E     + F DG +   
Subjt:  KLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDE-NSFVNEFVDGYKLAY

Query:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE
         N+YPPCP P+LVLGL  HSD S +TILLQ  EVEGLQI K   W +   +PDA +V VGD  EI +NGI++SV HR + N+ +ER+S+AT +    E E
Subjt:  FNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKE

Query:  IEPLEKLINETRPRLYK
        I P+  L+    P L+K
Subjt:  IEPLEKLINETRPRLYK

O80449 Jasmonate-induced oxygenase 41.9e-6139.18Show/hide
Query:  PSGAVQDVASKG--EVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSG-PE-LQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEK
        P  +VQ ++  G   VP RY+     R   +    +A  I+I +L  +   G PE L+  R   + WG F   NHG++  L+E VR   +EFF LP+EEK
Subjt:  PSGAVQDVASKG--EVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSG-PE-LQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEK

Query:  LKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKLA---
         K +   D  EGYG+ + +  D  LDW+D   L   P   R    WP+ P + RE+I +Y E VR LCE++ + ++ SL L  N  +     G K+    
Subjt:  LKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKLA---

Query:  YFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEK
          NFYP CP P L LGL  HSD   ITILL D++V GLQ+ +G+ W     VP+AL+V +GDQ +I SNGI+KSV H+V+ N+  ER+SLA  Y P S+ 
Subjt:  YFNFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEK

Query:  EIEPLEKLINETRPRLYKTVK--NFASLYYQYYQKGQRPIEA
         + P+E+L+   RP LYK ++   + SL  Q    G+  +++
Subjt:  EIEPLEKLINETRPRLYKTVK--NFASLYYQYYQKGQRPIEA

Q39224 Protein SRG15.8e-5835.76Show/hide
Query:  VPERYIHKESDRGALDAPL---MEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKLKCSREEDDTEGYG
        VP RY+  + D+  +D      +E  IID+  L SS+    E++K     + WG F   NHG+ S  L++V+  +++FF LPMEEK K  +  D+ EG+G
Subjt:  VPERYIHKESDRGALDAPL---MEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKLKCSREEDDTEGYG

Query:  NAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF--VDGYKLAYFNFYPPCPNPDLVL
         A ++S DQ LDW D    T+ P E R+   +P  P  FR+ +  Y+  V+++ + ++  MAR+L++        F  VD  +    N+YPPCP PD V+
Subjt:  NAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEF--VDGYKLAYFNFYPPCPNPDLVL

Query:  GLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIEPLEKLINETRPR
        GL PHSD   +T+L+Q  +VEGLQI K  +W     +P+A +V +GD  EI +NG ++S+ HR + N+E+ER+S+AT +     KE+ P + L+   +  
Subjt:  GLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIEPLEKLINETRPR

Query:  LYK--TVKNFASLYYQYYQKGQRPIEAARI
         +K  T+K +    +     G+  ++A RI
Subjt:  LYK--TVKNFASLYYQYYQKGQRPIEAARI

Q94LP4 2-oxoglutarate-dependent dioxygenase 116.3e-5233.84Show/hide
Query:  VPERYIHKESDRGALDAPL---MEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKLKCSREEDDTEGYG
        +PERYI  E+    +       M   IID+  L    +S  E  K R   Q WG FL  NHG+  E++  +++ + +FF+ P++ K + ++  +  EGYG
Subjt:  VPERYIHKESDRGALDAPL---MEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKLKCSREEDDTEGYG

Query:  NAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVD---GYKLAYFNFYPPCPNPDLV
         + + S DQ LDW D L L ++P +SR L++WPT+P  FR+ I  Y+   ++L   + + MA+++     S ++ F +   G ++AY   YPPC   D V
Subjt:  NAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVD---GYKLAYFNFYPPCPNPDLV

Query:  LGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIEPLEKLINETRP
        +GL PHSD   +T+LL+   V+GLQI K  +WF+      AL+  +GD  EI SNG F+SV HR + N  +ERIS A  + P     I PL + + + + 
Subjt:  LGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIEPLEKLINETRP

Query:  RLYKTVK--NFASLYYQYYQKGQRPIEAARI
        + Y+++   +F    +     G+  +E  ++
Subjt:  RLYKTVK--NFASLYYQYYQKGQRPIEAARI

Arabidopsis top hitse value%identityAlignment
AT1G49390.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.1e-9952.35Show/hide
Query:  VQDVASKGE-VPERYIHKESDRGALD-----APLMEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKL
        VQ+V + G+ +PERY+H  +  G         P M+   ID+ LL SSS  G  E++K    L +WG     NHG++   L+++ ++ K+FFALP EEK 
Subjt:  VQDVASKGE-VPERYIHKESDRGALD-----APLMEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKL

Query:  KCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYFN
        KC+RE  + +GYGN MILS++QVLDW DRL LT YPE+ R+LK+WP  P  F E + EYT   R L EK  KAMARSL+L+EN F+  + +   + + FN
Subjt:  KCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYFN

Query:  FYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIE
        F+PPCP PD V+G+KPH+DGS+IT+LL DK+VEGLQ LK  +W+ APIVPD +L+ +GDQ EI SNGI+KS VHRV+TN E+ERIS+AT  +P  +KEI 
Subjt:  FYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIE

Query:  PLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        P + L+ E RPRLYKTV  +  L+Y+YYQ+G+R IEAA I
Subjt:  PLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

AT3G21420.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein9.9e-6939.49Show/hide
Query:  GAVQDV-----ASKGEVPERYIHKESDRGALDAPLM------EASIIDIDLLSSSSNSG--PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFF
        G + DV     +   +VPER+I +E +RG + + L       +  +ID+  LS   N     E+ K     + WG F   NHG+  E++E++ +V  EFF
Subjt:  GAVQDV-----ASKGEVPERYIHKESDRGALDAPLM------EASIIDIDLLSSSSNSG--PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFF

Query:  ALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDG
         +P+EEK K   E    +GYG A I S DQ LDW +   L ++P + R  K WP+ P RF E +  Y++ +R LC+++LK +A SL L E  F   F + 
Subjt:  ALPMEEKLKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDG

Query:  YKLAYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDK-EVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYL
         +    N+YPPC +PDLVLGL PHSDGS++T+L Q K    GLQILK N W     +P+AL++ +GD  E+ SNG +KSV HR +TN E+ER+++ T Y 
Subjt:  YKLAYFNFYPPCPNPDLVLGLKPHSDGSSITILLQDK-EVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYL

Query:  PHSEKEIEPLEKLI-NETRPRLYKTVKNFASLYYQYYQ---KGQRPIEAARI
        P+ E EIEP+ +L+ +ET P  Y++  N     Y Y     +G++ ++ A+I
Subjt:  PHSEKEIEPLEKLI-NETRPRLYKTVKNFASLYYQYYQ---KGQRPIEAARI

AT5G20400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.0e-10253.53Show/hide
Query:  VQDVASKGE-VPERYIHKESDRGALD-----APLMEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKL
        VQ+V + GE +PERY+H  +  G +       P M+   ID++LL SSS +G  EL K    L +WG     NHG++   L+++ ++ KEFFALP EEK 
Subjt:  VQDVASKGE-VPERYIHKESDRGALD-----APLMEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKL

Query:  KCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYFN
        KC+RE D  +GYGN MIL +DQVLDW DRL +T YPE+ R+L +WP  P  FRE ++EYT   R + E+  KAMARSL+L+ENSF++ + +   L   FN
Subjt:  KCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYFN

Query:  FYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIE
         YPPCP+PD V+G+KPH+DGS+IT+LL DK+V GLQ  K  +W+ APIVPD +L+ VGDQ EI SNGI+KS VHRV+TN E+ERIS+AT  +P ++KEI+
Subjt:  FYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIE

Query:  PLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        P+ +L++E RPRLYKTVK +  LY++YYQ+G+RPIEAA I
Subjt:  PLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

AT5G20550.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.0e-9050.44Show/hide
Query:  VQDVASKGE-VPERYIHKES--DRGA-LDA--PLMEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKL
        VQ+V + GE +PERY+   +  D G  L+A  P+M+   ID+ LL S S+ G  EL K    L +WG     NHG++  LL+++ ++ KEF ALP EEK 
Subjt:  VQDVASKGE-VPERYIHKES--DRGA-LDA--PLMEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKL

Query:  KCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYFN
        K +RE    +GYGN MIL +DQVLDW DRL +T YPE+ R+LK+WP  P  FRE ++EYT     +  ++ KAMA SL+L+EN F++   +   +   FN
Subjt:  KCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYFN

Query:  FYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIV-PDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEI
         YPPCP PD V+G++PH+D S+ T+LL DK VEGLQ LK  +W+ AP+V  D +L+ VGDQ EI SNGI+KS VHRV+TNTE+ERIS+AT  +P ++KEI
Subjt:  FYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIV-PDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEI

Query:  EPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        +P++ L++E RPRLYK VKN+  L  +YY +GQRPI A+ I
Subjt:  EPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI

AT5G54000.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.2e-9249.85Show/hide
Query:  VQDVASKGE-VPERYIHKESDRGALDAPL------MEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEK
        VQ+V + GE +PERY++  +  G  D P       M+ SIID++LL SSS+ G  EL K    + +WG     NHG+S  LL+++ ++ K+FF LP +EK
Subjt:  VQDVASKGE-VPERYIHKESDRGALDAPL------MEASIIDIDLLSSSSNSG-PELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEK

Query:  LKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYF
         K +RE    +G+GN MILS+DQVLDW DRL L  YPE+ R+LK+WP  P  FRE ++EYT   + + EK  KA+ARSL+L++N F+    +   L   F
Subjt:  LKCSREEDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKL-AYF

Query:  NFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEI
        N YPPCP PD VLGLKPHSDGS+ T++L DK VEGLQ LK  +W+ A I+P  +L+ VGD  E+ SNGI+KS VHRV+ N ++ERI +AT      +KEI
Subjt:  NFYPPCPNPDLVLGLKPHSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEI

Query:  EPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI
        +PL  L++E RPRLYK VK     ++ YYQ+G+RPIEAA I
Subjt:  EPLEKLINETRPRLYKTVKNFASLYYQYYQKGQRPIEAARI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTGAACGCATCCGTAACGAAGTTGGCACTTGTAATCCTGGTTTGCATAGCCCTTTTAGCATCACCTTCAATGGCCCGCATGGCGGTGATGGCGGCAGCACCTGA
TCCGTTGGTAACAGAGGATTTCTCTTTGTTCCCTTGTATATCATCTGTTCCGGAGGTGACGAAATGCATGATAGATGTGTTTAGAAATGCAGTTGCTCCTCATCCTTCAT
GTTGTACAGCCATTTCTAAGCTCAAAGGTTGTTCTTCTGAGTTTCTCAAAGATATTCCCTCTGTTGACATGATTTTGATTAAGAGCATTTGTGCTTTGTGGGGTGCAATT
TTGGACCACCCCGATATACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGG
GGTCGGGTTTTTGGCCCGACCCCCTGCTCGGCCCGCAGGGGTCGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAA
TGGGACGGGCCAAGACCGAAGGGGTCGGGTGTTCAACGAACCCGGTAACCCGAAAATCCGGCCAACCCAACCCTAATCATAAGGGTTGGGTTAGTGCCACCATTTCTGCA
ACATCGGCGATGGCTGGAATTAACCCGTCCGGCGCCGTCCAAGACGTGGCTTCTAAAGGGGAAGTGCCGGAAAGATACATCCATAAAGAAAGCGATCGAGGAGCTCTAGA
TGCTCCTTTAATGGAAGCTTCTATAATTGACATCGATCTCCTCTCGTCTTCATCCAATTCCGGACCGGAACTCCAGAAATTCCGACATGGACTTCAGTCATGGGGCTGCT
TTCTGGCGAAAAATCATGGAATGTCCTCTGAACTTCTGGAAGAAGTTCGGCAAGTAATGAAAGAGTTCTTTGCTCTTCCAATGGAAGAAAAATTGAAATGCTCGAGAGAA
GAAGATGACACTGAAGGATATGGCAACGCAATGATTCTCTCCAACGACCAAGTTCTGGATTGGACTGATCGTTTGAACCTCACTCTTTATCCAGAAGAGAGCCGCCGTTT
GAAGTACTGGCCAACAACTCCTGAAAGATTCAGGGAAGTTATTTATGAGTACACTGAAAATGTGAGGGCGCTATGTGAGAAAATCCTCAAGGCCATGGCAAGGTCATTGG
ATTTGGATGAGAATAGCTTTGTCAACGAGTTTGTCGACGGATATAAACTCGCATATTTCAACTTCTACCCTCCATGTCCAAATCCCGATCTCGTTCTGGGTCTCAAGCCG
CACTCCGATGGATCATCAATCACCATTCTGTTGCAGGACAAGGAAGTAGAAGGTCTTCAGATCTTGAAAGGCAATGAGTGGTTCAATGCTCCAATTGTTCCTGATGCTCT
TCTCGTCCTTGTTGGGGATCAAGGAGAGATAACAAGTAATGGGATATTCAAGAGTGTAGTTCATAGGGTATTGACAAACACAGAGAGGGAGAGGATTTCATTGGCCACTC
TTTACCTTCCACATTCAGAGAAAGAAATTGAACCACTTGAGAAGCTCATCAATGAAACTCGGCCAAGGTTGTACAAGACTGTCAAGAACTTTGCTTCCCTTTACTATCAG
TACTACCAGAAAGGCCAGAGGCCAATCGAGGCTGCAAGAATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCACTGAACGCATCCGTAACGAAGTTGGCACTTGTAATCCTGGTTTGCATAGCCCTTTTAGCATCACCTTCAATGGCCCGCATGGCGGTGATGGCGGCAGCACCTGA
TCCGTTGGTAACAGAGGATTTCTCTTTGTTCCCTTGTATATCATCTGTTCCGGAGGTGACGAAATGCATGATAGATGTGTTTAGAAATGCAGTTGCTCCTCATCCTTCAT
GTTGTACAGCCATTTCTAAGCTCAAAGGTTGTTCTTCTGAGTTTCTCAAAGATATTCCCTCTGTTGACATGATTTTGATTAAGAGCATTTGTGCTTTGTGGGGTGCAATT
TTGGACCACCCCGATATACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGG
GGTCGGGTTTTTGGCCCGACCCCCTGCTCGGCCCGCAGGGGTCGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAA
TGGGACGGGCCAAGACCGAAGGGGTCGGGTGTTCAACGAACCCGGTAACCCGAAAATCCGGCCAACCCAACCCTAATCATAAGGGTTGGGTTAGTGCCACCATTTCTGCA
ACATCGGCGATGGCTGGAATTAACCCGTCCGGCGCCGTCCAAGACGTGGCTTCTAAAGGGGAAGTGCCGGAAAGATACATCCATAAAGAAAGCGATCGAGGAGCTCTAGA
TGCTCCTTTAATGGAAGCTTCTATAATTGACATCGATCTCCTCTCGTCTTCATCCAATTCCGGACCGGAACTCCAGAAATTCCGACATGGACTTCAGTCATGGGGCTGCT
TTCTGGCGAAAAATCATGGAATGTCCTCTGAACTTCTGGAAGAAGTTCGGCAAGTAATGAAAGAGTTCTTTGCTCTTCCAATGGAAGAAAAATTGAAATGCTCGAGAGAA
GAAGATGACACTGAAGGATATGGCAACGCAATGATTCTCTCCAACGACCAAGTTCTGGATTGGACTGATCGTTTGAACCTCACTCTTTATCCAGAAGAGAGCCGCCGTTT
GAAGTACTGGCCAACAACTCCTGAAAGATTCAGGGAAGTTATTTATGAGTACACTGAAAATGTGAGGGCGCTATGTGAGAAAATCCTCAAGGCCATGGCAAGGTCATTGG
ATTTGGATGAGAATAGCTTTGTCAACGAGTTTGTCGACGGATATAAACTCGCATATTTCAACTTCTACCCTCCATGTCCAAATCCCGATCTCGTTCTGGGTCTCAAGCCG
CACTCCGATGGATCATCAATCACCATTCTGTTGCAGGACAAGGAAGTAGAAGGTCTTCAGATCTTGAAAGGCAATGAGTGGTTCAATGCTCCAATTGTTCCTGATGCTCT
TCTCGTCCTTGTTGGGGATCAAGGAGAGATAACAAGTAATGGGATATTCAAGAGTGTAGTTCATAGGGTATTGACAAACACAGAGAGGGAGAGGATTTCATTGGCCACTC
TTTACCTTCCACATTCAGAGAAAGAAATTGAACCACTTGAGAAGCTCATCAATGAAACTCGGCCAAGGTTGTACAAGACTGTCAAGAACTTTGCTTCCCTTTACTATCAG
TACTACCAGAAAGGCCAGAGGCCAATCGAGGCTGCAAGAATCTAG
Protein sequenceShow/hide protein sequence
MSLNASVTKLALVILVCIALLASPSMARMAVMAAAPDPLVTEDFSLFPCISSVPEVTKCMIDVFRNAVAPHPSCCTAISKLKGCSSEFLKDIPSVDMILIKSICALWGAI
LDHPDIQGADEDNRGEIGLKDGPRRQNRQMGRAKTEGVGFLARPPARPAGVGADEDNRGEIGLKDGPRRQNRQMGRAKTEGVGCSTNPVTRKSGQPNPNHKGWVSATISA
TSAMAGINPSGAVQDVASKGEVPERYIHKESDRGALDAPLMEASIIDIDLLSSSSNSGPELQKFRHGLQSWGCFLAKNHGMSSELLEEVRQVMKEFFALPMEEKLKCSRE
EDDTEGYGNAMILSNDQVLDWTDRLNLTLYPEESRRLKYWPTTPERFREVIYEYTENVRALCEKILKAMARSLDLDENSFVNEFVDGYKLAYFNFYPPCPNPDLVLGLKP
HSDGSSITILLQDKEVEGLQILKGNEWFNAPIVPDALLVLVGDQGEITSNGIFKSVVHRVLTNTERERISLATLYLPHSEKEIEPLEKLINETRPRLYKTVKNFASLYYQ
YYQKGQRPIEAARI