; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021981 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021981
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationscaffold2:5975398..5978853
RNA-Seq ExpressionSpg021981
SyntenySpg021981
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587867.1 Protein SRG1, partial [Cucurbita argyrosperma subsp. sororia]9.5e-17384.66Show/hide
Query:  MAEQSHPPQTLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEF
        M ++SHP QTLQQQLL+NGGDTPESYIYKGGY  GDSN++PLP+AEIPVVDLSQLS S GGEA LEE RLALSSWGCFQA NHGISSSFL+KMR ISE+F
Subjt:  MAEQSHPPQTLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEF

Query:  FALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGN
        FALP+EEKN+  RE DG EGYG+DLILSE QILDWTDRLYL+VNPEDER LKFWP+NP SFRED+HEFT+KVKEIIE VL+AMAASL +E KSFS+QVG 
Subjt:  FALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGN

Query:  RPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVAC
        RPV+VTRFNFYPPC TPHLVLGLKEHSDGSA TIVLLDKEVEGL++ KD+QWYR+PVPAIADSLLINVGEQ EIMSNGIFKSPVHRAVTNSERQRISVAC
Subjt:  RPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVAC

Query:  FCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYV-ETYFQYYQKGQRPVDGLKI
        FCCPEKDKEIKPIEGLIDE RPRLF +VKNYV ETYFQYYQKGQR VD LKI
Subjt:  FCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYV-ETYFQYYQKGQRPVDGLKI

XP_008465534.1 PREDICTED: protein SRG1-like [Cucumis melo]1.4e-15576.27Show/hide
Query:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS
        MA+ S  P  QTLQQQLL+NGG TPESYIYKGGY  GDSN+N PLPLA+IPV+DLSQLS +  GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR IS
Subjt:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS

Query:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQ
        E+FF+LPIEEK +  RE DG+EGYGNDLI S QQILDW+DRLYL+ NP+DER L+FWP NP SFRED+HE+TVK+ +II+TVL+AMA SLN+E  SF++Q
Subjt:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQ

Query:  VGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRIS
        VG RP +  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDK+VEGL+L KDDQWYRVPVPA+ADSLLI +GEQAEIMSNGIFKS +HRAVTNSERQRIS
Subjt:  VGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRIS

Query:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        V  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

XP_011655280.1 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X2 [Cucumis sativus]1.4e-15576.79Show/hide
Query:  QSHPPQTLQQQLLVNGGDTPESYIYKGGY-GAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFA
        Q+   QTLQQQLL+NGG TPESYIYKGGY G G +N+ PLPLAEIPVVDLSQLS    GE PL +LRLALS+WGCFQA NH ISSSFL+K+R ISE+FF+
Subjt:  QSHPPQTLQQQLLVNGGDTPESYIYKGGY-GAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFA

Query:  LPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRP
        LPIEEK +  RE DG+EGYGNDL  S QQ LDW+DRLY + +PEDER L  WP NP SFRED+HE+TVK+ EIIETVL+AMA SLN+E  SF++QVG RP
Subjt:  LPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRP

Query:  VIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFC
         + TRFNFYPPC TPHLVLGLKEHSDGSAITI+LLDK+VEGLQL KDDQWYRVPVPAIADSLL+ +GEQAE+MSNGIFKS VHRAVTNSERQRISV CFC
Subjt:  VIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFC

Query:  CPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        CPEKD EIKP+EGLIDE RPRLF SVKNY+ETYFQ YQ+GQR VDGL+I
Subjt:  CPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

XP_011655698.2 probable 2-oxoglutarate-dependent dioxygenase ANS [Cucumis sativus]3.5e-15978.51Show/hide
Query:  QSHPPQTLQQQLLVNGGDTPESYIYKGGY-GAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFA
        Q+   QTLQQQLL+NGG TPESYIYKGGY G G +N+ PLPLAEIPVVDLSQLS    GE PL +LRLALS+WGCFQAINH ISSSFL+KMR ISE+FF+
Subjt:  QSHPPQTLQQQLLVNGGDTPESYIYKGGY-GAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFA

Query:  LPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRP
        LPIEEK +  RE DG+EGYGNDLILSEQQILDW+DRLY + NPEDER L+ WP NP SFRED+ E+TVK+ EIIETVL+AMA+SL++E  SF++QVG RP
Subjt:  LPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRP

Query:  VIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFC
         ++TRFNFYPPC TPHLVLGLKEHSDGSAITI+LLDK+VEGLQL KDDQWYRVPVPAIADSLLI +GEQAE+MSNGIFKS +HRAVTNSERQRIS+ CFC
Subjt:  VIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFC

Query:  CPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        CPEKD EIKPIEGLIDE RPRLF SVKNY+ETYFQ YQKG+RPVDGL+I
Subjt:  CPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

XP_022134811.1 uncharacterized protein LOC111006992 [Momordica charantia]3.4e-16279.88Show/hide
Query:  QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEK
        +T QQ+LL+NGGDTPESYIYK GYG GDSN+NPLPLAEIPVVDL+QLS SP   A LE+LRLAL+SWGCFQAINH ISSSFL+K+  IS +FF+LP+EEK
Subjt:  QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEK

Query:  NKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRF
        NKCCRE  G+EGYG D++ SEQQILDWTDRLYL VNPEDER LK+WPQNPQSFRED+HEFT+K+K+IIETVLMAMA S+N+EA SF+EQVG RP + TRF
Subjt:  NKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRF

Query:  NFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDK
        NFYPPC  P LVLGLKEHSDGSAITIVLLD+EVEGLQ  KDDQW+RVPVPA+ADSLLIN+GEQAEIMSNG+FKS VHRAVTNSE+QRISVACFCCPEKD+
Subjt:  NFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDK

Query:  EIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        EI+PIEGLIDE RPRL+ +VKNYV +YFQ YQKGQRPVD LKI
Subjt:  EIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

TrEMBL top hitse value%identityAlignment
A0A0A0LW96 Uncharacterized protein4.7e-13377.36Show/hide
Query:  SPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWP
        SPS  GE PL +LRLALS+WGCFQA NH ISSSFL+K+R ISE+FF+LPIEEK +  RE DG+EGYGNDL  S QQ LDW+DRLY + +PEDER L  WP
Subjt:  SPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWP

Query:  QNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRV
         NP SFRED+HE+TVK+ EIIETVL+AMA SLN+E  SF++QVG RP + TRFNFYPPC TPHLVLGLKEHSDGSAITI+LLDK+VEGLQL KDDQWYRV
Subjt:  QNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRV

Query:  PVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        PVPAIADSLL+ +GEQAE+MSNGIFKS VHRAVTNSERQRISV CFCCPEKD EIKP+EGLIDE RPRLF SVKNY+ETYFQ YQ+GQR VDGL+I
Subjt:  PVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A1S3CP07 protein SRG1-like6.7e-15676.27Show/hide
Query:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS
        MA+ S  P  QTLQQQLL+NGG TPESYIYKGGY  GDSN+N PLPLA+IPV+DLSQLS +  GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR IS
Subjt:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS

Query:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQ
        E+FF+LPIEEK +  RE DG+EGYGNDLI S QQILDW+DRLYL+ NP+DER L+FWP NP SFRED+HE+TVK+ +II+TVL+AMA SLN+E  SF++Q
Subjt:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQ

Query:  VGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRIS
        VG RP +  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDK+VEGL+L KDDQWYRVPVPA+ADSLLI +GEQAEIMSNGIFKS +HRAVTNSERQRIS
Subjt:  VGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRIS

Query:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        V  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A5A7U2F3 Protein SRG1-like1.6e-15476.34Show/hide
Query:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS
        MA+ S  P  QTLQQQLL+NGG TPESYIYKGGY  GDSN+N PLPLA+IPV+DLSQLS +  GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR IS
Subjt:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS

Query:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSF-REDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSE
        E+FF+LPIEEK +  RE DG+EGYGNDLI S QQILDW+DRLYL+ NPED R L+FWP NP SF RED+HE+TVK+ EIIETVL+AMA SLN+E  SF++
Subjt:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSF-REDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSE

Query:  QVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRI
        QVG RP +  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDK+VEGL+L KDDQWYRVPVPA+ADSLLI +GEQAEIMSNGIFKS +HRAVTNSER+RI
Subjt:  QVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRI

Query:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        SV  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A5D3CEG2 Protein SRG1-like2.5e-15576.62Show/hide
Query:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS
        MA+ S  P  QTLQQQLL+NGG TPESYIYKGGY  GDSN+N PLPLA+IPV+DLSQLS +  GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR IS
Subjt:  MAEQSHPP--QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGIS

Query:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSF-REDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSE
        E+FF+LPIEEK +  RE DG+EGYGNDLI S QQILDW+DRLYL+ NP+DER L+FWP NP SF RED+HE+TVK+ EIIETVL+AMA SLN+E  SF++
Subjt:  EEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSF-REDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSE

Query:  QVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRI
        QVG RP +  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDK+VEGL+L KDDQWYRVPVPA+ADSLLI +GEQAEIMSNGIFKS +HRAVTNSERQRI
Subjt:  QVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRI

Query:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        SV  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A6J1BYU1 uncharacterized protein LOC1110069921.6e-16279.88Show/hide
Query:  QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEK
        +T QQ+LL+NGGDTPESYIYK GYG GDSN+NPLPLAEIPVVDL+QLS SP   A LE+LRLAL+SWGCFQAINH ISSSFL+K+  IS +FF+LP+EEK
Subjt:  QTLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEK

Query:  NKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRF
        NKCCRE  G+EGYG D++ SEQQILDWTDRLYL VNPEDER LK+WPQNPQSFRED+HEFT+K+K+IIETVLMAMA S+N+EA SF+EQVG RP + TRF
Subjt:  NKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRF

Query:  NFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDK
        NFYPPC  P LVLGLKEHSDGSAITIVLLD+EVEGLQ  KDDQW+RVPVPA+ADSLLIN+GEQAEIMSNG+FKS VHRAVTNSE+QRISVACFCCPEKD+
Subjt:  NFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDK

Query:  EIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        EI+PIEGLIDE RPRL+ +VKNYV +YFQ YQKGQRPVD LKI
Subjt:  EIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

SwissProt top hitse value%identityAlignment
A2A1A0 S-norcoclaurine synthase 11.7e-5237.59Show/hide
Query:  EIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNP
        EIPV+DLS+L         L +   A   WG FQ INHG+    ++KM+  +E+FF LP +EKN   +  +G+EGYG   + SE+Q LDW D  +LI  P
Subjt:  EIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNP

Query:  EDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQ
          ER+++FWP +P SFRE + +++++++++   +   MA +L +E++  ++ +  R V        P   +    LGL  HSD + +T+++   EV GL 
Subjt:  EDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQ

Query:  LSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDE
        + KD++W  VP+  I  + ++N+G+  EIMSNGI+KS  HRAV N++++R+S+A F  PE   +I P+  L+ E
Subjt:  LSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDE

D4N502 Codeine O-demethylase6.6e-5234.91Show/hide
Query:  TLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQ-LSPSP-GGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEE
        ++Q+   +   + P  Y   G     +  ++      +PV+DL   LSP P  G+  L++L  A   WG FQ +NHG+ +  +D ++   + FF LP+ E
Subjt:  TLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQ-LSPSP-GGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEE

Query:  KNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTR
        K K  ++    EG+G   I SE Q LDWT+   ++  P   R    +P+ P  FRE +  +  K+K++   V   +  SL +        +    +   R
Subjt:  KNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTR

Query:  FNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKD
         N+YPPCP P LVLGL  HSD S +TI+L   EVEGLQ+ K+++W  + +  + D+ ++NVG+  EIM+NGI++S  HRAV NS ++R+S+A F   + +
Subjt:  FNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKD

Query:  KEIKPIEGLIDEGRPRLF
         EI PI  L+    P LF
Subjt:  KEIKPIEGLIDEGRPRLF

O80449 Jasmonate-induced oxygenase 47.6e-5636.36Show/hide
Query:  EIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNP
        EIPV+D++ +   P G   L  +R A   WG FQ +NHG++ S ++++RG   EFF LP+EEK K     D  EGYG+ L + +   LDW+D  +L   P
Subjt:  EIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNP

Query:  EDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVI--VTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEG
           R+   WP  P   RE + ++  +V+++ E +   ++ SL ++     + +G    +    R NFYP CP P L LGL  HSD   ITI+L D++V G
Subjt:  EDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVI--VTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEG

Query:  LQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVK--NYVETYFQYYQK
        LQ+ + D W  V + ++ ++L++N+G+Q +I+SNGI+KS  H+ + NS  +R+S+A F  P  D  + PIE L+   RP L+  ++   Y     Q    
Subjt:  LQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVK--NYVETYFQYYQK

Query:  GQRPVDGL
        G+  VD L
Subjt:  GQRPVDGL

Q39224 Protein SRG12.4e-5735.28Show/hide
Query:  EIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNP
        EIP++D+ +L  S   ++ +E+L  A   WG FQ +NHGI SSFLDK++   ++FF LP+EEK K  +  D +EG+G   ++SE Q LDW D  +  V P
Subjt:  EIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNP

Query:  EDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVT-RFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGL
         + R    +P+ P  FR+ +  ++ +V+ + + ++  MA +L I+ +   +   +   + + R N+YPPCP P  V+GL  HSD   +T+++   +VEGL
Subjt:  EDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVT-RFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGL

Query:  QLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLF--TSVKNYVETYFQYYQKG
        Q+ KD +W  VPV  + ++ ++N+G+  EI++NG ++S  HR V NSE++R+S+A F      KE+ P + L++  +   F   ++K Y +  F     G
Subjt:  QLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLF--TSVKNYVETYFQYYQKG

Query:  QRPVDGLKI
        +  +D L+I
Subjt:  QRPVDGLKI

Q94LP4 2-oxoglutarate-dependent dioxygenase 111.9e-5133.33Show/hide
Query:  IPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPE
        IP++DL +L      E    +LR A   WG F  INHG+    +  ++    +FF+ P++ K +  +  + +EGYG   + SE Q LDW D LYL V+P 
Subjt:  IPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPE

Query:  DERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQL
        D R L+FWP +P SFR+ +  ++ + K +   +   MA ++  + +S  +    +P  + R  +YPPC     V+GL  HSD   +T++L    V+GLQ+
Subjt:  DERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQL

Query:  SKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFT-SVKNYVETYFQYYQKGQRP
         KD +W+ +  P    +L+ N+G+  EI+SNG F+S  HRAV N  ++RIS A F  P ++  I P+   + +G+ +  + S  ++++  F     G+  
Subjt:  SKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFT-SVKNYVETYFQYYQKGQRP

Query:  VDGLKI
        V+ LK+
Subjt:  VDGLKI

Arabidopsis top hitse value%identityAlignment
AT1G17010.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.0e-6037.1Show/hide
Query:  LAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIV
        ++EIP++D+++L  S   ++ +E+L  A   +G FQ +NHGI  SFLDK++   ++FF LP+EEK K  +    +EG+G   ++SE Q LDW D  +LI+
Subjt:  LAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIV

Query:  NPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEG
         P   R    +P+ P  FR+ +  ++ +VK I + +L  MA +L I+ +   E  G+  +   R N+YPPCP P+LV GL  HSD   +TI+L   EV+G
Subjt:  NPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEG

Query:  LQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSV--KNYVETYFQYYQK
        LQ+ K+ +W+   V  + ++ ++NVG+  EI++NG ++S  HRA+ N E++R+S+A F     DKEI P   L+       F S+  K+Y+   F    K
Subjt:  LQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSV--KNYVETYFQYYQK

Query:  GQRPVDGLKI
        G+  +D ++I
Subjt:  GQRPVDGLKI

AT1G49390.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.6e-10150.15Show/hide
Query:  QQLLVNGGDTPESYIY-KGGYGAGDSNSNPLPLAEIPVVDLSQL-SPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNK
        Q+++  G   PE Y++   G G     +  +P  +IP +DLS L S S  G+  +++L  ALS+WG  Q +NHGI+ +FLDK+  ++++FFALP EEK+K
Subjt:  QQLLVNGGDTPESYIY-KGGYGAGDSNSNPLPLAEIPVVDLSQL-SPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNK

Query:  CCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNF
        C RE   ++GYGND+ILS+ Q+LDW DRL+L   PED+R LKFWPQ P  F E + E+T+K + +IE    AMA SL +E   F E  G   V+ +RFNF
Subjt:  CCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNF

Query:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEI
        +PPCP P  V+G+K H+DGSAIT++L DK+VEGLQ  KD +WY+ P+  + D++LI +G+Q EIMSNGI+KSPVHR VTN E++RISVA FC P  DKEI
Subjt:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEI

Query:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
         P +GL+ E RPRL+ +V  YV+ +++YYQ+G+R ++   I
Subjt:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

AT5G20400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.4e-10552.03Show/hide
Query:  QQLLVNGGDTPESYIYKGGYGAGDSNSNPL----PLAEIPVVDLS-QLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEE
        Q+++  G   PE Y++      GD    PL    P  +IP +DL+  LS S  G+  L +L  ALS+WG  Q +NHGI+ +FLDK+  +++EFFALP EE
Subjt:  QQLLVNGGDTPESYIYKGGYGAGDSNSNPL----PLAEIPVVDLS-QLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEE

Query:  KNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTR
        K KC RE D ++GYGND+IL + Q+LDW DRLY+   PED+R L FWP+ P  FRE +HE+T+K + +IE    AMA SL +E  SF +  G    + TR
Subjt:  KNKCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTR

Query:  FNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKD
        FN YPPCP+P  V+G+K H+DGSAIT++L DK+V GLQ  KD +WY+ P+  + D++LINVG+Q EIMSNGI+KSPVHR VTN E++RISVA FC P  D
Subjt:  FNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKD

Query:  KEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        KEI+P+  L+ E RPRL+ +VK YVE YF+YYQ+G+RP++   I
Subjt:  KEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

AT5G20550.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-9851.19Show/hide
Query:  QQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLS-QLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNK
        Q+++  G   PE Y+          + N  +P+ +IP +DLS  LSPS  G   L +L  ALS+WG  Q INHGI+ + LDK+  +++EF ALP EEK K
Subjt:  QQLLVNGGDTPESYIYKGGYGAGDSNSN-PLPLAEIPVVDLS-QLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNK

Query:  CCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNF
          RE   ++GYGND+IL + Q+LDW DRLY+   PED+R LKFWP  P  FRE +HE+T+K   +   V  AMA SL +E   F +  G    + TRFN 
Subjt:  CCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNF

Query:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEI
        YPPCP P  V+G++ H+D SA T++L DK VEGLQ  KD +WY+ PV A +D++LINVG+Q EIMSNGI+KSPVHR VTN+E++RISVA FC P  DKEI
Subjt:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEI

Query:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPV
        +P++GL+ E RPRL+  VKNYV+   +YY +GQRP+
Subjt:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPV

AT5G54000.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.7e-9849.42Show/hide
Query:  QQLLVNGGDTPESYIY-KGGYGAGDSNSNP-LPLAEIPVVDLSQL-SPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKN
        Q+++  G   PE Y+Y   G G GD   N  LP  +I ++DL+ L S S  G   L +L  A+S+WG  Q +NHGIS + LDK+  ++++FF LP +EK 
Subjt:  QQLLVNGGDTPESYIY-KGGYGAGDSNSNP-LPLAEIPVVDLSQL-SPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKN

Query:  KCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFN
        K  RE    +G+GND+ILS+ Q+LDW DRLYLI  PED+R LKFWP+NP  FRE +HE+T+K + ++E    A+A SL +E   F E  G    + TRFN
Subjt:  KCCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFN

Query:  FYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKE
         YPPCP P  VLGLK HSDGSA T++L DK VEGLQ  KD +WY+  +  +  ++LINVG+  E+MSNGI+KSPVHR V N +++RI VA FC  ++DKE
Subjt:  FYPPCPTPHLVLGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKE

Query:  IKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        I+P+ GL+ E RPRL+ +VK   + +F YYQ+G+RP++   I
Subjt:  IKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAACAATCTCACCCCCCACAAACACTCCAACAACAACTTCTCGTCAACGGCGGCGACACGCCGGAAAGCTATATTTACAAAGGCGGCTACGGCGCCGGAGATTC
CAACAGTAATCCACTTCCGCTGGCAGAGATTCCAGTCGTTGACCTCTCCCAACTCTCTCCATCGCCGGGCGGCGAGGCTCCGTTAGAGGAGCTCCGGCTGGCTCTGAGTT
CATGGGGATGTTTTCAGGCGATTAATCACGGCATTTCAAGTTCGTTTCTGGACAAGATGCGTGGAATAAGTGAGGAATTTTTTGCACTACCGATTGAAGAGAAGAACAAA
TGTTGCAGAGAAGGTGATGGGGTTGAAGGATATGGGAATGATCTGATTTTGTCAGAGCAACAAATTCTTGATTGGACCGATCGCTTGTATCTTATTGTGAATCCAGAGGA
TGAGAGACACCTCAAGTTTTGGCCTCAAAATCCTCAATCTTTCAGGGAAGATGTACACGAGTTTACAGTAAAAGTAAAGGAGATAATTGAAACAGTGTTGATGGCCATGG
CAGCATCCTTAAACATAGAGGCGAAGAGCTTTTCAGAGCAGGTGGGGAACCGCCCTGTTATAGTGACAAGGTTCAACTTCTATCCGCCATGTCCGACGCCTCACCTTGTT
CTTGGTCTCAAAGAACACTCCGATGGCTCAGCCATCACCATTGTTTTACTGGACAAGGAAGTTGAAGGCCTCCAATTGAGCAAGGATGACCAGTGGTACAGAGTCCCTGT
TCCTGCCATTGCCGATTCTCTTCTCATCAACGTTGGAGAACAAGCTGAGATTATGAGCAATGGGATCTTCAAAAGCCCTGTTCATCGGGCGGTGACGAATTCAGAGAGGC
AGAGGATTTCAGTGGCATGTTTCTGTTGCCCAGAAAAAGATAAAGAGATCAAGCCGATCGAGGGGTTGATCGACGAGGGGAGGCCGAGATTGTTCACAAGTGTGAAGAAC
TATGTTGAAACCTATTTTCAGTACTACCAAAAGGGACAGAGACCAGTTGATGGATTGAAGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAACAATCTCACCCCCCACAAACACTCCAACAACAACTTCTCGTCAACGGCGGCGACACGCCGGAAAGCTATATTTACAAAGGCGGCTACGGCGCCGGAGATTC
CAACAGTAATCCACTTCCGCTGGCAGAGATTCCAGTCGTTGACCTCTCCCAACTCTCTCCATCGCCGGGCGGCGAGGCTCCGTTAGAGGAGCTCCGGCTGGCTCTGAGTT
CATGGGGATGTTTTCAGGCGATTAATCACGGCATTTCAAGTTCGTTTCTGGACAAGATGCGTGGAATAAGTGAGGAATTTTTTGCACTACCGATTGAAGAGAAGAACAAA
TGTTGCAGAGAAGGTGATGGGGTTGAAGGATATGGGAATGATCTGATTTTGTCAGAGCAACAAATTCTTGATTGGACCGATCGCTTGTATCTTATTGTGAATCCAGAGGA
TGAGAGACACCTCAAGTTTTGGCCTCAAAATCCTCAATCTTTCAGGGAAGATGTACACGAGTTTACAGTAAAAGTAAAGGAGATAATTGAAACAGTGTTGATGGCCATGG
CAGCATCCTTAAACATAGAGGCGAAGAGCTTTTCAGAGCAGGTGGGGAACCGCCCTGTTATAGTGACAAGGTTCAACTTCTATCCGCCATGTCCGACGCCTCACCTTGTT
CTTGGTCTCAAAGAACACTCCGATGGCTCAGCCATCACCATTGTTTTACTGGACAAGGAAGTTGAAGGCCTCCAATTGAGCAAGGATGACCAGTGGTACAGAGTCCCTGT
TCCTGCCATTGCCGATTCTCTTCTCATCAACGTTGGAGAACAAGCTGAGATTATGAGCAATGGGATCTTCAAAAGCCCTGTTCATCGGGCGGTGACGAATTCAGAGAGGC
AGAGGATTTCAGTGGCATGTTTCTGTTGCCCAGAAAAAGATAAAGAGATCAAGCCGATCGAGGGGTTGATCGACGAGGGGAGGCCGAGATTGTTCACAAGTGTGAAGAAC
TATGTTGAAACCTATTTTCAGTACTACCAAAAGGGACAGAGACCAGTTGATGGATTGAAGATTTAG
Protein sequenceShow/hide protein sequence
MAEQSHPPQTLQQQLLVNGGDTPESYIYKGGYGAGDSNSNPLPLAEIPVVDLSQLSPSPGGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRGISEEFFALPIEEKNK
CCREGDGVEGYGNDLILSEQQILDWTDRLYLIVNPEDERHLKFWPQNPQSFREDVHEFTVKVKEIIETVLMAMAASLNIEAKSFSEQVGNRPVIVTRFNFYPPCPTPHLV
LGLKEHSDGSAITIVLLDKEVEGLQLSKDDQWYRVPVPAIADSLLINVGEQAEIMSNGIFKSPVHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKN
YVETYFQYYQKGQRPVDGLKI