; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0565 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0565
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationMC06:4579899..4581180
RNA-Seq ExpressionMC06g0565
SyntenyMC06g0565
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587867.1 Protein SRG1, partial [Cucurbita argyrosperma subsp. sororia]6.37e-19377.42Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC
        Q+LLINGGDTPESYIYK GY GGDSNN+PLP+AEIPVVDL+QLSSS    AALE+ RLAL+SWGCFQA NH ISSSFL K+ QIS QFF+LPMEEKN+  
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC

Query:  RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY
        RE+ G EGYG+D++ SE QILDWTDRLYL VNPEDER+LK+WP+NP SFR EDLHEFTIK+K+IIE VL+AMA S+ VE  SF++QVGKRP L TRFNFY
Subjt:  RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY

Query:  PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE
        PPCS P LVLGLKEHSDGSA TIVLLD+EVEGL+ +KD+QW+R+PVPA+ADSLLIN+GEQ EIMSNG+FKS VHRAVTNSE+QRISVACFCCPEKD+EI+
Subjt:  PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE

Query:  PIKGLIDERRPRLYRNVKNYVG-SYFQDYQKGQRPVDKLKI
        PI+GLIDERRPRL+R VKNYV  +YFQ YQKGQR VD+LKI
Subjt:  PIKGLIDERRPRLYRNVKNYVG-SYFQDYQKGQRPVDKLKI

XP_011655280.1 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X2 [Cucumis sativus]8.02e-18574.19Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC
        Q+LLINGG TPESYIYK GY GG SNNN PLPLAEIPVVDL+QLSS       L DLRLAL++WGCFQA NHSISSSFL K+ +IS QFFSLP+EEK + 
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC

Query:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF
         RE+ G+EGYG D+ FS QQ LDW+DRLY   +PEDER+L  WP NP SFR EDLHE+T+K+ +IIETVL+AMARS+NVE NSFT+QVG+RP LFTRFNF
Subjt:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF

Query:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI
        YPPCS P LVLGLKEHSDGSAITI+LLD++VEGLQ RKDDQW+RVPVPA+ADSLL+ IGEQAE+MSNG+FKS+VHRAVTNSE+QRISV CFCCPEKD EI
Subjt:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI

Query:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        +P++GLIDE+RPRL+R+VKNY+ +YFQ+YQ+GQR VD L+I
Subjt:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

XP_011655698.2 probable 2-oxoglutarate-dependent dioxygenase ANS [Cucumis sativus]8.02e-18574.19Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC
        Q+LLINGG TPESYIYK GY GG SNNN PLPLAEIPVVDL+QLSS       L DLRLAL++WGCFQAINHSISSSFL K+ +IS QFFSLP+EEK + 
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC

Query:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF
         RE+ G+EGYG D++ SEQQILDW+DRLY   NPEDER+L+ WP NP SFR EDL E+T+K+ +IIETVL+AMA S++VE NSFT+QVGKRP L TRFNF
Subjt:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF

Query:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI
        YPPCS P LVLGLKEHSDGSAITI+LLD++VEGLQ RKDDQW+RVPVPA+ADSLLI IGEQAE+MSNG+FKS++HRAVTNSE+QRIS+ CFCCPEKD EI
Subjt:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI

Query:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        +PI+GLIDE+RPRL+++VKNY+ +YFQ+YQKG+RPVD L+I
Subjt:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

XP_022134811.1 uncharacterized protein LOC111006992 [Momordica charantia]5.60e-24599.41Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC
        QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC

Query:  RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY
        RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFR EDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY
Subjt:  RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY

Query:  PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE
        PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE
Subjt:  PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE

Query:  PIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        PI+GLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
Subjt:  PIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

XP_031736054.1 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X1 [Cucumis sativus]2.05e-18573.9Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC
        Q+LLINGG TPESYIYK GY GG SNNN PLPLAEIPVVDL+QLSS       L DLRLAL++WGCFQA NHSISSSFL K+ +IS QFFSLP+EEK + 
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC

Query:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF
         RE+ G+EGYG D+ FS QQ LDW+DRLY   +PEDER+L  WP NP SF  EDLHE+T+K+ +IIETVL+AMARS+NVE NSFT+QVG+RP LFTRFNF
Subjt:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF

Query:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI
        YPPCS P LVLGLKEHSDGSAITI+LLD++VEGLQ RKDDQW+RVPVPA+ADSLL+ IGEQAE+MSNG+FKS+VHRAVTNSE+QRISV CFCCPEKD EI
Subjt:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI

Query:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        +P++GLIDE+RPRL+R+VKNY+ +YFQ+YQ+GQR VD L+I
Subjt:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

TrEMBL top hitse value%identityAlignment
A0A0A0LW96 Uncharacterized protein1.74e-15472.48Show/hide
Query:  LSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYW
        L S       L DLRLAL++WGCFQA NHSISSSFL K+ +IS QFFSLP+EEK +  RE+ G+EGYG D+ FS QQ LDW+DRLY   +PEDER+L  W
Subjt:  LSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYW

Query:  PQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWF
        P NP SFR EDLHE+T+K+ +IIETVL+AMARS+NVE NSFT+QVG+RP LFTRFNFYPPCS P LVLGLKEHSDGSAITI+LLD++VEGLQ RKDDQW+
Subjt:  PQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWF

Query:  RVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        RVPVPA+ADSLL+ IGEQAE+MSNG+FKS+VHRAVTNSE+QRISV CFCCPEKD EI+P++GLIDE+RPRL+R+VKNY+ +YFQ+YQ+GQR VD L+I
Subjt:  RVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

A0A1S3CP07 protein SRG1-like1.29e-18373.31Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC
        Q+LLINGG TPESYIYK GY GGDSNNN PLPLA+IPV+DL+QLSS+    A L  LRLAL++WGCFQA NH ISSSFL K+ +IS QFFSLP+EEK + 
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC

Query:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF
         RE+ G+EGYG D++FS QQILDW+DRLYL  NP+DER+L++WP NP SFR EDLHE+T+KL +II+TVL+AMARS+NVE NSFT+QVGKRP LF RFNF
Subjt:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF

Query:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI
        YPPCS P LVLGLKEHSDG+AIT++LLD++VEGL+ RKDDQW+RVPVPA+ADSLLI IGEQAEIMSNG+FKS +HRAVTNSE+QRISV  FCCPEKD EI
Subjt:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI

Query:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        +PI+GLIDE+RPRL+R+ KNY+ +YFQ+YQKG+R VD L+I
Subjt:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

A0A5A7U2F3 Protein SRG1-like9.43e-18473.02Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC
        Q+LLINGG TPESYIYK GY GGDSNNN PLPLA+IPV+DL+QLSS+    A L  LRLAL++WGCFQA NH ISSSFL K+ +IS QFFSLP+EEK + 
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC

Query:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF
         RE+ G+EGYG D++FS QQILDW+DRLYL  NPED R+L++WP NP SF  EDLHE+T+KL +IIETVL+AMARS+NVE NSFT+QVGKRP LF RFNF
Subjt:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF

Query:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI
        YPPCS P LVLGLKEHSDG+AIT++LLD++VEGL+ RKDDQW+RVPVPA+ADSLLI IGEQAEIMSNG+FKS +HRAVTNSE++RISV  FCCPEKD EI
Subjt:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI

Query:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        +PI+GLIDE+RPRL+R+ KNY+ +YFQ+YQKG+R VD L+I
Subjt:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

A0A5D3CEG2 Protein SRG1-like8.11e-18573.31Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC
        Q+LLINGG TPESYIYK GY GGDSNNN PLPLA+IPV+DL+QLSS+    A L  LRLAL++WGCFQA NH ISSSFL K+ +IS QFFSLP+EEK + 
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNN-PLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKC

Query:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF
         RE+ G+EGYG D++FS QQILDW+DRLYL  NP+DER+L++WP NP SF  EDLHE+T+KL +IIETVL+AMARS+NVE NSFT+QVGKRP LF RFNF
Subjt:  CRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNF

Query:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI
        YPPCS P LVLGLKEHSDG+AIT++LLD++VEGL+ RKDDQW+RVPVPA+ADSLLI IGEQAEIMSNG+FKS +HRAVTNSE+QRISV  FCCPEKD EI
Subjt:  YPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREI

Query:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        +PI+GLIDE+RPRL+R+ KNY+ +YFQ+YQKG+R VD L+I
Subjt:  EPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

A0A6J1BYU1 uncharacterized protein LOC1110069922.71e-24599.41Show/hide
Query:  QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC
        QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC
Subjt:  QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCC

Query:  RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY
        RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFR EDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY
Subjt:  RELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFY

Query:  PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE
        PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE
Subjt:  PPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIE

Query:  PIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        PI+GLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
Subjt:  PIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI

SwissProt top hitse value%identityAlignment
A0A4D6Q440 Flavonol synthase 15.6e-4834.67Show/hide
Query:  PLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLK
        P+ EIPV+DL        + A       A   WG FQ INH I    + ++ ++  +FF LP EEK     ++  +EGYGT +    +    W D L+  
Subjt:  PLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLK

Query:  VNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQV-GKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDRE
        + PE      +WPQNP  +R  +  E+   L +++  +L +++  + +E + F E + G   EL  + N+YP C RPDL LG+  H+D SAIT+ L+  E
Subjt:  VNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQV-GKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDRE

Query:  VEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYRNVKNYVGSYFQDYQ
        V GLQ  KDD WF      + ++++++IG+Q EI+SNG +KS +HR   N EK R+S   FC P  +  +  +  L+ E  P  Y+  K      ++DYQ
Subjt:  VEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYRNVKNYVGSYFQDYQ

D4N502 Codeine O-demethylase5.6e-4837.68Show/hide
Query:  IPVVDLAQLSSSPPSTAALE--DLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVN
        +PV+DL  L S  P    LE   L  A   WG FQ +NH + +  ++ I      FF+LPM EK K  ++    EG+G   + SE Q LDWT+   +   
Subjt:  IPVVDLAQLSSSPPSTAALE--DLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVN

Query:  PEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSIN-VEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVE
        P   R+   +P+ P  FR E L  +  K+K++   V   + +S+  VE    T+      +   R N+YPPC RP+LVLGL  HSD S +TI+L   EVE
Subjt:  PEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSIN-VEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVE

Query:  GLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYR
        GLQ RK+++W  + +  + D+ ++N+G+  EIM+NG+++S  HRAV NS K+R+S+A F   + + EI PI  L+    P L++
Subjt:  GLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYR

O80449 Jasmonate-induced oxygenase 44.3e-4833.66Show/hide
Query:  EIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNP
        EIPV+D+  +   P     L  +R A   WG FQ +NH ++ S + ++     +FF LP+EEK K        EGYG+ +   +   LDW+D  +L   P
Subjt:  EIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNP

Query:  EDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPEL--FTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVE
           R    WP  P   R E + ++  +++++ E +   ++ S+ ++ N   + +G   ++    R NFYP C +P L LGL  HSD   ITI+L D +V 
Subjt:  EDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPEL--FTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVE

Query:  GLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYRNVK--NYVGSYFQDYQ
        GLQ R+ D W  V + ++ ++L++NIG+Q +I+SNG++KS  H+ + NS  +R+S+A F  P  D  + PI+ L+   RP LY+ ++   Y     Q   
Subjt:  GLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYRNVK--NYVGSYFQDYQ

Query:  KGQRPVDKL
         G+  VD L
Subjt:  KGQRPVDKL

Q39224 Protein SRG19.9e-5335.16Show/hide
Query:  EIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNP
        EIP++D+ +L SS    + +E L  A   WG FQ +NH I SSFL+K+      FF+LPMEEK K  +    +EG+G   V SE Q LDW D  +  V P
Subjt:  EIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNP

Query:  EDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFT-RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEG
         + R+   +P+ P  FR + L  ++ +++ + + ++  MAR++ ++     +       + + R N+YPPC +PD V+GL  HSD   +T+++   +VEG
Subjt:  EDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFT-RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEG

Query:  LQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYR--NVKNYVGSYFQDYQK
        LQ +KD +W  VPV  + ++ ++NIG+  EI++NG ++S  HR V NSEK+R+S+A F      +E+ P K L++ ++   ++   +K Y    F     
Subjt:  LQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYR--NVKNYVGSYFQDYQK

Query:  GQRPVDKLKI
        G+  +D L+I
Subjt:  GQRPVDKLKI

Q94LP4 2-oxoglutarate-dependent dioxygenase 114.2e-5134.94Show/hide
Query:  PESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYG
        PE YI  +       NN    +A IP++DL +L     S      LR A   WG F  INH +    +  + +    FFS P++ K +  +    LEGYG
Subjt:  PESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYG

Query:  TDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVL
           VFSE Q LDW D LYL V+P D R L++WP +P SFR + +  ++ + K +   +   MA+++  +  S  +   ++P    R  +YPPC + D V+
Subjt:  TDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVL

Query:  GLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERR
        GL  HSD   +T++L    V+GLQ +KD +WF +  P  A  L+ NIG+  EI+SNG F+S  HRAV N  K+RIS A F  P ++  I P+   + + +
Subjt:  GLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERR

Query:  PRLYRNVK--NYVGSYFQDYQKGQRPVDKLKI
         + YR++   +++   F     G+  V+ LK+
Subjt:  PRLYRNVK--NYVGSYFQDYQKGQRPVDKLKI

Arabidopsis top hitse value%identityAlignment
AT1G49390.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.4e-9448.52Show/hide
Query:  QKLLINGGDTPESYIY-KDGYGGGDSNNNPLPLAEIPVVDLAQL-SSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNK
        Q+++  G   PE Y++   G G     N  +P  +IP +DL+ L SSS      ++ L  AL++WG  Q +NH I+ +FL+KI++++ QFF+LP EEK+K
Subjt:  QKLLINGGDTPESYIY-KDGYGGGDSNNNPLPLAEIPVVDLAQL-SSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNK

Query:  CCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFN
        C RE   ++GYG DM+ S+ Q+LDW DRL+L   PED+RQLK+WPQ P  F  E L E+T+K + +IE    AMARS+ +E N F E  G+   + +RFN
Subjt:  CCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFN

Query:  FYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDRE
        F+PPC RPD V+G+K H+DGSAIT++L D++VEGLQ+ KD +W++ P+  + D++LI +G+Q EIMSNG++KS VHR VTN EK+RISVA FC P  D+E
Subjt:  FYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDRE

Query:  IEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVD
        I P  GL+ E RPRLY+ V  YV  +++ YQ+G+R ++
Subjt:  IEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVD

AT3G21420.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.6e-5839.62Show/hide
Query:  EIPVVDLAQLSSSPPSTAALEDLRL--ALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKV
        +IPV+DL++LS         E L+L  A   WG FQ INH I    +  I +++++FF +P+EEK K   E   ++GYG   +FSE Q LDW +   L V
Subjt:  EIPVVDLAQLSSSPPSTAALEDLRL--ALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKV

Query:  NPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDR-EV
        +P   R  K WP  P  F  E L  ++ +++++ + +L  +A S+ ++   F E  G+  +   R N+YPPCS PDLVLGL  HSDGSA+T++   +   
Subjt:  NPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDR-EV

Query:  EGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLI-DERRPRLYR--NVKNYVGSYFQD
         GLQ  KD+ W  VPV  + ++L+INIG+  E++SNG +KS  HRAVTN EK+R+++  F  P  + EIEP+  L+ DE  P  YR  N  +Y   Y  +
Subjt:  EGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLI-DERRPRLYR--NVKNYVGSYFQD

Query:  YQKGQRPVDKLKI
          +G++ +D  KI
Subjt:  YQKGQRPVDKLKI

AT5G20400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein6.7e-9749.41Show/hide
Query:  QKLLINGGDTPESYIY-KDGYGGGDSNNNPLPLAEIPVVDL-AQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNK
        Q+++  G   PE Y++   G G     N  +P  +IP +DL   LSSS      L  L  AL++WG  Q +NH I+ +FL+KI++++ +FF+LP EEK K
Subjt:  QKLLINGGDTPESYIY-KDGYGGGDSNNNPLPLAEIPVVDL-AQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNK

Query:  CCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFN
        C RE+  ++GYG DM+  + Q+LDW DRLY+   PED+RQL +WP+ P  FR E LHE+T+K + +IE    AMARS+ +E NSF +  G+   L TRFN
Subjt:  CCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFN

Query:  FYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDRE
         YPPC  PD V+G+K H+DGSAIT++L D++V GLQ++KD +W++ P+  + D++LIN+G+Q EIMSNG++KS VHR VTN EK+RISVA FC P  D+E
Subjt:  FYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDRE

Query:  IEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVD
        I+P+  L+ E RPRLY+ VK YV  YF+ YQ+G+RP++
Subjt:  IEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVD

AT5G20550.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.5e-9147.77Show/hide
Query:  QKLLINGGDTPESYIYKDGY-GGGDSNNNPLPLAEIPVVDLA-QLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNK
        Q+++  G   PE Y+        G   N  +P+ +IP +DL+  LS S      L  L  AL++WG  Q INH I+ + L+KI++++ +F +LP EEK K
Subjt:  QKLLINGGDTPESYIYKDGY-GGGDSNNNPLPLAEIPVVDLA-QLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNK

Query:  CCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFN
          RE+  ++GYG DM+  + Q+LDW DRLY+   PED+RQLK+WP  P  FR E LHE+T+K   +   V  AMA S+ +E N F +  G+   + TRFN
Subjt:  CCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFN

Query:  FYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDRE
         YPPC RPD V+G++ H+D SA T++L D+ VEGLQ+ KD +W++ PV A +D++LIN+G+Q EIMSNG++KS VHR VTN+EK+RISVA FC P  D+E
Subjt:  FYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDRE

Query:  IEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPV
        I+P+ GL+ E RPRLY+ VKNYV    + Y +GQRP+
Subjt:  IEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPV

AT5G54000.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.5e-9348.1Show/hide
Query:  QKLLINGGDTPESYIY-KDGYGGGDSN-NNPLPLAEIPVVDLAQL-SSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKN
        Q+++  G   PE Y+Y   G G GD   N  LP  +I ++DL  L SSS      L  L  A+++WG  Q +NH IS + L+KIH+++ QFF LP +EK 
Subjt:  QKLLINGGDTPESYIY-KDGYGGGDSN-NNPLPLAEIPVVDLAQL-SSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKN

Query:  KCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRF
        K  RE+   +G+G DM+ S+ Q+LDW DRLYL   PED+RQLK+WP+NP  FR E LHE+T+K + ++E    A+ARS+ +E N F E  G+   L TRF
Subjt:  KCCRELYGLEGYGTDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRF

Query:  NFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDR
        N YPPC RPD VLGLK HSDGSA T++L D+ VEGLQ+ KD +W++  +  +  ++LIN+G+  E+MSNG++KS VHR V N +K+RI VA FC  ++D+
Subjt:  NFYPPCSRPDLVLGLKEHSDGSAITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDR

Query:  EIEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI
        EI+P+ GL+ E RPRLY+ VK    ++F  YQ+G+RP++   I
Subjt:  EIEPIKGLIDERRPRLYRNVKNYVGSYFQDYQKGQRPVDKLKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAAAAACTTCTCATCAACGGCGGCGACACGCCGGAGAGTTACATTTACAAAGACGGCTATGGCGGCGGAGATTCCAACAATAACCCACTTCCCTTGGCGGAGATTCCGGT
CGTTGACCTTGCACAACTCTCGTCGTCGCCGCCCAGCACGGCGGCGTTAGAGGACCTCCGGTTGGCTCTCACTTCGTGGGGCTGTTTTCAGGCCATTAATCACAGCATTT
CCAGTTCGTTTCTAAACAAGATTCATCAAATAAGCAACCAATTTTTTTCACTGCCTATGGAAGAGAAGAACAAATGTTGCAGAGAACTTTATGGGCTTGAAGGATATGGA
ACTGATATGGTTTTCTCAGAGCAACAAATTCTTGATTGGACCGATCGGTTATATCTCAAAGTAAATCCAGAAGATGAGAGACAGCTCAAATATTGGCCGCAGAATCCTCA
ATCATTCAGGCCGGAAGATCTACACGAGTTCACAATCAAATTAAAGCAGATAATTGAAACAGTACTGATGGCCATGGCAAGATCCATAAACGTGGAGGCGAATAGCTTTA
CAGAGCAAGTGGGGAAGCGTCCAGAGTTGTTCACGAGATTCAATTTCTATCCGCCATGTTCGAGGCCGGATCTTGTTCTTGGGCTCAAAGAACACTCAGATGGGTCTGCC
ATCACCATTGTTTTACTGGACAGAGAAGTCGAAGGTCTCCAATGGCGGAAGGACGACCAGTGGTTCAGAGTCCCTGTTCCTGCCATGGCCGATTCTCTTCTAATCAATAT
TGGGGAACAAGCCGAGATTATGAGCAATGGGGTGTTCAAGAGTGCTGTTCATCGGGCCGTGACGAACTCGGAGAAGCAGAGGATTTCGGTGGCATGCTTCTGCTGCCCAG
AAAAGGATAGAGAGATCGAGCCAATCAAGGGGCTGATCGACGAGAGGAGGCCGAGATTGTATAGAAATGTGAAGAACTATGTGGGTTCATATTTCCAAGACTACCAGAAG
GGACAGAGACCAGTTGATAAATTGAAGATT
mRNA sequenceShow/hide mRNA sequence
CAAAAACTTCTCATCAACGGCGGCGACACGCCGGAGAGTTACATTTACAAAGACGGCTATGGCGGCGGAGATTCCAACAATAACCCACTTCCCTTGGCGGAGATTCCGGT
CGTTGACCTTGCACAACTCTCGTCGTCGCCGCCCAGCACGGCGGCGTTAGAGGACCTCCGGTTGGCTCTCACTTCGTGGGGCTGTTTTCAGGCCATTAATCACAGCATTT
CCAGTTCGTTTCTAAACAAGATTCATCAAATAAGCAACCAATTTTTTTCACTGCCTATGGAAGAGAAGAACAAATGTTGCAGAGAACTTTATGGGCTTGAAGGATATGGA
ACTGATATGGTTTTCTCAGAGCAACAAATTCTTGATTGGACCGATCGGTTATATCTCAAAGTAAATCCAGAAGATGAGAGACAGCTCAAATATTGGCCGCAGAATCCTCA
ATCATTCAGGCCGGAAGATCTACACGAGTTCACAATCAAATTAAAGCAGATAATTGAAACAGTACTGATGGCCATGGCAAGATCCATAAACGTGGAGGCGAATAGCTTTA
CAGAGCAAGTGGGGAAGCGTCCAGAGTTGTTCACGAGATTCAATTTCTATCCGCCATGTTCGAGGCCGGATCTTGTTCTTGGGCTCAAAGAACACTCAGATGGGTCTGCC
ATCACCATTGTTTTACTGGACAGAGAAGTCGAAGGTCTCCAATGGCGGAAGGACGACCAGTGGTTCAGAGTCCCTGTTCCTGCCATGGCCGATTCTCTTCTAATCAATAT
TGGGGAACAAGCCGAGATTATGAGCAATGGGGTGTTCAAGAGTGCTGTTCATCGGGCCGTGACGAACTCGGAGAAGCAGAGGATTTCGGTGGCATGCTTCTGCTGCCCAG
AAAAGGATAGAGAGATCGAGCCAATCAAGGGGCTGATCGACGAGAGGAGGCCGAGATTGTATAGAAATGTGAAGAACTATGTGGGTTCATATTTCCAAGACTACCAGAAG
GGACAGAGACCAGTTGATAAATTGAAGATT
Protein sequenceShow/hide protein sequence
QKLLINGGDTPESYIYKDGYGGGDSNNNPLPLAEIPVVDLAQLSSSPPSTAALEDLRLALTSWGCFQAINHSISSSFLNKIHQISNQFFSLPMEEKNKCCRELYGLEGYG
TDMVFSEQQILDWTDRLYLKVNPEDERQLKYWPQNPQSFRPEDLHEFTIKLKQIIETVLMAMARSINVEANSFTEQVGKRPELFTRFNFYPPCSRPDLVLGLKEHSDGSA
ITIVLLDREVEGLQWRKDDQWFRVPVPAMADSLLINIGEQAEIMSNGVFKSAVHRAVTNSEKQRISVACFCCPEKDREIEPIKGLIDERRPRLYRNVKNYVGSYFQDYQK
GQRPVDKLKI