; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0022067 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0022067
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationchr02:13822190..13824275
RNA-Seq ExpressionPay0022067
SyntenyPay0022067
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050083.1 protein SRG1-like [Cucumis melo var. makuwa]2.9e-19094.65Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP+D RRLEFWPLNPPSFS+EDLHEYTVKLM+II+TVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSER+RI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVVSFC             LIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

TYK10347.1 protein SRG1-like [Cucumis melo var. makuwa]5.2e-19295.49Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFS+EDLHEYTVKLM+II+TVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVVSFC             LIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

XP_008465534.1 PREDICTED: protein SRG1-like [Cucumis melo]5.8e-19195.77Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSF +EDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVVSFC             LIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

XP_011655280.1 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X2 [Cucumis sativus]4.2e-17385.07Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQ  QTQTLQQQLLINGGHTPESYIYKGGYHGG SNNNTPLPLA+IPV+DLSQLSS S GE PLN LRLALSTWGCFQATNH ISSSFLEK+RKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDL FSNQQ LDWSDRLY VT+P+DERRL+ WPLNPPSF +EDLHEYTVK+M+II+TVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVG+RPTLF RFNFYPPCSTPHLVLGLKEHSDG+AIT+LLLDKQVEGL+LRKDDQWYRVPVPA+ADSLL++IGEQAE+MSNGIFKS +HRAVTNSERQRI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVV FC             LIDEKRPRLFRS KNYL+TYFQNYQ+G+R+VDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

XP_031736054.1 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X1 [Cucumis sativus]4.5e-17585.35Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQ  QTQTLQQQLLINGGHTPESYIYKGGYHGG SNNNTPLPLA+IPV+DLSQLSS S GE PLN LRLALSTWGCFQATNH ISSSFLEK+RKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDL FSNQQ LDWSDRLY VT+P+DERRL+ WPLNPPSFS+EDLHEYTVK+M+II+TVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVG+RPTLF RFNFYPPCSTPHLVLGLKEHSDG+AIT+LLLDKQVEGL+LRKDDQWYRVPVPA+ADSLL++IGEQAE+MSNGIFKS +HRAVTNSERQRI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVV FC             LIDEKRPRLFRS KNYL+TYFQNYQ+G+R+VDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

TrEMBL top hitse value%identityAlignment
A0A0A0LW96 Uncharacterized protein9.2e-14283.56Show/hide
Query:  LSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFW
        L S S GE PLN LRLALSTWGCFQATNH ISSSFLEK+RKISEQFFSLPIEEKMRYGREVDGMEGYGNDL FSNQQ LDWSDRLY VT+P+DERRL+ W
Subjt:  LSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFW

Query:  PLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWY
        PLNPPSF +EDLHEYTVK+M+II+TVLIAMARSLNVEPNSFTDQVG+RPTLF RFNFYPPCSTPHLVLGLKEHSDG+AIT+LLLDKQVEGL+LRKDDQWY
Subjt:  PLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWY

Query:  RVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        RVPVPA+ADSLL++IGEQAE+MSNGIFKS +HRAVTNSERQRISVV FC             LIDEKRPRLFRS KNYL+TYFQNYQ+G+R+VDGLRI
Subjt:  RVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

A0A1S3CP07 protein SRG1-like2.8e-19195.77Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSF +EDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVVSFC             LIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

A0A5A7U2F3 Protein SRG1-like1.4e-19094.65Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP+D RRLEFWPLNPPSFS+EDLHEYTVKLM+II+TVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSER+RI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVVSFC             LIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

A0A5D3CEG2 Protein SRG1-like2.5e-19295.49Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
        EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFS+EDLHEYTVKLM+II+TVLIAMARSLNVEPNSFTD
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SVVSFC             LIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

A0A6J1BYU1 uncharacterized protein LOC1110069922.0e-13668.73Show/hide
Query:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS
        MA+      T+T QQ+LLINGG TPESYIYK GY GGDSNNN PLPLA+IPV+DL+QLSS+    A L  LRLAL++WGCFQA NH ISSSFL K+ +IS
Subjt:  MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKIS

Query:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD
         QFFSLP+EEK +  RE+ G+EGYG D++FS QQILDW+DRLYL  NP+DER+L++WP NP SF +EDLHE+T+KL +II+TVL+AMARS+NVE NSFT+
Subjt:  EQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTD

Query:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI
        QVGKRP LF RFNFYPPCS P LVLGLKEHSDG+AIT++LLD++VEGL+ RKDDQW+RVPVPA+ADSLLI IGEQAEIMSNG+FKS +HRAVTNSE+QRI
Subjt:  QVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRI

Query:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        SV  FC             LIDE+RPRL+R+ KNY+ +YFQ+YQKG+R VD L+I
Subjt:  SVVSFC------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

SwissProt top hitse value%identityAlignment
A2A1A0 S-norcoclaurine synthase 13.3e-4838.13Show/hide
Query:  QIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP
        +IPV+DLS+L         L     A   WG FQ  NHG+    +EKM+  +E FF LP +EK  Y +  +GMEGYG   + S +Q LDW+D  +L+T P
Subjt:  QIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP

Query:  KDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGL
          ER + FWP +P SF +E + +Y+++L K+   +   MA++L +E    T  +  R    +     P  S+    LGL  HSD T +T+L+   +V GL
Subjt:  KDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGL

Query:  ELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF
         ++KD++W  VP+  +  + ++ IG+  EIMSNGI+KS+ HRAV N++++R+S+ +F
Subjt:  ELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF

D4N502 Codeine O-demethylase2.9e-4436.68Show/hide
Query:  IPVLDLSQLSSAS--VGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTN
        +PV+DL  L S    VG+  L+ L  A   WG FQ  NHG+ +  ++ ++   + FF+LP+ EK +YG++    EG+G   I S  Q LDW++   +++ 
Subjt:  IPVLDLSQLSSAS--VGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTN

Query:  PKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLN-VEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVE
        P   R+   +P  P  F +E L  Y  K+ K+   V   + +SL  VE    TD + +      R N+YPPC  P LVLGL  HSD + +T+LL   +VE
Subjt:  PKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLN-VEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVE

Query:  GLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF
        GL++RK+++W  + +  + D+ ++ +G+  EIM+NGI++S+ HRAV NS ++R+S+ +F
Subjt:  GLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF

O80449 Jasmonate-induced oxygenase 44.1e-4634.63Show/hide
Query:  QIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP
        +IPVLD++ +     G   L  +R A   WG FQ  NHG++ S +E++R    +FF LP+EEK +Y    D  EGYG+ L       LDWSD  +L   P
Subjt:  QIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP

Query:  KDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQV--GKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVE
           R    WP  PP   +E + +Y  ++ K+ + +   ++ SL ++PN     +  G +     R NFYP C  P L LGL  HSD   IT+LL D++V 
Subjt:  KDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQV--GKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVE

Query:  GLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC------------WLIDEKRPRLFRSTK--NYLQTYFQNYQ
        GL++R+ D W  V + +V ++L++ IG+Q +I+SNGI+KS+ H+ + NS  +R+S+  F              L+   RP L++  +   Y     Q   
Subjt:  GLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC------------WLIDEKRPRLFRSTK--NYLQTYFQNYQ

Query:  KGERSVDGL
         G+  VD L
Subjt:  KGERSVDGL

Q39224 Protein SRG13.9e-4933.01Show/hide
Query:  QIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP
        +IP++D+ +L S++  ++ +  L  A   WG FQ  NHGI SSFL+K++   + FF+LP+EEK ++ +  D +EG+G   + S  Q LDW+D  +    P
Subjt:  QIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNP

Query:  KDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFT---DQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQV
         + R+   +P  P  F ++ L  Y+ ++  +   ++  MAR+L ++P       D V    ++  R N+YPPC  P  V+GL  HSD   +TVL+    V
Subjt:  KDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFT---DQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQV

Query:  EGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF------------CWLIDEKRPRLFR--STKNYLQTYFQNY
        EGL+++KD +W  VPV  + ++ ++ IG+  EI++NG ++S+ HR V NSE++R+S+ +F              L++ ++   F+  + K Y    F   
Subjt:  EGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF------------CWLIDEKRPRLFR--STKNYLQTYFQNY

Query:  QKGERSVDGLRI
          G+  +D LRI
Subjt:  QKGERSVDGLRI

Q94LP4 2-oxoglutarate-dependent dioxygenase 119.1e-4634.78Show/hide
Query:  IPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL
        +P  Q L +       H PE YI +      +  NN    +A IP++DL +L      E     LR A   WG F   NHG+    +  +++    FFS 
Subjt:  IPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL

Query:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP
        P++ K  Y +  + +EGYG   +FS  Q LDW+D LYL  +P D R L FWP +P SF ++ +  Y+ +   +   +   MA+++  +P S  D   ++P
Subjt:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP

Query:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF
            R  +YPPC     V+GL  HSD   +T+LL    V+GL+++KD +W+ +  P  A  L+  IG+  EI+SNG F+S+ HRAV N  ++RIS   F
Subjt:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF

Arabidopsis top hitse value%identityAlignment
AT1G49390.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.2e-8545.27Show/hide
Query:  PQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASV-GEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL
        P+ +T+ Q+++  G   PE Y++     G     N  +P   IP +DLS L S+SV G+  +  L  ALSTWG  Q  NHGI+ +FL+K+ K+++QFF+L
Subjt:  PQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASV-GEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL

Query:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP
        P EEK +  RE   ++GYGND+I S+ Q+LDW DRL+L T P+D+R+L+FWP  P  FS E L EYT+K   +I+    AMARSL +E N F +  G+  
Subjt:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP

Query:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC
         +  RFNF+PPC  P  V+G+K H+DG+AIT+LL DK VEGL+  KD +WY+ P+  V D++LI +G+Q EIMSNGI+KS +HR VTN E++RISV +FC
Subjt:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC

Query:  ------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
                     L+ E RPRL+++   Y+  +++ YQ+G R+++   I
Subjt:  ------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

AT3G21420.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.3e-5237.7Show/hide
Query:  QIPVLDLSQLSSASVGEAPLNHLRL--ALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVT
        QIPV+DLS+LS     +     L+L  A   WG FQ  NHGI    +E + +++ +FF +P+EEK +Y  E   ++GYG   IFS  Q LDW +   L  
Subjt:  QIPVLDLSQLSSASVGEAPLNHLRL--ALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVT

Query:  NPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDK-QV
        +P   R  + WP  P  FS E L  Y+ ++ ++   +L  +A SL ++   F +  G+      R N+YPPCS+P LVLGL  HSDG+A+TVL   K   
Subjt:  NPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRPTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDK-QV

Query:  EGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC-------------WLIDEKRPRLFRSTK--NYLQTYFQN
         GL++ KD+ W  VPV  + ++L+I IG+  E++SNG +KS+ HRAVTN E++R+++V+F               + DE  P  +RS    +Y   Y  N
Subjt:  EGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC-------------WLIDEKRPRLFRSTK--NYLQTYFQN

Query:  YQKGERSVDGLRI
          +G++S+D  +I
Subjt:  YQKGERSVDGLRI

AT5G20400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.2e-8846.42Show/hide
Query:  PQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLS-QLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL
        P+ +T+ Q+++  G   PE Y++     G     N  +P   IP +DL+  LSS+  G+  L+ L  ALSTWG  Q  NHGI+ +FL+K+ K++++FF+L
Subjt:  PQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLS-QLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL

Query:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP
        P EEK +  RE+D ++GYGND+I  + Q+LDW DRLY+ T P+D+R+L FWP  P  F +E LHEYT+K   +I+    AMARSL +E NSF D  G+  
Subjt:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP

Query:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC
        TL  RFN YPPC +P  V+G+K H+DG+AIT+LL DK V GL+ +KD +WY+ P+  V D++LI +G+Q EIMSNGI+KS +HR VTN E++RISV +FC
Subjt:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC

Query:  ------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
                     L+ E RPRL+++ K Y++ YF+ YQ+G R ++   I
Subjt:  ------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI

AT5G20550.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.1e-8244.48Show/hide
Query:  PQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSAS-VGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL
        P+ +T+ Q+++  G   PE Y+          + N  +P+  IP +DLS L S S  G   L+ L  ALSTWG  Q  NHGI+ + L+K+ K++++F +L
Subjt:  PQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSAS-VGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSL

Query:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP
        P EEK +Y RE+  ++GYGND+I  + Q+LDW DRLY+ T P+D+R+L+FWP  P  F +E LHEYT+K   + + V  AMA SL +E N F D  G+  
Subjt:  PIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRP

Query:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC
        T+  RFN YPPC  P  V+G++ H+D +A T+LL DK VEGL+  KD +WY+ PV A +D++LI +G+Q EIMSNGI+KS +HR VTN+E++RISV +FC
Subjt:  TLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFC

Query:  ------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSV
                     L+ E RPRL++  KNY+    + Y +G+R +
Subjt:  ------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSV

AT5G54000.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein9.5e-8343.71Show/hide
Query:  PQTQTLQQQLLINGGHTPESYIYKGGYHG-GDSNNNTPLPLAQIPVLDLSQL-SSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFS
        P+ +T+ Q+++  G   PE Y+Y     G GD   N  LP  +I ++DL+ L SS+  G   L+ L  A+STWG  Q  NHGIS + L+K+ ++++QFF 
Subjt:  PQTQTLQQQLLINGGHTPESYIYKGGYHG-GDSNNNTPLPLAQIPVLDLSQL-SSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFS

Query:  LPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKR
        LP +EK +Y RE+   +G+GND+I S+ Q+LDW DRLYL+T P+D+R+L+FWP NP  F +E LHEYT+K   +++    A+ARSL +E N F +  G+ 
Subjt:  LPIEEKMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKR

Query:  PTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF
         TL  RFN YPPC  P  VLGLK HSDG+A T++L DK VEGL+  KD +WY+  +  +  ++LI +G+  E+MSNGI+KS +HR V N +++RI V +F
Subjt:  PTLFKRFNFYPPCSTPHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSF

Query:  C------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI
        C             L+ E RPRL+++ K   + +F  YQ+G R ++   I
Subjt:  C------------WLIDEKRPRLFRSTKNYLQTYFQNYQKGERSVDGLRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACAAAATTCTCAGATCCCACAAACACAAACACTCCAACAACAACTTCTCATCAATGGCGGCCATACGCCCGAAAGTTATATTTACAAAGGCGGCTACCACGGCGG
AGATTCCAACAATAATACTCCACTTCCATTGGCACAGATTCCGGTCCTTGACCTTTCTCAACTTTCATCTGCGTCGGTGGGCGAGGCTCCGCTGAACCACCTCCGGTTGG
CTCTTAGTACTTGGGGATGCTTTCAGGCAACTAATCACGGTATTTCAAGTTCATTTTTGGAGAAAATGAGAAAAATAAGTGAGCAATTTTTCTCATTGCCCATTGAAGAG
AAGATGAGATATGGAAGAGAAGTGGATGGAATGGAAGGATACGGGAATGATTTAATTTTTTCAAACCAACAAATTCTAGATTGGTCCGATCGTTTATATCTCGTGACAAA
TCCTAAGGATGAAAGACGTCTCGAGTTTTGGCCTTTAAATCCTCCATCTTTCAGCAAGGAAGATCTACATGAGTATACAGTAAAATTAATGAAAATAATCGACACAGTGC
TGATAGCCATGGCAAGATCCCTAAACGTGGAGCCCAATAGCTTTACCGATCAAGTAGGAAAGCGACCAACTTTATTCAAAAGGTTCAATTTCTATCCGCCGTGTTCGACA
CCACATCTTGTTCTTGGACTCAAAGAACACTCCGATGGCACAGCTATCACCGTTCTTCTACTCGACAAACAAGTTGAAGGCCTCGAATTGCGAAAAGATGATCAGTGGTA
TAGAGTCCCTGTTCCTGCCGTTGCTGATTCTCTTCTCATCATCATTGGGGAACAAGCCGAAATCATGAGTAATGGAATCTTCAAGAGCCTTATTCATCGGGCGGTGACGA
ATTCAGAGAGACAGAGGATTTCAGTGGTGTCTTTCTGCTGGCTGATCGACGAGAAGAGACCGAGATTGTTCAGAAGTACGAAGAACTATTTGCAAACATATTTCCAGAAC
TACCAAAAGGGAGAGAGATCAGTAGATGGATTGAGGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCACAAAATTCTCAGATCCCACAAACACAAACACTCCAACAACAACTTCTCATCAATGGCGGCCATACGCCCGAAAGTTATATTTACAAAGGCGGCTACCACGGCGG
AGATTCCAACAATAATACTCCACTTCCATTGGCACAGATTCCGGTCCTTGACCTTTCTCAACTTTCATCTGCGTCGGTGGGCGAGGCTCCGCTGAACCACCTCCGGTTGG
CTCTTAGTACTTGGGGATGCTTTCAGGCAACTAATCACGGTATTTCAAGTTCATTTTTGGAGAAAATGAGAAAAATAAGTGAGCAATTTTTCTCATTGCCCATTGAAGAG
AAGATGAGATATGGAAGAGAAGTGGATGGAATGGAAGGATACGGGAATGATTTAATTTTTTCAAACCAACAAATTCTAGATTGGTCCGATCGTTTATATCTCGTGACAAA
TCCTAAGGATGAAAGACGTCTCGAGTTTTGGCCTTTAAATCCTCCATCTTTCAGCAAGGAAGATCTACATGAGTATACAGTAAAATTAATGAAAATAATCGACACAGTGC
TGATAGCCATGGCAAGATCCCTAAACGTGGAGCCCAATAGCTTTACCGATCAAGTAGGAAAGCGACCAACTTTATTCAAAAGGTTCAATTTCTATCCGCCGTGTTCGACA
CCACATCTTGTTCTTGGACTCAAAGAACACTCCGATGGCACAGCTATCACCGTTCTTCTACTCGACAAACAAGTTGAAGGCCTCGAATTGCGAAAAGATGATCAGTGGTA
TAGAGTCCCTGTTCCTGCCGTTGCTGATTCTCTTCTCATCATCATTGGGGAACAAGCCGAAATCATGAGTAATGGAATCTTCAAGAGCCTTATTCATCGGGCGGTGACGA
ATTCAGAGAGACAGAGGATTTCAGTGGTGTCTTTCTGCTGGCTGATCGACGAGAAGAGACCGAGATTGTTCAGAAGTACGAAGAACTATTTGCAAACATATTTCCAGAAC
TACCAAAAGGGAGAGAGATCAGTAGATGGATTGAGGATTTAG
Protein sequenceShow/hide protein sequence
MAQNSQIPQTQTLQQQLLINGGHTPESYIYKGGYHGGDSNNNTPLPLAQIPVLDLSQLSSASVGEAPLNHLRLALSTWGCFQATNHGISSSFLEKMRKISEQFFSLPIEE
KMRYGREVDGMEGYGNDLIFSNQQILDWSDRLYLVTNPKDERRLEFWPLNPPSFSKEDLHEYTVKLMKIIDTVLIAMARSLNVEPNSFTDQVGKRPTLFKRFNFYPPCST
PHLVLGLKEHSDGTAITVLLLDKQVEGLELRKDDQWYRVPVPAVADSLLIIIGEQAEIMSNGIFKSLIHRAVTNSERQRISVVSFCWLIDEKRPRLFRSTKNYLQTYFQN
YQKGERSVDGLRI