CuGenDBv2

MC09g1872 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g1872
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: alpha-ketoglutarate-dependent dioxygenase alkB
Genome location: MC09:23616810..23621334
RNA-Seq Expression: MC09g1872
Synteny: MC09g1872
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domains:
IPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
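The genome location string in this record can be split into a chromosome name and two coordinates. The following is a minimal Python sketch of that parsing; the `GenomeLocation` type and `parse_location` function are hypothetical names introduced here, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chromosome: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location written as 'MC09:23616810..23621334'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end = match.groups()
    return GenomeLocation(chromosome, int(start), int(end))


loc = parse_location("MC09:23616810..23621334")
print(loc.chromosome, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on the chromosome, including any introns and untranslated regions, so it is expected to be longer than the CDS.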


Homology
GenBank top hits (with e value, % identity and alignment)
KAG6573589.1 Alpha-ketoglutarate-dependent dioxygenase alkB, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.17e-239 | % identity: 90.56
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFKRI ECYN DGALPLG+NAT+CDLDGPVFCLENRPGFYFIPGALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQWIR+SL  FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVE NEISA NVDSD EPSV  GN++GWKFVEENTVSSRR TCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISL HN++PSALCQL KRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIF D E++E SLLEK FS++DDLHFLEYIRTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

XP_022151981.1 alpha-ketoglutarate-dependent dioxygenase alkB [Momordica charantia] | e value: 2.52e-266 | % identity: 100
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

XP_022945036.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita moschata] | e value: 4.99e-241 | % identity: 90.83
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFKRILECYN DGALPLG+NAT+CDLDGPVFCLENRPGFYFIPGALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVE NEISA NVDSD EPS+  GN++GWKFVEENTVSSRR TCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISL HN++PSALCQL KRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIF D E++E SLLEK+FS++DDLHFLEYIRTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

XP_022966806.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita maxima] | e value: 1.59e-237 | % identity: 90.56
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFKRILECYN DGALPLG+NAT+CDLDGPVFCLENRPGFYFIPGALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQ IRESL  FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVE NEISA NVDSD EPSV  GN++GWKFVEENTVSSRR TCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISL HN +PSALCQLAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD PVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIF D E++E SLLEK+FS++DDLHFLEY+RTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

XP_023541203.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita pepo subsp. pepo] | e value: 3.21e-237 | % identity: 89.72
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFKRILECYN DGALPLG+NAT+CDLDGPVFCLENRPGFYFI GALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAKEK+VLVE NEISA NVDSD EPS+  GN++GWKFVEENTVSSRR TCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISL HN++PSALCQL KRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVS+SLGCKAIFLLGGKSRQD PVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIF D E++E SLLEK+FS++DDLHFLEYIRTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

TrEMBL top hits (with e value, % identity and alignment)
A0A0A0KS00 Fe2OG dioxygenase domain-containing protein | e value: 1.64e-230 | % identity: 86.98
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPK VDLSEVIDFK ILE Y  DG+LP+G+NAT CDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTC-KSVPASVLLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAKE +VLVE +EIS F +DSD+EPS+S GNTH WKFVEENTVSSRR T  KS+PASVLLRKLRWSTL
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTC-KSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISL HNK+PSALCQLAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        RSGDVVLMAGEARECFHGVPRIFID E++E S LE   +NQDDLH LEYIRTSRININIRQ
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

A0A1S3BE32 LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB | e value: 4.03e-231 | % identity: 86.98
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPK VDLSEVIDFK ILE Y  DG+LP+G+ AT CDLD PVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRR-TTCKSVPASVLLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAKEK+VLVE +EIS FN+DSD+EPS+S G+TH WKFVEENTVSSRR T CKS+ ASVLLRKLRWSTL
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRR-TTCKSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISL HNK+PSALCQLAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        RSGDVVLMAGEARECFHGVPRIFID E++E S LE+  SNQDDLHFLEYIRTSRININIRQ
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

A0A6J1DG90 alpha-ketoglutarate-dependent dioxygenase alkB | e value: 1.22e-266 | % identity: 100
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

A0A6J1FZQ9 alpha-ketoglutarate-dependent dioxygenase alkB | e value: 2.42e-241 | % identity: 90.83
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFKRILECYN DGALPLG+NAT+CDLDGPVFCLENRPGFYFIPGALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVE NEISA NVDSD EPS+  GN++GWKFVEENTVSSRR TCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISL HN++PSALCQL KRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIF D E++E SLLEK+FS++DDLHFLEYIRTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

A0A6J1HQB4 alpha-ketoglutarate-dependent dioxygenase alkB | e value: 7.71e-238 | % identity: 90.56
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFKRILECYN DGALPLG+NAT+CDLDGPVFCLENRPGFYFIPGALSLEEQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG
        CQ IRESL  FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVE NEISA NVDSD EPSV  GN++GWKFVEENTVSSRR TCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR
        LQFDWSKRSYDISL HN +PSALCQLAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD PVAMFLR
Subjt:  LQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        SGDVVLMAGEARECFHGVPRIF D E++E SLLEK+FS++DDLHFLEY+RTSRININIRQ
Subjt:  SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

SwissProt top hits (with e value, % identity and alignment)
O60066 Alpha-ketoglutarate-dependent dioxygenase abh1 | e value: 2.9e-27 | % identity: 26.27
Query:  ERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQCQWIRESL-TS
        +   FR+ EK+YK   D          +P   D+SEV+D          +     G  A   ++   VF  +  PG   +   +S E Q Q ++  + T 
Subjt:  ERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQCQWIRESL-TS

Query:  FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFV----EENTVSSRRTTCKSVPASVLLRKLRWSTLGLQFDWS
           P N+TN +  Y                                  +  GN   W+       E+ +     T       ++ +KLRW TLG Q+DW+
Subjt:  FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFV----EENTVSSRRTTCKSVPASVLLRKLRWSTLGLQFDWS

Query:  KRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVL
         + Y         P  L    +++   +      +K EAAIVN+++ GD+L  H+D+ E D + P++S+S+G   I+L+G +SR + P A+ L SGDVV+
Subjt:  KRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVL

Query:  MAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        M G +R+ FH VP+I  +   +      K +          +I   R+N N+RQ
Subjt:  MAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

P0CB42 Nucleic acid dioxygenase ALKBH1 | e value: 4.0e-29 | % identity: 31.4
Query:  LENRPGFYFIPGALSLEEQCQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTT
        LE  PGF FIP       Q  W+++ L  + Q PN  N                                   ++  ++   T G     +  + S+  T
Subjt:  LENRPGFYFIPGALSLEEQCQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTT

Query:  CKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKA
         K  P S LL +LRW TLG  ++W  + Y     +   PS L  L++++A A    G  F+ EA I+NY+    +LG H+D  E D SKP++S S G  A
Subjt:  CKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKA

Query:  IFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETD------ETSL--------LEKQFSNQDDLHFLEYIRTSRININIRQ
        IFLLGG  R + P AMF+ SGD+++M+G +R   H VPR+    + +      ET L        L +  S +D      Y+RT+R+N+ +RQ
Subjt:  IFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETD------ETSL--------LEKQFSNQDDLHFLEYIRTSRININIRQ

Q13686 Nucleic acid dioxygenase ALKBH1 | e value: 2.1e-30 | % identity: 31.4
Query:  LENRPGFYFIPGALSLEEQCQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTT
        L+  PGF FIP       Q  W+++ L  + Q PN  N         D  ++ +E + L E ++                      +F+     + RR  
Subjt:  LENRPGFYFIPGALSLEEQCQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTT

Query:  CKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKA
            P S LL KLRW T+G  ++W  + Y     +   PS L  L++++AAA     E+F+ EA I+NY+    +LG H+D  E D SKP++S S G  A
Subjt:  CKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKA

Query:  IFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETD------ETSL--------LEKQFSNQDDLHFLEYIRTSRININIRQ
        IFLLGG  R + P AMF+ SGD+++M+G +R   H VPR+  + E +      E  L        + +  S +D      Y++T+R+N+ +RQ
Subjt:  IFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETD------ETSL--------LEKQFSNQDDLHFLEYIRTSRININIRQ

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB | e value: 7.0e-42 | % identity: 32
Query:  KSSKKKKLPKQ----VDLSEVIDF-----------KRILECYN----HDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQCQWIRESLTSFP
        KS+  K +PK+    +D S V+DF           K I++C +    HD              +  V+ L+  PGFYFI    +  +Q +WI+ +L  + 
Subjt:  KSSKKKKLPKQ----VDLSEVIDF-----------KRILECYN----HDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQCQWIRESLTSFP

Query:  QPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLGLQFDWSKRSYDI
         PPN  N    +GPI++L+    EK ++ E  +    + D +IE      + +G     E   + R+          LL KL WSTLG Q+ W+ R Y  
Subjt:  QPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLGLQFDWSKRSYDI

Query:  SLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEAR
           + + P  L +L +++A A     + +  EAA VN+++    +GGHLDD E +  KPI+S+S G  A+FL+G ++R   PV +F+RSGD+V+M G +R
Subjt:  SLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEAR

Query:  ECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYI--RTSRININIRQ
         C+HGV +I   VE      L  +  +QD  + ++++  +  R+NIN RQ
Subjt:  ECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYI--RTSRININIRQ

Q9SA98 Alpha-ketoglutarate-dependent dioxygenase alkB | e value: 2.7e-134 | % identity: 65.93
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MY S   +DD +RTAFR AEKKYK YY+   K S+KKKLPK +DLSE++DF  I + +N+DG LP G+  ++ D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTV-SSRRTTCKSVPASVLLRKLRWSTL
        C+WI+ESLTSFPQPPNRTNHNAIYGPI DLF +AKE +VLV+           D+         + WKF EE  +  + R++CKSV ASVLLRKLRWSTL
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTV-SSRRTTCKSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SL HN +P ALCQLAK  AA AMP GEEF+PE AIVNYF  GD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        RSGDVVLMAGEARECFHG+PRIF   E  +   LE + S++    F EYI+TSRININIRQ
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

Arabidopsis top hits (with e value, % identity and alignment)
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein | e value: 1.9e-135 | % identity: 65.93
Query:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ
        MY S   +DD +RTAFR AEKKYK YY+   K S+KKKLPK +DLSE++DF  I + +N+DG LP G+  ++ D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQ

Query:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTV-SSRRTTCKSVPASVLLRKLRWSTL
        C+WI+ESLTSFPQPPNRTNHNAIYGPI DLF +AKE +VLV+           D+         + WKF EE  +  + R++CKSV ASVLLRKLRWSTL
Subjt:  CQWIRESLTSFPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTV-SSRRTTCKSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SL HN +P ALCQLAK  AA AMP GEEF+PE AIVNYF  GD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLLHNKMPSALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ
        RSGDVVLMAGEARECFHG+PRIF   E  +   LE + S++    F EYI+TSRININIRQ
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 2.9e-11 | % identity: 35.25
Query:  PEAAIVNYFASGDSLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFS
        P+  IVN+++S   LG H D  E++ S     P+VS S+G  A FL G +  +D    + L SGDV+L  G +R+ FHGV  I  D  T   +LL++   
Subjt:  PEAAIVNYFASGDSLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFS

Query:  NQDDLHFLEYIRTSRININIRQ
                  +R  R+N+  RQ
Subjt:  NQDDLHFLEYIRTSRININIRQ

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 4.2e-10 | % identity: 34.71
Query:  PEAAIVNYFASGDSLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFS
        P+  IVN+++    LG H D  E++ S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     +   SLL +   
Subjt:  PEAAIVNYFASGDSLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFS

Query:  NQDDLHFLEYIRTSRININIR
                  +RT R+N+  R
Subjt:  NQDDLHFLEYIRTSRININIR

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein | e value: 4.2e-10 | % identity: 34.71
Query:  PEAAIVNYFASGDSLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFS
        P+  IVN+++    LG H D  E++ S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     +   SLL +   
Subjt:  PEAAIVNYFASGDSLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFS

Query:  NQDDLHFLEYIRTSRININIR
                  +RT R+N+  R
Subjt:  NQDDLHFLEYIRTSRININIR

AT5G43260.1 chaperone protein dnaJ-related | e value: 4.9e-06 | % identity: 64.52
Query:  GSGRMFCSSCGGTGTGRPIPAQLSVRRTNHP
        GSGR  CS+CGG+GTGRP+PAQ++V+  N P
Subjt:  GSGRMFCSSCGGTGTGRPIPAQLSVRRTNHP
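In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following Python sketch counts these for one query/midline pair; `alignment_stats` is a hypothetical helper, and the example strings are the last segment of the first GenBank hit (KAG6573589.1) with the `Query:` prefix and leading padding removed.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment segment.

    Assumes `query` and `midline` are already trimmed to the same columns,
    i.e. the 'Query:' label and its padding have been stripped from both.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # A letter on the midline means the subject residue equals the query residue.
    identities = sum(1 for symbol in midline if symbol.isalpha())
    # '+' marks a substitution scored as similar; positives include identities.
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length,
    }


query = "SGDVVLMAGEARECFHGVPRIFIDVETDETSLLEKQFSNQDDLHFLEYIRTSRININIRQ"
midline = "SGDVVLMAGEARECFHGVPRIF D E++E SLLEK FS++DDLHFLEYIRTSRININIRQ"
stats = alignment_stats(query, midline)
print(stats)
```

The figure computed here applies to this one segment only. The % identity reported for each hit is taken over the full alignment, so the two numbers are not expected to match.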


Sequences
CDS sequence
ATGTACGGATCCGACAAAGGCACCGACGATTTGGAGCGCACCGCTTTCAGAATAGCAGAAAAGAAATACAAGTTCTACTACGACGACACCTTTAAATCTTCCAAAAAGAA
AAAACTACCGAAACAAGTGGATTTGTCGGAGGTTATCGATTTCAAGCGCATTCTAGAATGTTACAATCACGATGGCGCACTTCCGCTGGGCATGAACGCGACTCAGTGCG
ATCTCGATGGGCCAGTTTTCTGCTTGGAGAATCGTCCTGGATTTTATTTCATTCCTGGAGCGTTAAGCTTAGAAGAGCAATGCCAATGGATTAGGGAGAGTCTAACGAGT
TTCCCGCAGCCTCCTAACAGAACCAATCACAACGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCAAAGGAAAAGAGAGTTTTAGTGGAAGCAAATGAAATCTC
TGCTTTCAACGTTGATTCCGACATTGAACCTTCGGTTAGCACTGGAAATACTCATGGATGGAAGTTTGTGGAGGAAAATACTGTTTCATCCAGAAGGACGACCTGCAAAT
CAGTTCCAGCTTCTGTGTTACTTAGAAAGTTGCGTTGGAGCACCCTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCTACATAATAAGATGCCC
TCTGCACTGTGTCAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCGGCTGGGGAAGAATTCAAGCCTGAAGCTGCAATAGTGAATTATTTTGCTTCGGGCGACAGTCT
CGGGGGTCACCTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGATGCAAAGCTATTTTCCTCTTGGGAGGCAAGTCGAGACAGGATCCAC
CGGTAGCCATGTTTCTTCGAAGTGGAGATGTCGTGCTTATGGCTGGAGAAGCGAGGGAATGTTTTCATGGTGTACCTCGGATCTTCATCGATGTAGAAACTGACGAAACT
TCTCTTCTTGAAAAGCAGTTTTCAAATCAAGATGATTTGCATTTTCTGGAATACATTAGAACTTCAAGAATAAACATCAACATTAGACAGGGCTCCGGTCGCATGTTCTG
TAGCAGCTGCGGCGGAACGGGTACGGGTCGCCCCATCCCGGCCCAACTCTCCGTTCGCCGTACCAACCACCCCTCTTCCTCC
mRNA sequence
AATAATATTGATGTGCATACTCGTTATAAATTAAAATTAAGACATGTAATTTTCTTTTCGATTACAACCAACTGTGAGTATGGAGATTTGAACGACCAACGTTATAGAAG
AATTATAGAAGAAGTAGATATCAATTAAAGACTTATTTGATAAATTCTTAAACTTGGATATATTTGCATTCACCAGAAAAAAAAAAAAAAGTTATGAAACTTCATGTGTC
TGACTAGTCAAAACCCTAAGCGTGATTACTCCTCCCCCGTCCATCGCCAGAATTCAGAAGCTCAACCTAACGCCCATCGAGTAACTCACCGGCGGTAGCGGATACCGGTG
GATAAAATGTACGGATCCGACAAAGGCACCGACGATTTGGAGCGCACCGCTTTCAGAATAGCAGAAAAGAAATACAAGTTCTACTACGACGACACCTTTAAATCTTCCAA
AAAGAAAAAACTACCGAAACAAGTGGATTTGTCGGAGGTTATCGATTTCAAGCGCATTCTAGAATGTTACAATCACGATGGCGCACTTCCGCTGGGCATGAACGCGACTC
AGTGCGATCTCGATGGGCCAGTTTTCTGCTTGGAGAATCGTCCTGGATTTTATTTCATTCCTGGAGCGTTAAGCTTAGAAGAGCAATGCCAATGGATTAGGGAGAGTCTA
ACGAGTTTCCCGCAGCCTCCTAACAGAACCAATCACAACGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCAAAGGAAAAGAGAGTTTTAGTGGAAGCAAATGA
AATCTCTGCTTTCAACGTTGATTCCGACATTGAACCTTCGGTTAGCACTGGAAATACTCATGGATGGAAGTTTGTGGAGGAAAATACTGTTTCATCCAGAAGGACGACCT
GCAAATCAGTTCCAGCTTCTGTGTTACTTAGAAAGTTGCGTTGGAGCACCCTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCTACATAATAAG
ATGCCCTCTGCACTGTGTCAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCGGCTGGGGAAGAATTCAAGCCTGAAGCTGCAATAGTGAATTATTTTGCTTCGGGCGA
CAGTCTCGGGGGTCACCTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGATGCAAAGCTATTTTCCTCTTGGGAGGCAAGTCGAGACAGG
ATCCACCGGTAGCCATGTTTCTTCGAAGTGGAGATGTCGTGCTTATGGCTGGAGAAGCGAGGGAATGTTTTCATGGTGTACCTCGGATCTTCATCGATGTAGAAACTGAC
GAAACTTCTCTTCTTGAAAAGCAGTTTTCAAATCAAGATGATTTGCATTTTCTGGAATACATTAGAACTTCAAGAATAAACATCAACATTAGACAGGGCTCCGGTCGCAT
GTTCTGTAGCAGCTGCGGCGGAACGGGTACGGGTCGCCCCATCCCGGCCCAACTCTCCGTTCGCCGTACCAACCACCCCTCTTCCTCC
Protein sequence
MYGSDKGTDDLERTAFRIAEKKYKFYYDDTFKSSKKKKLPKQVDLSEVIDFKRILECYNHDGALPLGMNATQCDLDGPVFCLENRPGFYFIPGALSLEEQCQWIRESLTS
FPQPPNRTNHNAIYGPIQDLFIAAKEKRVLVEANEISAFNVDSDIEPSVSTGNTHGWKFVEENTVSSRRTTCKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLLHNKMP
SALCQLAKRMAAAAMPAGEEFKPEAAIVNYFASGDSLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDVETDET
SLLEKQFSNQDDLHFLEYIRTSRININIRQGSGRMFCSSCGGTGTGRPIPAQLSVRRTNHPSSS
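The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following Python sketch does this with the standard genetic code; it assumes the standard nuclear code applies to this gene, and `translate` is a hypothetical helper rather than part of any database tooling. The example input is the first 39 nucleotides of the CDS sequence.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; unknown codons become 'X', stops '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


cds_start = "ATGTACGGATCCGACAAAGGCACCGACGATTTGGAGCGC"
print(translate(cds_start))
```

Passing the full CDS sequence (with its line breaks left in place, since whitespace is removed) allows the result to be compared with the protein sequence shown in the record.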