; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022266 (gene) of Snake gourd v1 genome

Gene IDTan0022266
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionalpha-ketoglutarate-dependent dioxygenase alkB
Genome locationLG04:403629..406118
RNA-Seq ExpressionTan0022266
SyntenyTan0022266
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573589.1 Alpha-ketoglutarate-dependent dioxygenase alkB, partial [Cucurbita argyrosperma subsp. sororia]2.3e-19893.92Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRI ECYNQDGALPLGVNATKCDLD PVFCLENRPGFYF+PGALSLEEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG
        CQWIR+SL +FPQPPNRTNHNAIYGP QDLFIAAKEKRVLVEDNEIS  NVDSD EPSV NGNS+GWKFVEENTVS RRGTCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR
        LQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFLR
Subjt:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        SGDVVLMAGEARECFHGVPRIFTDEESEEISL+E+ FSS+DDLHFLEYIRTSRININIRQVF
Subjt:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

XP_022945036.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita moschata]3.6e-19993.92Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLD PVFCLENRPGFYF+PGALSLEEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG
        CQWIRESL +FPQPPNRTNHNAIYGP QDLFIAAKEKRVLVEDNEIS  NVDSD EPS+ NGNS+GWKFVEENTVS RRGTCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR
        LQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFLR
Subjt:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        SGDVVLMAGEARECFHGVPRIFTDEESEEISL+E++FSS+DDLHFLEYIRTSRININIRQVF
Subjt:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

XP_022966806.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita maxima]5.7e-19793.92Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLD PVFCLENRPGFYF+PGALSLEEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG
        CQ IRESL +FPQPPNRTNHNAIYGP QDLFIAAKEKRVLVEDNEIS  NVDSD EPSV NGNS+GWKFVEENTVS RRGTCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR
        LQFDWSKRSYDISLPHN IPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD P+AMFLR
Subjt:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        SGDVVLMAGEARECFHGVPRIFTDEESEEISL+E++FSS+DDLHFLEY+RTSRININIRQVF
Subjt:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

XP_023541203.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita pepo subsp. pepo]9.8e-19793.09Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLD PVFCLENRPGFYF+ GALSLEEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG
        CQWIRESL +FPQPPNRTNHNAIYGP QDLFIAAKEK+VLVEDNEIS  NVDSD EPS+ NGNS+GWKFVEENTVS RRGTCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR
        LQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVS+SLGCKAIFLLGGKSRQD P+AMFLR
Subjt:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        SGDVVLMAGEARECFHGVPRIFTDEESEEISL+E++FSS+DDLHFLEYIRTSRININIRQVF
Subjt:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

XP_038891364.1 alpha-ketoglutarate-dependent dioxygenase alkB [Benincasa hispida]8.5e-19392.01Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK IL CY QDGALPLGVNA KCDLD PVFCLENRPGFYF+PGALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGT-CKSVPASVLLRKLRWSTL
        CQWIRESLTNFPQP NRTNHNAIYG  QDLFIAAK K+VLVED+EISDFNVDSD+E SVSNGN+H WKFVEENTVS +RGT CKS+PASVLLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGT-CKSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSYDISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEISL+ER FS+QDDLHFLEYI+ SRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

TrEMBL top hitse value%identityAlignment
A0A0A0KS00 Fe2OG dioxygenase domain-containing protein2.8e-18989.53Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK ILE Y QDG+LP+GVNAT CDLD PVFCLENRPGFYF+PGALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTC-KSVPASVLLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGP QDLFIAAKE +VLVE +EISDF +DSD+EPS+SNGN+H WKFVEENTVS RRGT  KS+PASVLLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTC-KSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS +E   ++QDDLH LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

A0A1S3BE32 LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB8.6e-19190.08Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK ILE Y QDG+LP+GV AT CDLD PVFCLENRPGFYF+PGALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGT-CKSVPASVLLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGP QDLFIAAKEK+VLVE +EISDFN+DSD+EPS+SNG++H WKFVEENTVS RRGT CKS+ ASVLLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGT-CKSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS +ER  S+QDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

A0A6J1DG90 alpha-ketoglutarate-dependent dioxygenase alkB4.1e-19391.16Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFKRILECYN DGALPLG+NAT+CDLD PVFCLENRPGFYF+PGALSLEEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG
        CQWIRESLT+FPQPPNRTNHNAIYGP QDLFIAAKEKRVLVE NEIS FNVDSDIEPSVS GN+HGWKFVEENTVS RR TCKSVPASVLLRKLRWSTLG
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR
        LQFDWSKRSYDISL HNK+PSALC+LAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFLR
Subjt:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        SGDVVLMAGEARECFHGVPRIF D E++E SL+E++FS+QDDLHFLEYIRTSRININIRQVF
Subjt:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

A0A6J1FZQ9 alpha-ketoglutarate-dependent dioxygenase alkB1.7e-19993.92Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLD PVFCLENRPGFYF+PGALSLEEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG
        CQWIRESL +FPQPPNRTNHNAIYGP QDLFIAAKEKRVLVEDNEIS  NVDSD EPS+ NGNS+GWKFVEENTVS RRGTCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR
        LQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFLR
Subjt:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        SGDVVLMAGEARECFHGVPRIFTDEESEEISL+E++FSS+DDLHFLEYIRTSRININIRQVF
Subjt:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

A0A6J1HQB4 alpha-ketoglutarate-dependent dioxygenase alkB2.8e-19793.92Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLD PVFCLENRPGFYF+PGALSLEEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG
        CQ IRESL +FPQPPNRTNHNAIYGP QDLFIAAKEKRVLVEDNEIS  NVDSD EPSV NGNS+GWKFVEENTVS RRGTCKSVPAS LLRKLRWSTLG
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLG

Query:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR
        LQFDWSKRSYDISLPHN IPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD P+AMFLR
Subjt:  LQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLR

Query:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        SGDVVLMAGEARECFHGVPRIFTDEESEEISL+E++FSS+DDLHFLEY+RTSRININIRQVF
Subjt:  SGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

SwissProt top hitse value%identityAlignment
O60066 Alpha-ketoglutarate-dependent dioxygenase abh18.1e-2927.04Show/hide
Query:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQCQWIRESL-TN
        +   FR  EK+YK   D          +P   D+SEV+D          +     G  A   ++   VF  +  PG   +   +S E Q Q ++  + T 
Subjt:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQCQWIRESL-TN

Query:  FPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGW-KFVEENTVSFRRGTCKSVPASV---LLRKLRWSTLGLQFDWS
           P N+TN +  Y                                  +  GN   W ++   +  S   G  ++ P +V   + +KLRW TLG Q+DW+
Subjt:  FPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGW-KFVEENTVSFRRGTCKSVPASV---LLRKLRWSTLGLQFDWS

Query:  KRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVL
         + Y         P  L    +++   +      +K EAAIVN+++ GDTL  H+D+ E D + P++S+S+G   I+L+G +SR + P A+ L SGDVV+
Subjt:  KRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVL

Query:  MAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQV
        M G +R+ FH VP+I  +     +    + +          +I   R+N N+RQV
Subjt:  MAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQV

P0CB42 Nucleic acid dioxygenase ALKBH15.1e-3130.25Show/hide
Query:  KKKLPKQVDLSEVIDFKRILECYNQDGALPLGV-------NATKCDLD----APV-----FCLENRPGFYFVPGALSLEEQCQWIRESLTNFPQPPNRTN
        ++  P   DL  VIDF       +    +P  V       + T+ D +     PV     + LE  PGF F+P       Q  W+++ L  + Q PN  N
Subjt:  KKKLPKQVDLSEVIDFKRILECYNQDGALPLGV-------NATKCDLD----APV-----FCLENRPGFYFVPGALSLEEQCQWIRESLTNFPQPPNRTN

Query:  HNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLPHNKI
                 D  +  +E + L E ++                      + +    V+ RR      P S LL +LRW TLG  ++W  + Y     +   
Subjt:  HNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLPHNKI

Query:  PSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVP
        PS L  L++++A A    G  F+ EA I+NY+    TLG H+D  E D SKP++S S G  AIFLLGG  R + P AMF+ SGD+++M+G +R   H VP
Subjt:  PSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVP

Query:  RIFTDEESEEI---------------SLIERKFSSQDDLHFLEYIRTSRININIRQV
        R+    + E +               SL+E   S +D      Y+RT+R+N+ +RQV
Subjt:  RIFTDEESEEI---------------SLIERKFSSQDDLHFLEYIRTSRININIRQV

Q13686 Nucleic acid dioxygenase ALKBH14.6e-3231.86Show/hide
Query:  LENRPGFYFVPGALSLEEQCQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGT
        L+  PGF F+P       Q  W+++ L  + Q PN  N         D  ++ +E + L E ++                      +F+     + RR  
Subjt:  LENRPGFYFVPGALSLEEQCQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGT

Query:  CKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA
            P S LL KLRW T+G  ++W  + Y     +   PS L  L++++AAA     E+F+ EA I+NY+    TLG H+D  E D SKP++S S G  A
Subjt:  CKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA

Query:  IFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEI---------------SLIERKFSSQDDLHFLEYIRTSRININIRQV
        IFLLGG  R + P AMF+ SGD+++M+G +R   H VPR+  + E E +               S++E   S +D      Y++T+R+N+ +RQV
Subjt:  IFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEI---------------SLIERKFSSQDDLHFLEYIRTSRININIRQV

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB3.4e-4331.12Show/hide
Query:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDF-----------KRILEC----------YNQDGALPLGVNATKCDLDAPVFCLENRPGF
        T F R ++ ++       KS+  K +PK+    +D S V+DF           K I++C          +N+D    L     K      V+ L+  PGF
Subjt:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDF-----------KRILEC----------YNQDGALPLGVNATKCDLDAPVFCLENRPGF

Query:  YFVPGALSLEEQCQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPAS
        YF+    +  +Q +WI+ +L ++  PPN  N    +GP ++L+    EK ++ E+ +    + D +IE      + +G     E   ++R+         
Subjt:  YFVPGALSLEEQCQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPAS

Query:  VLLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGK
         LL KL WSTLG Q+ W+ R Y     + + P  L +L +++A A     + +  EAA VN+++    +GGHLDD E +  KPI+S+S G  A+FL+G +
Subjt:  VLLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGK

Query:  SRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYI--RTSRININIRQVF
        +R   P+ +F+RSGD+V+M G +R C+HGV +I   E S ++ LI+     QD  + ++++  +  R+NIN RQVF
Subjt:  SRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYI--RTSRININIRQVF

Q9SA98 Alpha-ketoglutarate-dependent dioxygenase alkB8.7e-14067.22Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I + +N DG LP G+  +K D  +PVFC++NRPGFYF+P ALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTV-SFRRGTCKSVPASVLLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGP  DLF +AKE +VLV+D+                   ++ WKF EE  +    R +CKSV ASVLLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTV-SFRRGTCKSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKR+YD+SLPHN IP ALC+LAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIFT EE+ +I  +E + S +    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein6.2e-14167.22Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I + +N DG LP G+  +K D  +PVFC++NRPGFYF+P ALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTV-SFRRGTCKSVPASVLLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGP  DLF +AKE +VLV+D+                   ++ WKF EE  +    R +CKSV ASVLLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTV-SFRRGTCKSVPASVLLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKR+YD+SLPHN IP ALC+LAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIFT EE+ +I  +E + S +    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFSSQDDLHFLEYIRTSRININIRQVF

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein9.9e-0630.77Show/hide
Query:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHG
        P+  +VN++ S   LG H     D    D+ K                     PIVS S+G  A FL G +   D    + L SGDV++    +R  FHG
Subjt:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHG

Query:  VPRI
        V  I
Subjt:  VPRI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein1.3e-1034.43Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFS
        P+  IVN+++S   LG H D  E++ S     P+VS S+G  A FL G +  +D    + L SGDV+L  G +R+ FHGV  I             RK +
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFS

Query:  SQDDLHFLEYIRTSRININIRQ
        +   L     +R  R+N+  RQ
Subjt:  SQDDLHFLEYIRTSRININIRQ

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein2.3e-1034.71Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFS
        P+  IVN+++    LG H D  E++ S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I  +  S  +SL+     
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFS

Query:  SQDDLHFLEYIRTSRININIR
                  +RT R+N+  R
Subjt:  SQDDLHFLEYIRTSRININIR

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein2.3e-1034.71Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFS
        P+  IVN+++    LG H D  E++ S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I  +  S  +SL+     
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEISLIERKFS

Query:  SQDDLHFLEYIRTSRININIR
                  +RT R+N+  R
Subjt:  SQDDLHFLEYIRTSRININIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGGATCCGACAAAGGCACAGACGACTCTGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAACTGTACTACGACGACACCTACAAATCTTCCAAAAAGAA
AAAGCTACCGAAACAAGTGGATTTGTCTGAGGTTATCGATTTCAAGCGTATCCTCGAATGTTACAATCAGGATGGTGCACTTCCGTTGGGCGTGAATGCGACTAAGTGCG
ATCTCGATGCACCAGTCTTCTGCTTGGAGAATCGTCCTGGGTTTTACTTTGTTCCCGGTGCGTTAAGTTTAGAAGAGCAATGCCAATGGATTAGGGAGAGTTTAACGAAT
TTCCCGCAGCCTCCTAACAGAACCAATCATAATGCTATTTATGGACCAAGTCAAGACCTTTTCATTGCAGCAAAGGAAAAGAGGGTTTTAGTTGAAGATAATGAAATCTC
TGATTTCAATGTTGATTCTGACATTGAACCTTCCGTTAGCAATGGAAATTCTCATGGATGGAAATTTGTGGAGGAAAATACTGTTTCATTCAGAAGGGGGACCTGCAAAT
CAGTTCCTGCTTCTGTATTACTTCGGAAGTTGCGTTGGAGTACACTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCCCCATAATAAGATACCC
TCTGCACTATGCAAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCAACTGGGGAAGAATTCAAGCCTGAAGCTGCAATAGTGAATTATTTTGCTTCGGGTGACACGCT
CGGGGGTCACCTAGATGACATGGAAGCTGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTATTGGGTGGCAAGTCGAGACAGGATCCGC
CCATAGCCATGTTTCTTCGAAGTGGAGATGTCGTGCTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCACAGATGAAGAAAGTGAGGAAATT
TCTCTTATTGAAAGGAAGTTCTCAAGTCAAGATGATTTGCATTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
CCTAAACTTGATGCTCCCGTCAATCGCCATAATTCAGAAGCTCAATCTTACAGCCATCGATAACTCACCGGCGGCAGCGGATCCATAAAATGTACGGATCCGACAAAGGC
ACAGACGACTCTGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAACTGTACTACGACGACACCTACAAATCTTCCAAAAAGAAAAAGCTACCGAAACAAGTGGA
TTTGTCTGAGGTTATCGATTTCAAGCGTATCCTCGAATGTTACAATCAGGATGGTGCACTTCCGTTGGGCGTGAATGCGACTAAGTGCGATCTCGATGCACCAGTCTTCT
GCTTGGAGAATCGTCCTGGGTTTTACTTTGTTCCCGGTGCGTTAAGTTTAGAAGAGCAATGCCAATGGATTAGGGAGAGTTTAACGAATTTCCCGCAGCCTCCTAACAGA
ACCAATCATAATGCTATTTATGGACCAAGTCAAGACCTTTTCATTGCAGCAAAGGAAAAGAGGGTTTTAGTTGAAGATAATGAAATCTCTGATTTCAATGTTGATTCTGA
CATTGAACCTTCCGTTAGCAATGGAAATTCTCATGGATGGAAATTTGTGGAGGAAAATACTGTTTCATTCAGAAGGGGGACCTGCAAATCAGTTCCTGCTTCTGTATTAC
TTCGGAAGTTGCGTTGGAGTACACTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCCCCATAATAAGATACCCTCTGCACTATGCAAACTTGCC
AAAAGAATGGCGGCAGCTGCAATGCCAACTGGGGAAGAATTCAAGCCTGAAGCTGCAATAGTGAATTATTTTGCTTCGGGTGACACGCTCGGGGGTCACCTAGATGACAT
GGAAGCTGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTATTGGGTGGCAAGTCGAGACAGGATCCGCCCATAGCCATGTTTCTTCGAA
GTGGAGATGTCGTGCTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCACAGATGAAGAAAGTGAGGAAATTTCTCTTATTGAAAGGAAGTTC
TCAAGTCAAGATGATTTGCATTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTTTCTGATTCTTTGGATCTGTTATCTTTCAAGCTTTGCT
AGAAAAATTTTCCAACATTATAGGGACGACGGGAGTGGCTTCGCTCATACATAGACAGATGAGAGAAGAGATTGATTACAGTCTGATTGCAGACAGAAGATTTATGAAGC
CGGACTCGAACAATGTTTTTTTGGCTGCCATTGGAGGGGATACCTTAGGACGGGCACAATAAAGGACTACGAGTATGGAAGAAACCCATCTGGAAGAAAGATTGGTTTTT
TTTTTTTCTTCTTCTCTCATATGGATGAGCTAAATATTTGGAAAATCTTTTGAGGTGATCTAGTTTAATTTAAAGACAATTGGAGATTAGATGATTAAATTATTTATTTA
TTTTTATAACTTGTCTTTACTAGTTGAACCATCTCTTGAAATGGACAAATTATTTAATGAGATCGCACAATGTGATTGACAGTTTTTTATCGTTTGATTAAGAGAGAAGT
ACCTCAAATTTATATATAATGGTTATTAGAAGAATCCAAGAAGCAGACACATTATGGCGG
Protein sequenceShow/hide protein sequence
MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKRILECYNQDGALPLGVNATKCDLDAPVFCLENRPGFYFVPGALSLEEQCQWIRESLTN
FPQPPNRTNHNAIYGPSQDLFIAAKEKRVLVEDNEISDFNVDSDIEPSVSNGNSHGWKFVEENTVSFRRGTCKSVPASVLLRKLRWSTLGLQFDWSKRSYDISLPHNKIP
SALCKLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFTDEESEEI
SLIERKFSSQDDLHFLEYIRTSRININIRQVF