; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC11G220970 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC11G220970
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionalpha-ketoglutarate-dependent dioxygenase alkB
Genome locationCicolChr11:25403138..25405423
RNA-Seq ExpressionCcUC11G220970
SyntenyCcUC11G220970
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573589.1 Alpha-ketoglutarate-dependent dioxygenase alkB, partial [Cucurbita argyrosperma subsp. sororia]2.1e-18789.81Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK I E Y QDGALPLGVNATKC+LDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIR+SL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDL+FLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

XP_004135327.1 alpha-ketoglutarate-dependent dioxygenase alkB isoform X1 [Cucumis sativus]4.3e-19392.01Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE Y+QDG+LP+GVNAT C+LDGPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAK  K+LVE DEISDFK+DSDVEPSISNG+TH+WKFVEENTVSSRRGTA KSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISL HNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE H +NQDDL+ LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

XP_008446031.1 PREDICTED: LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB [Cucumis melo]3.9e-19492.56Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE YKQDG+LP+GV AT C+LD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK KK+LVE DEISDF +DSDVEPSISNGSTH+WKFVEENTVSSRRGTACKSI AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISL HNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERH SNQDDL+FLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

XP_022945036.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita moschata]4.1e-18890.08Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILE Y QDGALPLGVNATKC+LDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDL+FLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

XP_038891364.1 alpha-ketoglutarate-dependent dioxygenase alkB [Benincasa hispida]3.9e-19493.11Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK IL  YKQDGALPLGVNA KC+LD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESLTNFPQP NRTNHNAIYG IQDLFIAAK KK+LVEDDEISDF VDSDVE S+SNG+TH+WKFVEENTVSS+RGTACKSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEIS LERHFSNQDDL+FLEYI+ SRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

TrEMBL top hitse value%identityAlignment
A0A0A0KS00 Fe2OG dioxygenase domain-containing protein2.1e-19392.01Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE Y+QDG+LP+GVNAT C+LDGPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAK  K+LVE DEISDFK+DSDVEPSISNG+TH+WKFVEENTVSSRRGTA KSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISL HNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE H +NQDDL+ LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

A0A1S3BE32 LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB1.9e-19492.56Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE YKQDG+LP+GV AT C+LD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK KK+LVE DEISDF +DSDVEPSISNGSTH+WKFVEENTVSSRRGTACKSI AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISL HNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERH SNQDDL+FLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

A0A6J1DG90 alpha-ketoglutarate-dependent dioxygenase alkB4.6e-18588.15Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFK ILE Y  DGALPLG+NAT+C+LDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESLT+FPQPPNRTNHNAIYGPIQDLFIAAK K++LVE +EIS F VDSD+EPS+S G+TH WKFVEENTVSSRR T CKS+PAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HNK+PSALC+LAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFID E++E S LE+ FSNQDDL+FLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

A0A6J1FZQ9 alpha-ketoglutarate-dependent dioxygenase alkB2.0e-18890.08Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILE Y QDGALPLGVNATKC+LDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDL+FLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

A0A6J1HQB4 alpha-ketoglutarate-dependent dioxygenase alkB5.5e-18689.81Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILE Y QDGALPLGVNATKC+LDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQ IRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HN IPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD PVAMFL
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDL+FLEY+RTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

SwissProt top hitse value%identityAlignment
O60066 Alpha-ketoglutarate-dependent dioxygenase abh18.5e-2727.27Show/hide
Query:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESL-TN
        +   FR  EK+YK   D          +P   D+SEV+D          +     G  A    +   VF  +  PG   +   +S + Q Q ++  + T 
Subjt:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESL-TN

Query:  FPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRS
           P N+TN +  Y             ++ + +D I     + D E SI +G   +     +  V                 +KLRW TLG Q+DW+ + 
Subjt:  FPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRS

Query:  YDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAG
        Y         P  L +  +++   +      +K EAAIVN+++ GDTL  H+D+ E D + P++S+S+G   I+L+G +SR + P A+ L SGDVV+M G
Subjt:  YDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAG

Query:  EARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQV
         +R+ FH VP+I  +     +    + +          +I   R+N N+RQV
Subjt:  EARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQV

P0CB42 Nucleic acid dioxygenase ALKBH15.7e-3130.36Show/hide
Query:  KKKLPKQVDLSEVIDF-KSILERYKQDGA-----LPLGV------NATKCNLDGPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRT
        ++  P   DL  VIDF ++ L R  + G       PL V      +A +  L+ PV     + LE  PGF FIP       Q  W+++ L  + Q PN  
Subjt:  KKKLPKQVDLSEVIDF-KSILERYKQDGA-----LPLGV------NATKCNLDGPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRT

Query:  NHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGS-THSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLHH
        N                               +D  +    + G    S + +    V+ RR          +LL +LRW TLG  ++W  + Y    H+
Subjt:  NHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGS-THSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLHH

Query:  NKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFH
           PS L  L++++A A    G  F+ EA I+NY+    TLG H+D  E D SKP++S S G  AIFLLGG  R + P AMF+ SGD+++M+G +R   H
Subjt:  NKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFH

Query:  GVPRIFIDEESEEI--------------SFLERHFSNQDDLYFLEYIRTSRININIRQV
         VPR+    + E +              + L    S +D      Y+RT+R+N+ +RQV
Subjt:  GVPRIFIDEESEEI--------------SFLERHFSNQDDLYFLEYIRTSRININIRQV

Q13686 Nucleic acid dioxygenase ALKBH12.3e-3230.17Show/hide
Query:  KKKLPKQVDLSEVIDFKSI-LERYKQDGALPL-----------GVNATKCNLDGPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRT
        ++  P   DL  VIDF +    R K  GA  +             NA +  L  PV     + L+  PGF FIP       Q  W+++ L  + Q PN  
Subjt:  KKKLPKQVDLSEVIDFKSI-LERYKQDGALPL-----------GVNATKCNLDGPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRT

Query:  NHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLHHN
        N         D  ++ +  + L E                       S +F+     + RR          +LL KLRW T+G  ++W  + Y    H+ 
Subjt:  NHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLHHN

Query:  KIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG
          PS L  L++++AAA     E+F+ EA I+NY+    TLG H+D  E D SKP++S S G  AIFLLGG  R + P AMF+ SGD+++M+G +R   H 
Subjt:  KIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG

Query:  VPRIFIDEESEEI--------------SFLERHFSNQDDLYFLEYIRTSRININIRQV
        VPR+  + E E +                +    S +D      Y++T+R+N+ +RQV
Subjt:  VPRIFIDEESEEI--------------SFLERHFSNQDDLYFLEYIRTSRININIRQV

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB1.7e-4330.13Show/hide
Query:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDFKSILERYKQDGALPLGVNATKCNLD---------------GPVFCLENRPGFYFIPGA
        T F R ++ ++       KS+  K +PK+    +D S V+DF ++    +++  L +   +     D                 V+ L+  PGFYFI   
Subjt:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDFKSILERYKQDGALPLGVNATKCNLD---------------GPVFCLENRPGFYFIPGA

Query:  LSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKI---LVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPA-SA
         +  +Q +WI+ +L ++  PPN  N    +GPI++L+   +++ I   L    +  D +++    P   NG                     + +P    
Subjt:  LSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKI---LVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPA-SA

Query:  LLRKLRWSTLGLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS
        LL KL WSTLG Q+ W+ R Y     + + P  L EL +++A A     + +  EAA VN+++    +GGHLDD E +  KPI+S+S G  A+FL+G ++
Subjt:  LLRKLRWSTLGLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS

Query:  RQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYI--RTSRININIRQVF
        R   PV +F+RSGD+V+M G +R C+HGV +I   E S ++  ++ +  +QD  Y ++++  +  R+NIN RQVF
Subjt:  RQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYI--RTSRININIRQVF

Q9SA98 Alpha-ketoglutarate-dependent dioxygenase alkB2.4e-13866.12Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I + +  DG LP G+  +K +   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGPI DLF +AK  K+LV+DD                  + + WKF EE  +     ++CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SL HN IP ALC+LAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE   S++   +F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.7e-13966.12Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I + +  DG LP G+  +K +   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGALPLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGPI DLF +AK  K+LV+DD                  + + WKF EE  +     ++CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SL HN IP ALC+LAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE   S++   +F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein1.4e-0529.06Show/hide
Query:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG
        P+  +VN++ S   LG H     D    D+ K                     PIVS S+G  A FL G +   D    + L SGDV++    +R  FHG
Subjt:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG

Query:  V--------PRIFIDEE
        V        PR+F  ++
Subjt:  V--------PRIFIDEE

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein2.6e-1040.96Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRI
        P+  IVN+++S   LG H D  E++ S     P+VS S+G  A FL G +  +D    + L SGDV+L  G +R+ FHGV  I
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRI

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein2.6e-1030.1Show/hide
Query:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLHHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD
        P  ++  KL    + L  +W      +++ DI     +IP   + L E A R A A +   +G E          P+  IVN+++    LG H D  E++
Subjt:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLHHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD

Query:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIR
         S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L               +RT R+N+  R
Subjt:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIR

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein2.6e-1030.1Show/hide
Query:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLHHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD
        P  ++  KL    + L  +W      +++ DI     +IP   + L E A R A A +   +G E          P+  IVN+++    LG H D  E++
Subjt:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLHHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD

Query:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIR
         S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L               +RT R+N+  R
Subjt:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCTGACCAGTCAAAACCCTAAGCTTGATTATCCGTCTCCCGTCCGTCACCAGAATTCAGAAGCTCAATCTTACGCCGATCGAGCATCTCACCGGCAACAG
CGGATCCTGGCGGATGAAATGTACGGATCCGACAAAAACACCGACGATTCCGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAGTTGTACTATGACGAC
ACCTACAAATCTTCGAAAAAGAAAAAACTACCCAAACAAGTCGATTTGTCTGAGGTTATCGATTTCAAGTCCATTCTCGAACGTTACAAACAAGATGGTGCACTT
CCGCTGGGCGTGAATGCGACTAAGTGCAATCTCGATGGGCCAGTCTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCATTGAGTTTACAAGAG
CAATGCCAATGGATCAGGGAGAGTTTAACGAATTTCCCACAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCA
AAGCGAAAGAAAATTCTAGTTGAAGATGATGAAATCTCTGATTTCAAAGTTGATTCTGACGTTGAACCTTCCATTAGCAATGGAAGTACTCATAGCTGGAAGTTT
GTAGAGGAGAATACTGTTTCATCCAGAAGAGGGACGGCCTGCAAATCAATTCCTGCTTCAGCATTACTTCGAAAATTGCGTTGGAGTACCCTCGGCCTACAATTT
GATTGGTCCAAGCGAAGCTATGACATATCTCTGCACCATAATAAGATACCCTCTGCGCTCTGTGAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCAACTGGG
GAAGAGTTCAAGCCTGAAGCTGCAATAGTGAATTACTTTGCTTCAGGTGACACACTCGGTGGTCACTTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTT
AGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTTTAGGTGGTAAGTCGAGACAGGATCCACCCGTAGCCATGTTTCTTCGAAGCGGAGATGTCGTGCTAATGGCT
GGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCATCGATGAAGAAAGTGAAGAAATTTCTTTTCTTGAAAGGCATTTCTCAAATCAAGATGATTTG
TACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTCTGACCAGTCAAAACCCTAAGCTTGATTATCCGTCTCCCGTCCGTCACCAGAATTCAGAAGCTCAATCTTACGCCGATCGAGCATCTCACCGGCAACAG
CGGATCCTGGCGGATGAAATGTACGGATCCGACAAAAACACCGACGATTCCGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAGTTGTACTATGACGAC
ACCTACAAATCTTCGAAAAAGAAAAAACTACCCAAACAAGTCGATTTGTCTGAGGTTATCGATTTCAAGTCCATTCTCGAACGTTACAAACAAGATGGTGCACTT
CCGCTGGGCGTGAATGCGACTAAGTGCAATCTCGATGGGCCAGTCTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCATTGAGTTTACAAGAG
CAATGCCAATGGATCAGGGAGAGTTTAACGAATTTCCCACAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCA
AAGCGAAAGAAAATTCTAGTTGAAGATGATGAAATCTCTGATTTCAAAGTTGATTCTGACGTTGAACCTTCCATTAGCAATGGAAGTACTCATAGCTGGAAGTTT
GTAGAGGAGAATACTGTTTCATCCAGAAGAGGGACGGCCTGCAAATCAATTCCTGCTTCAGCATTACTTCGAAAATTGCGTTGGAGTACCCTCGGCCTACAATTT
GATTGGTCCAAGCGAAGCTATGACATATCTCTGCACCATAATAAGATACCCTCTGCGCTCTGTGAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCAACTGGG
GAAGAGTTCAAGCCTGAAGCTGCAATAGTGAATTACTTTGCTTCAGGTGACACACTCGGTGGTCACTTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTT
AGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTTTAGGTGGTAAGTCGAGACAGGATCCACCCGTAGCCATGTTTCTTCGAAGCGGAGATGTCGTGCTAATGGCT
GGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCATCGATGAAGAAAGTGAAGAAATTTCTTTTCTTGAAAGGCATTTCTCAAATCAAGATGATTTG
TACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTTTCTGA
Protein sequenceShow/hide protein sequence
MCLTSQNPKLDYPSPVRHQNSEAQSYADRASHRQQRILADEMYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILERYKQDGAL
PLGVNATKCNLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDVEPSISNGSTHSWKF
VEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLHHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIV
SMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLYFLEYIRTSRININIRQVF