; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G013630 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G013630
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionalpha-ketoglutarate-dependent dioxygenase alkB
Genome locationCG_Chr11:26811751..26814064
RNA-Seq ExpressionClCG11G013630
SyntenyClCG11G013630
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573589.1 Alpha-ketoglutarate-dependent dioxygenase alkB, partial [Cucurbita argyrosperma subsp. sororia]2.9e-18990.63Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK I E Y QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIR+SL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG++N WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_004135327.1 alpha-ketoglutarate-dependent dioxygenase alkB isoform X1 [Cucumis sativus]6.6e-19492.29Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILESY+QDG+LP+GVNAT CDLDGPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAK  K+LVE DEISDFK+DSD+EPSISNG+T++WKFVEENTVSSRRGTA KSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE H +NQDDLH LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_008446031.1 PREDICTED: LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB [Cucumis melo]6.0e-19592.84Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILESYKQDG+LP+GV AT CDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK KK+LVE DEISDF +DSD+EPSISNGST++WKFVEENTVSSRRGTACKSI AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERH SNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_022945036.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita moschata]5.8e-19090.91Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILE Y QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG++N WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_038891364.1 alpha-ketoglutarate-dependent dioxygenase alkB [Benincasa hispida]1.3e-19493.11Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK IL  YKQDGALPLGVNA KCDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESLTNFPQP NRTNHNAIYG IQDLFIAAK KK+LVEDDEISDF VDSD+E S+SNG+T++WKFVEENTVSS+RGTACKSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHNKIPSALC+LAKRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEIS LERHFSNQDDLHFLEYI+ SRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

TrEMBL top hitse value%identityAlignment
A0A0A0KS00 Fe2OG dioxygenase domain-containing protein3.2e-19492.29Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILESY+QDG+LP+GVNAT CDLDGPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAK  K+LVE DEISDFK+DSD+EPSISNG+T++WKFVEENTVSSRRGTA KSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE H +NQDDLH LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A1S3BE32 LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB2.9e-19592.84Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILESYKQDG+LP+GV AT CDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK KK+LVE DEISDF +DSD+EPSISNGST++WKFVEENTVSSRRGTACKSI AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERH SNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A6J1DG90 alpha-ketoglutarate-dependent dioxygenase alkB1.0e-18488.43Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFK ILE Y  DGALPLG+NAT+CDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESLT+FPQPPNRTNHNAIYGPIQDLFIAAK K++LVE +EIS F VDSDIEPS+S G+T+ WKFVEENTVSSRR T CKS+PAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HNK+PSALC+LAKRMAAAAM  GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFID E++E S LE+ FSNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A6J1FZQ9 alpha-ketoglutarate-dependent dioxygenase alkB2.8e-19090.91Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILE Y QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG++N WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A6J1HQB4 alpha-ketoglutarate-dependent dioxygenase alkB7.6e-18890.63Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILE Y QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQ IRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG++N WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN IPSALC+LAKRMAAAAM TGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD PVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEY+RTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

SwissProt top hitse value%identityAlignment
O60066 Alpha-ketoglutarate-dependent dioxygenase abh11.9e-2627.27Show/hide
Query:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESL-TN
        +   FR  EK+YK   D          +P   D+SEV+D          +     G  A   ++   VF  +  PG   +   +S + Q Q ++  + T 
Subjt:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESL-TN

Query:  FPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRS
           P N+TN +  Y             ++ + +D I     + D E SI +G   +     +  V                 +KLRW TLG Q+DW+ + 
Subjt:  FPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRS

Query:  YDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAG
        Y         P  L +  +++   +      +K EAAIVN+++ GDTL  H+D+ E D + P++S+S+G   I+L+G +SR + P A+ L SGDVV+M G
Subjt:  YDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAG

Query:  EARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQV
         +R+ FH VP+I  +     +    + +          +I   R+N N+RQV
Subjt:  EARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQV

P0CB42 Nucleic acid dioxygenase ALKBH13.1e-2929.89Show/hide
Query:  KKKLPKQVDLSEVIDF------KSILESYKQDGALPLGVNA-TKCDLD----GPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTN
        ++  P   DL  VIDF      +S      Q    PL V++ T+ D +     PV     + LE  PGF FIP       Q  W+++ L  + Q PN  N
Subjt:  KKKLPKQVDLSEVIDF------KSILESYKQDGALPLGVNA-TKCDLD----GPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTN

Query:  HNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGS-TNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHN
                                       +D  +    + G    S + +    V+ RR          +LL +LRW TLG  ++W  + Y     + 
Subjt:  HNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGS-TNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHN

Query:  KIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG
          PS L  L++++A A    G  F+ EA I+NY+    TLG H+D  E D SKP++S S G  AIFLLGG  R + P AMF+ SGD+++M+G +R   H 
Subjt:  KIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG

Query:  VPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV
        VPR+    + E +              + L    S +D      Y+RT+R+N+ +RQV
Subjt:  VPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV

Q13686 Nucleic acid dioxygenase ALKBH14.8e-3031.42Show/hide
Query:  LENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNS-WKFVEENTVSSRRG
        L+  PGF FIP       Q  W+++ L  + Q PN  N                                   ++  +S   T   W   E++    R  
Subjt:  LENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNS-WKFVEENTVSSRRG

Query:  TACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGC
         A K  P S LL KLRW T+G  ++W  + Y     +   PS L  L++++AAA     E+F+ EA I+NY+    TLG H+D  E D SKP++S S G 
Subjt:  TACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGC

Query:  KAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV
         AIFLLGG  R + P AMF+ SGD+++M+G +R   H VPR+  + E E +                +    S +D      Y++T+R+N+ +RQV
Subjt:  KAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB3.8e-4330.4Show/hide
Query:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDFKSILESYKQDGALPLGV--NATKCDL-------------DGPVFCLENRPGFYFIPGA
        T F R ++ ++       KS+  K +PK+    +D S V+DF ++  + +++  L +    N T  D              +  V+ L+  PGFYFI   
Subjt:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDFKSILESYKQDGALPLGV--NATKCDL-------------DGPVFCLENRPGFYFIPGA

Query:  LSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKI---LVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPA-SA
         +  +Q +WI+ +L ++  PPN  N    +GPI++L+   +++ I   L    +  D +++    P   NG                     + +P    
Subjt:  LSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKI---LVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPA-SA

Query:  LLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS
        LL KL WSTLG Q+ W+ R Y     + + P  L EL +++A A     + +  EAA VN+++    +GGHLDD E +  KPI+S+S G  A+FL+G ++
Subjt:  LLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS

Query:  RQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYI--RTSRININIRQVF
        R   PV +F+RSGD+V+M G +R C+HGV +I   E S ++  ++ +  +QD  + ++++  +  R+NIN RQVF
Subjt:  RQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYI--RTSRININIRQVF

Q9SA98 Alpha-ketoglutarate-dependent dioxygenase alkB6.3e-13966.67Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I +++  DG LP G+  +K D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGPI DLF +AK  K+LV+DD                  + N WKF EE  +     ++CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SLPHN IP ALC+LAK  AA AM  GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE   S++    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein4.5e-14066.67Show/hide
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I +++  DG LP G+  +K D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGPI DLF +AK  K+LV+DD                  + N WKF EE  +     ++CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SLPHN IP ALC+LAK  AA AM  GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE   S++    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein8.5e-0629.09Show/hide
Query:  RSYDISLPHNKIP---SALCELAKRMAAAAMAT-------GEEFK---PEAAIVNYFASGDTLGGH----LDDMEADWSK--------------------
        R  D S+P  +IP   S L E A + + + +AT       G+E     P+  +VN++ S   LG H     D    D+ K                    
Subjt:  RSYDISLPHNKIP---SALCELAKRMAAAAMAT-------GEEFK---PEAAIVNYFASGDTLGGH----LDDMEADWSK--------------------

Query:  -PIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGV--------PRIFIDEE
         PIVS S+G  A FL G +   D    + L SGDV++    +R  FHGV        PR+F  ++
Subjt:  -PIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGV--------PRIFIDEE

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein2.6e-1040.96Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRI
        P+  IVN+++S   LG H D  E++ S     P+VS S+G  A FL G +  +D    + L SGDV+L  G +R+ FHGV  I
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRI

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein3.3e-1030.1Show/hide
Query:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--ATGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD
        P  ++  KL    + L  +W      +++ DI     +IP   + L E A R A A +   +G E          P+  IVN+++    LG H D  E++
Subjt:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--ATGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD

Query:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR
         S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L               +RT R+N+  R
Subjt:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein3.3e-1030.1Show/hide
Query:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--ATGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD
        P  ++  KL    + L  +W      +++ DI     +IP   + L E A R A A +   +G E          P+  IVN+++    LG H D  E++
Subjt:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--ATGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD

Query:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR
         S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L               +RT R+N+  R
Subjt:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCTGACCAGTCAAAACCCTAAGCTTGATTATCCGTCTCCCGTCCGTCGCCAGAATTCAGAAGCTCAATCTTACGCCGATCGAGCATCTCACCGGCGACAGTGGAT
CCTGGCGGATGAAATGTACGGATCCGACAAAAACACCGACGATTCCGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAATTGTACTATGACGACACCTACAAAT
CTTCGAAAAAGAAAAAACTACCGAAACAAGTCGATTTGTCTGAGGTTATCGATTTCAAGTCCATTCTCGAAAGTTACAAACAAGATGGTGCACTTCCGCTGGGCGTGAAT
GCGACTAAGTGCGATCTCGATGGGCCAGTCTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCATTGAGTTTACAAGAGCAATGCCAATGGATCAGGGA
GAGTTTAACGAATTTCCCACAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCAAAGCGAAAGAAAATTCTAGTTGAAG
ATGATGAAATCTCTGATTTTAAAGTTGATTCTGACATTGAACCTTCCATTAGCAATGGAAGTACTAATAGCTGGAAGTTTGTAGAGGAGAATACTGTTTCATCCAGAAGA
GGGACGGCCTGCAAATCAATTCCTGCTTCAGCATTACTTCGAAAATTGCGTTGGAGTACCCTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCC
CCATAATAAGATACCCTCTGCGCTCTGTGAACTTGCCAAAAGAATGGCGGCAGCTGCAATGGCAACTGGGGAAGAGTTCAAGCCTGAAGCTGCAATAGTGAATTACTTTG
CTTCAGGTGACACACTCGGTGGTCACTTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTTTAGGTGGTAAG
TCGAGACAGGATCCACCCGTAGCCATGTTTCTTCGAAGCGGAGATGTCGTGCTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCATCGATGA
AGAAAGTGAAGAAATTTCTTTTCTTGAAAGGCATTTCTCAAATCAAGATGATTTGCACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTT
TCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTCTGACCAGTCAAAACCCTAAGCTTGATTATCCGTCTCCCGTCCGTCGCCAGAATTCAGAAGCTCAATCTTACGCCGATCGAGCATCTCACCGGCGACAGTGGAT
CCTGGCGGATGAAATGTACGGATCCGACAAAAACACCGACGATTCCGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAATTGTACTATGACGACACCTACAAAT
CTTCGAAAAAGAAAAAACTACCGAAACAAGTCGATTTGTCTGAGGTTATCGATTTCAAGTCCATTCTCGAAAGTTACAAACAAGATGGTGCACTTCCGCTGGGCGTGAAT
GCGACTAAGTGCGATCTCGATGGGCCAGTCTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCATTGAGTTTACAAGAGCAATGCCAATGGATCAGGGA
GAGTTTAACGAATTTCCCACAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCAAAGCGAAAGAAAATTCTAGTTGAAG
ATGATGAAATCTCTGATTTTAAAGTTGATTCTGACATTGAACCTTCCATTAGCAATGGAAGTACTAATAGCTGGAAGTTTGTAGAGGAGAATACTGTTTCATCCAGAAGA
GGGACGGCCTGCAAATCAATTCCTGCTTCAGCATTACTTCGAAAATTGCGTTGGAGTACCCTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCC
CCATAATAAGATACCCTCTGCGCTCTGTGAACTTGCCAAAAGAATGGCGGCAGCTGCAATGGCAACTGGGGAAGAGTTCAAGCCTGAAGCTGCAATAGTGAATTACTTTG
CTTCAGGTGACACACTCGGTGGTCACTTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTTTAGGTGGTAAG
TCGAGACAGGATCCACCCGTAGCCATGTTTCTTCGAAGCGGAGATGTCGTGCTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCATCGATGA
AGAAAGTGAAGAAATTTCTTTTCTTGAAAGGCATTTCTCAAATCAAGATGATTTGCACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTT
TCTGA
Protein sequenceShow/hide protein sequence
MCLTSQNPKLDYPSPVRRQNSEAQSYADRASHRRQWILADEMYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILESYKQDGALPLGVN
ATKCDLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTNSWKFVEENTVSSRR
GTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMATGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGK
SRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF