CuGenDBv2

CaUC11G206070 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC11G206070
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: alpha-ketoglutarate-dependent dioxygenase alkB
Genome location: Ciama_Chr11:5793759..5796031
RNA-Seq Expression: CaUC11G206070
Synteny: CaUC11G206070
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domains:
IPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily
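
The genome location given in this record follows a `seqid:start..end` pattern. As a rough sketch of how such a string could be split into its parts, the snippet below parses it and derives the length of the gene span. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("Ciama_Chr11:5793759..5796031")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2273 bp on Ciama_Chr11. This is longer than the 1105-nt CDS listed under Sequences, so the genomic span covers more than coding sequence.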


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573589.1 Alpha-ketoglutarate-dependent dioxygenase alkB, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.1e-191 | % identity: 90.91
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK I ECY QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIR+SL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_004135327.1 alpha-ketoglutarate-dependent dioxygenase alkB isoform X1 [Cucumis sativus] | e value: 2.1e-195 | % identity: 92.56
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE Y+QDG+LP+GVNAT CDLDGPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAK  K+LVE DEISDFK+DSD+EPSISNG+TH+WKFVEENTVSSRRGTA KSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE H +NQDDLH LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_008446031.1 PREDICTED: LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB [Cucumis melo] | e value: 1.9e-196 | % identity: 93.11
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE YKQDG+LP+GV AT CDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK KK+LVE DEISDF +DSD+EPSISNGSTH+WKFVEENTVSSRRGTACKSI AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERH SNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_022945036.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita moschata] | e value: 6.2e-192 | % identity: 91.18
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILECY QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

XP_038891364.1 alpha-ketoglutarate-dependent dioxygenase alkB [Benincasa hispida] | e value: 7.5e-198 | % identity: 93.94
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK IL CYKQDGALPLGVNA KCDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESLTNFPQP NRTNHNAIYG IQDLFIAAK KK+LVEDDEISDF VDSD+E S+SNG+TH+WKFVEENTVSS+RGTACKSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEIS LERHFSNQDDLHFLEYI+ SRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS00 Fe2OG dioxygenase domain-containing protein | e value: 9.9e-196 | % identity: 92.56
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE Y+QDG+LP+GVNAT CDLDGPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAK  K+LVE DEISDFK+DSD+EPSISNG+TH+WKFVEENTVSSRRGTA KSIPAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE H +NQDDLH LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A1S3BE32 LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB | e value: 9.0e-197 | % identity: 93.11
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK+ILE YKQDG+LP+GV AT CDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK KK+LVE DEISDF +DSD+EPSISNGSTH+WKFVEENTVSSRRGTACKSI AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSY+ISLPHNKIPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERH SNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A6J1DG90 alpha-ketoglutarate-dependent dioxygenase alkB | e value: 5.8e-188 | % identity: 89.26
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD ERTAFR AEKKYK YYDDT+KSSKKKKLPKQVDLSEVIDFK ILECY  DGALPLG+NAT+CDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESLT+FPQPPNRTNHNAIYGPIQDLFIAAK K++LVE +EIS F VDSDIEPS+S G+TH WKFVEENTVSSRR T CKS+PAS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISL HNK+PSALC+LAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFID E++E S LE+ FSNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A6J1FZQ9 alpha-ketoglutarate-dependent dioxygenase alkB | e value: 3.0e-192 | % identity: 91.18
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILECY QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQWIRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN+IPSALC+L KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

A0A6J1HQB4 alpha-ketoglutarate-dependent dioxygenase alkB | e value: 8.1e-190 | % identity: 90.91
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFK ILECY QDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        CQ IRESL +FPQPPNRTNHNAIYGPIQDLFIAAK K++LVED+EIS   VDSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+PASALLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKRSYDISLPHN IPSALC+LAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD PVAMFL
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEY+RTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

SwissProt top hits (e value, % identity, alignment)
O60066 Alpha-ketoglutarate-dependent dioxygenase abh1 | e value: 6.5e-27 | % identity: 27.27
Query:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESL-TN
        +   FR  EK+YK   D          +P   D+SEV+D          +     G  A   ++   VF  +  PG   +   +S + Q Q ++  + T 
Subjt:  ERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESL-TN

Query:  FPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRS
           P N+TN +  Y             ++ + +D I     + D E SI +G   +     +  V                 +KLRW TLG Q+DW+ + 
Subjt:  FPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRS

Query:  YDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAG
        Y         P  L +  +++   +      +K EAAIVN+++ GDTL  H+D+ E D + P++S+S+G   I+L+G +SR + P A+ L SGDVV+M G
Subjt:  YDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAG

Query:  EARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQV
         +R+ FH VP+I  +     +    + +          +I   R+N N+RQV
Subjt:  EARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQV

P0CB42 Nucleic acid dioxygenase ALKBH1 | e value: 4.1e-29 | % identity: 29.89
Query:  KKKLPKQVDLSEVIDF------KSILECYKQDGALPLGVNA-TKCDLD----GPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTN
        ++  P   DL  VIDF      +S      Q    PL V++ T+ D +     PV     + LE  PGF FIP       Q  W+++ L  + Q PN  N
Subjt:  KKKLPKQVDLSEVIDF------KSILECYKQDGALPLGVNA-TKCDLD----GPV-----FCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTN

Query:  HNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGS-THSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHN
                                       +D  +    + G    S + +    V+ RR          +LL +LRW TLG  ++W  + Y     + 
Subjt:  HNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGS-THSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHN

Query:  KIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG
          PS L  L++++A A    G  F+ EA I+NY+    TLG H+D  E D SKP++S S G  AIFLLGG  R + P AMF+ SGD+++M+G +R   H 
Subjt:  KIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG

Query:  VPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV
        VPR+    + E +              + L    S +D      Y+RT+R+N+ +RQV
Subjt:  VPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV

Q13686 Nucleic acid dioxygenase ALKBH1 | e value: 2.8e-30 | % identity: 31.42
Query:  LENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHS-WKFVEENTVSSRRG
        L+  PGF FIP       Q  W+++ L  + Q PN  N                                   ++  +S   T   W   E++    R  
Subjt:  LENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHS-WKFVEENTVSSRRG

Query:  TACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGC
         A K  P S LL KLRW T+G  ++W  + Y     +   PS L  L++++AAA     E+F+ EA I+NY+    TLG H+D  E D SKP++S S G 
Subjt:  TACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGC

Query:  KAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV
         AIFLLGG  R + P AMF+ SGD+++M+G +R   H VPR+  + E E +                +    S +D      Y++T+R+N+ +RQV
Subjt:  KAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEI--------------SFLERHFSNQDDLHFLEYIRTSRININIRQV

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB | e value: 5.0e-43 | % identity: 30.4
Query:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDFKSILECYKQDGALPLGV--NATKCDL-------------DGPVFCLENRPGFYFIPGA
        T F R ++ ++       KS+  K +PK+    +D S V+DF ++    +++  L +    N T  D              +  V+ L+  PGFYFI   
Subjt:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKQ----VDLSEVIDFKSILECYKQDGALPLGV--NATKCDL-------------DGPVFCLENRPGFYFIPGA

Query:  LSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKI---LVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPA-SA
         +  +Q +WI+ +L ++  PPN  N    +GPI++L+   +++ I   L    +  D +++    P   NG                     + +P    
Subjt:  LSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKI---LVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPA-SA

Query:  LLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS
        LL KL WSTLG Q+ W+ R Y     + + P  L EL +++A A     + +  EAA VN+++    +GGHLDD E +  KPI+S+S G  A+FL+G ++
Subjt:  LLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS

Query:  RQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYI--RTSRININIRQVF
        R   PV +F+RSGD+V+M G +R C+HGV +I   E S ++  ++ +  +QD  + ++++  +  R+NIN RQVF
Subjt:  RQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYI--RTSRININIRQVF

Q9SA98 Alpha-ketoglutarate-dependent dioxygenase alkB | e value: 2.8e-139 | % identity: 66.67
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I + +  DG LP G+  +K D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGPI DLF +AK  K+LV+DD                  + + WKF EE  +     ++CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SLPHN IP ALC+LAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE   S++    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

Arabidopsis top hits (e value, % identity, alignment)
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein | e value: 2.0e-140 | % identity: 66.67
Query:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I + +  DG LP G+  +K D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVNATKCDLDGPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL
        C+WI+ESLT+FPQPPNRTNHNAIYGPI DLF +AK  K+LV+DD                  + + WKF EE  +     ++CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRRGTACKSIPASALLRKLRWSTL

Query:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL
        GLQFDWSKR+YD+SLPHN IP ALC+LAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPVAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE   S++    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 1.4e-05 | % identity: 29.06
Query:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG
        P+  +VN++ S   LG H     D    D+ K                     PIVS S+G  A FL G +   D    + L SGDV++    +R  FHG
Subjt:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHG

Query:  V--------PRIFIDEE
        V        PR+F  ++
Subjt:  V--------PRIFIDEE

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 2.6e-10 | % identity: 40.96
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRI
        P+  IVN+++S   LG H D  E++ S     P+VS S+G  A FL G +  +D    + L SGDV+L  G +R+ FHGV  I
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRI

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein | e value: 2.0e-10 | % identity: 30.1
Query:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD
        P  ++  KL    + L  +W      +++ DI     +IP   + L E A R A A +   +G E          P+  IVN+++    LG H D  E++
Subjt:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD

Query:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR
         S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L               +RT R+N+  R
Subjt:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein | e value: 2.0e-10 | % identity: 30.1
Query:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD
        P  ++  KL    + L  +W      +++ DI     +IP   + L E A R A A +   +G E          P+  IVN+++    LG H D  E++
Subjt:  PASALLRKLRWSTLGLQFDWS-----KRSYDISLPHNKIP---SALCELAKRMAAAAM--PTGEE--------FKPEAAIVNYFASGDTLGGHLDDMEAD

Query:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR
         S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L               +RT R+N+  R
Subjt:  WS----KPIVSMSLGCKAIFLLGGKSRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIR
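
In the alignments above, the unlabeled middle line between each Query and Subjt row follows the usual BLAST protein convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a minimal sketch of how identity could be tallied from that middle line, the snippet below counts the three cases for the last block of the first GenBank hit (KAG6573589.1). The helper name `midline_stats` is hypothetical, and the result covers this one block only, not the whole-alignment % identity reported for the hit.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one BLAST midline.

    Letters count as identities; '+' adds to positives only;
    spaces (mismatches or gaps) count toward length alone.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the final alignment block for hit KAG6573589.1, copied from this record.
midline = "RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+ FS++DDLHFLEYIRTSRININIRQVF"

identities, positives, length = midline_stats(midline)
print(f"identities: {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"positives:  {positives}/{length} ({100 * positives / length:.2f}%)")
```

For this block the count is 57 identities and 60 positives over 63 aligned positions. Summing the same counts across all blocks of a hit would be one way to approximate its overall identity, assuming the midline whitespace has been preserved exactly.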


Sequences
CDS sequence
ATGTGTCTGACCAGTCAAAACCCTAAGCTTGATTATCCGTCTCCCGTCCGTCGCCAGAATTCAGAAGCTCAATCTTACGCCGATCGAGCATCTCACCGGCGACAGCGGAT
CCTGGCGGATGAAATGTACGGATCCGACAAAAACACCGACGATTCCGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAATTGTACTATGACGACACCTACAAAT
CTTCGAAAAAGAAAAAACTACCGAAACAAGTCGATTTGTCTGAGGTTATCGATTTCAAGTCCATTCTCGAATGTTACAAACAAGATGGTGCACTTCCGCTGGGCGTGAAT
GCGACTAAGTGCGATCTCGATGGGCCAGTCTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCATTGAGTTTACAAGAGCAATGCCAATGGATCAGGGA
GAGTTTAACGAATTTCCCACAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCAAAGCGAAAGAAAATTCTAGTTGAAG
ATGATGAAATCTCTGATTTCAAAGTTGATTCTGACATTGAACCTTCCATTAGCAATGGAAGTACTCATAGCTGGAAGTTTGTAGAGGAGAATACTGTTTCATCCAGAAGA
GGGACGGCCTGCAAATCAATTCCTGCTTCAGCATTACTTCGAAAATTGCGTTGGAGTACCCTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCC
CCATAATAAGATACCCTCTGCGCTCTGTGAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCAACTGGGGAAGAGTTCAAGCCTGAAGCTGCAATAGTGAATTACTTTG
CTTCAGGTGACACACTCGGTGGTCACTTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTTTAGGTGGTAAG
TCGAGACAGGATCCACCCGTAGCCATGTTTCTTCGAAGCGGAGATGTCGTGCTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCATCGATGA
AGAAAGTGAAGAAATTTCTTTTCTTGAAAGGCATTTCTCAAATCAAGATGATTTGCACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTT
TCTGA
mRNA sequence
ATGTGTCTGACCAGTCAAAACCCTAAGCTTGATTATCCGTCTCCCGTCCGTCGCCAGAATTCAGAAGCTCAATCTTACGCCGATCGAGCATCTCACCGGCGACAGCGGAT
CCTGGCGGATGAAATGTACGGATCCGACAAAAACACCGACGATTCCGAGCGCACCGCTTTCAGAAGAGCAGAAAAGAAATACAAATTGTACTATGACGACACCTACAAAT
CTTCGAAAAAGAAAAAACTACCGAAACAAGTCGATTTGTCTGAGGTTATCGATTTCAAGTCCATTCTCGAATGTTACAAACAAGATGGTGCACTTCCGCTGGGCGTGAAT
GCGACTAAGTGCGATCTCGATGGGCCAGTCTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCATTGAGTTTACAAGAGCAATGCCAATGGATCAGGGA
GAGTTTAACGAATTTCCCACAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTCATTGCAGCAAAGCGAAAGAAAATTCTAGTTGAAG
ATGATGAAATCTCTGATTTCAAAGTTGATTCTGACATTGAACCTTCCATTAGCAATGGAAGTACTCATAGCTGGAAGTTTGTAGAGGAGAATACTGTTTCATCCAGAAGA
GGGACGGCCTGCAAATCAATTCCTGCTTCAGCATTACTTCGAAAATTGCGTTGGAGTACCCTCGGCCTACAATTTGATTGGTCCAAGCGAAGCTATGACATATCTCTGCC
CCATAATAAGATACCCTCTGCGCTCTGTGAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCAACTGGGGAAGAGTTCAAGCCTGAAGCTGCAATAGTGAATTACTTTG
CTTCAGGTGACACACTCGGTGGTCACTTAGATGACATGGAAGCAGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTTTAGGTGGTAAG
TCGAGACAGGATCCACCCGTAGCCATGTTTCTTCGAAGCGGAGATGTCGTGCTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGGATCTTCATCGATGA
AGAAAGTGAAGAAATTTCTTTTCTTGAAAGGCATTTCTCAAATCAAGATGATTTGCACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAGGTTT
TCTGA
Protein sequence
MCLTSQNPKLDYPSPVRRQNSEAQSYADRASHRRQRILADEMYGSDKNTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKQVDLSEVIDFKSILECYKQDGALPLGVN
ATKCDLDGPVFCLENRPGFYFIPGALSLQEQCQWIRESLTNFPQPPNRTNHNAIYGPIQDLFIAAKRKKILVEDDEISDFKVDSDIEPSISNGSTHSWKFVEENTVSSRR
GTACKSIPASALLRKLRWSTLGLQFDWSKRSYDISLPHNKIPSALCELAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGK
SRQDPPVAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHFSNQDDLHFLEYIRTSRININIRQVF
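
The protein sequence above should correspond to a codon-by-codon translation of the CDS. As a small, hedged sketch of that relationship, the snippet below builds the standard genetic code (NCBI translation table 1) and translates the first and last few codons of the CDS. The function name `translate` is hypothetical, and the use of the standard nuclear code is an assumption, since this record does not say which table was applied.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First 30 nt and last 9 nt of the CDS listed in this record.
print(translate("ATGTGTCTGACCAGTCAAAACCCTAAGCTT"))  # MCLTSQNPKL
print(translate("GTTTTCTGA"))                       # VF*
```

The two outputs match the start (`MCLTSQNPKL`) and end (`...QVF`) of the listed protein sequence, with the final `TGA` read as the stop codon.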