; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0001573 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0001573
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptionalpha-ketoglutarate-dependent dioxygenase alkB
Genome locationchr10:3983260..3985655
RNA-Seq ExpressionIVF0001573
SyntenyIVF0001573
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135327.1 alpha-ketoglutarate-dependent dioxygenase alkB isoform X1 [Cucumis sativus]1.28e-25896.69Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESY+QDGSLPVGV ATTCDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESLM+FPQPPNRTNHNAIYGPIQDLFIAAKE KVLVEHDEISDF LDSDVEPSISNG+THNWKFVEENTVSSRRGTA KSI ASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE HL+NQDDLH LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

XP_008446031.1 PREDICTED: LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB [Cucumis melo]4.52e-26899.72Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

XP_022945036.1 alpha-ketoglutarate-dependent dioxygenase alkB [Cucurbita moschata]6.57e-23888.98Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK ILE Y QDG+LP+GV AT CDLD PVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESL DFPQPPNRTNHNAIYGPIQDLFIAAKEK+VLVE +EIS  N+DSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSY+ISLPHN+IPSALCQL KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+  S++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

XP_031740902.1 alpha-ketoglutarate-dependent dioxygenase alkB isoform X2 [Cucumis sativus]2.71e-24192.58Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIP-GALSLQE
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESY+QDGSLPVGV ATTCDLD PVFCLENRPG    P   L L++
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIP-GALSLQE

Query:  QCQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWST
        +    RESLM+FPQPPNRTNHNAIYGPIQDLFIAAKE KVLVEHDEISDF LDSDVEPSISNG+THNWKFVEENTVSSRRGTA KSI ASVLLRKLRWST
Subjt:  QCQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWST

Query:  LGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMF
        LGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMF
Subjt:  LGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMF

Query:  LRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        LRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE HL+NQDDLH LEYIRTSRININIRQVF
Subjt:  LRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

XP_038891364.1 alpha-ketoglutarate-dependent dioxygenase alkB [Benincasa hispida]8.12e-24692.56Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDK TDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK IL  YKQDG+LP+GV A  CDLDRPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESL +FPQP NRTNHNAIYG IQDLFIAAK KKVLVE DEISDFN+DSDVE S+SNG+THNWKFVEENTVSS+RGTACKSI ASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSY+ISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEIS LERH SNQDDLHFLEYI+ SRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

TrEMBL top hitse value%identityAlignment
A0A0A0KS00 Fe2OG dioxygenase domain-containing protein7.5e-20396.69Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESY+QDGSLPVGV ATTCDLD PVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESLM+FPQPPNRTNHNAIYGPIQDLFIAAKE KVLVEHDEISDF LDSDVEPSISNG+THNWKFVEENTVSSRRGTA KSI ASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLE HL+NQDDLH LEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

A0A1S3BE32 LOW QUALITY PROTEIN: alpha-ketoglutarate-dependent dioxygenase alkB4.9e-21099.72Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKA FLLGGKSRQDPPIAMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

A0A6J1DG90 alpha-ketoglutarate-dependent dioxygenase alkB1.7e-18387.33Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDD ERTAFR AEKKYK YYDDT+KSSKKKKLPK VDLSEVIDFK ILE Y  DG+LP+G+ AT CDLD PVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESL  FPQPPNRTNHNAIYGPIQDLFIAAKEK+VLVE +EIS FN+DSD+EPS+S G+TH WKFVEENTVSSRR T CKS+ ASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSY+ISL HNK+PSALCQLAKRMAAAAMP GEEFKPEAAIVNYFASGD+LGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIFID E++E S LE+  SNQDDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

A0A6J1FZQ9 alpha-ketoglutarate-dependent dioxygenase alkB4.4e-18788.98Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDD+ERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK ILE Y QDG+LP+GV AT CDLD PVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQWIRESL DFPQPPNRTNHNAIYGPIQDLFIAAKEK+VLVE +EIS  N+DSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSY+ISLPHN+IPSALCQL KRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPP+AMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+  S++DDLHFLEYIRTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

A0A6J1HQB4 alpha-ketoglutarate-dependent dioxygenase alkB1.2e-18488.71Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPK VDLSEVIDFK ILE Y QDG+LP+GV AT CDLD PVFCLENRPGFYFIPGALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        CQ IRESL DFPQPPNRTNHNAIYGPIQDLFIAAKEK+VLVE +EIS  N+DSD EPS+ NG+++ WKFVEENTVSSRRGT CKS+ AS LLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKRSY+ISLPHN IPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQD P+AMFL
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHGVPRIF DEESEEIS LE+  S++DDLHFLEY+RTSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

SwissProt top hitse value%identityAlignment
O60066 Alpha-ketoglutarate-dependent dioxygenase abh17.2e-2535.29Show/hide
Query:  RKLRWSTLGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQ
        +KLRW TLG Q+DW+ + Y         P  L    +++   +      +K EAAIVN+++ GDTL  H+D+ E D + P++S+S+G   I+L+G +SR 
Subjt:  RKLRWSTLGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQ

Query:  DPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQV
        + P A+ L SGDVV+M G +R+ FH VP+I  +     +    +            +I   R+N N+RQV
Subjt:  DPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQV

P0CB42 Nucleic acid dioxygenase ALKBH11.5e-3030.53Show/hide
Query:  KKKLPKHVDLSEVIDFK--NILESYK----QDGSLPVGVKATT-CDLDR----PV-----FCLENRPGFYFIPGALSLQEQCQWIRESLMDFPQPPNRTN
        ++  P   DL  VIDF   ++  S K    Q    P+ V + T  D +R    PV     + LE  PGF FIP       Q  W+++ L  + Q PN  N
Subjt:  KKKLPKHVDLSEVIDFK--NILESYK----QDGSLPVGVKATT-CDLDR----PV-----FCLENRPGFYFIPGALSLQEQCQWIRESLMDFPQPPNRTN

Query:  HNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTLGLQFDWSKRSYNISLPHNK
                 D  +  +E + L E  +                      + +    V+ RR  +        LL +LRW TLG  ++W  + Y+    +  
Subjt:  HNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTLGLQFDWSKRSYNISLPHNK

Query:  IPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGV
         PS L  L++++A A    G  F+ EA I+NY+    TLG H+D  E D SKP++S S G  AIFLLGG  R + P AMF+ SGD+++M+G +R   H V
Subjt:  IPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGV

Query:  PRIFIDEESEEI--------------SFLERHLSNQDDLHFLEYIRTSRININIRQV
        PR+    + E +              + L    S +D      Y+RT+R+N+ +RQV
Subjt:  PRIFIDEESEEI--------------SFLERHLSNQDDLHFLEYIRTSRININIRQV

Q13686 Nucleic acid dioxygenase ALKBH13.0e-3131.86Show/hide
Query:  LENRPGFYFIPGALSLQEQCQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGT
        L+  PGF FIP       Q  W+++ L  + Q PN  N         D  ++ +E + L E  +                      +F+     + RR  
Subjt:  LENRPGFYFIPGALSLQEQCQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGT

Query:  ACKSISASVLLRKLRWSTLGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCK
        +        LL KLRW T+G  ++W  + Y+    +   PS L  L++++AAA     E+F+ EA I+NY+    TLG H+D  E D SKP++S S G  
Subjt:  ACKSISASVLLRKLRWSTLGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCK

Query:  AIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISF-LERHL-------------SNQDDLHFLEYIRTSRININIRQV
        AIFLLGG  R + P AMF+ SGD+++M+G +R   H VPR+  + E E +   LE  L             S +D      Y++T+R+N+ +RQV
Subjt:  AIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISF-LERHL-------------SNQDDLHFLEYIRTSRININIRQV

Q54N08 Alpha-ketoglutarate-dependent dioxygenase alkB9.0e-4431.03Show/hide
Query:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKH----VDLSEVIDFKNILESYKQDGSLPVGV--KATTCDL-------------DRPVFCLENRPGFYFIPGA
        T F R ++ ++       KS+  K +PK     +D S V+DF N+  + +++  L +      TT D              +  V+ L+  PGFYFI   
Subjt:  TAFRRAEKKYKLYYDDTYKSSKKKKLPKH----VDLSEVIDFKNILESYKQDGSLPVGV--KATTCDL-------------DRPVFCLENRPGFYFIPGA

Query:  LSLQEQCQWIRESLMDFPQPPNRTNHNAIYGPIQDLF------IAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISA
         +  +Q +WI+ +L D+  PPN  N    +GPI++L+      +  +E K   +HD   D  ++    P   NG         E   + R+         
Subjt:  LSLQEQCQWIRESLMDFPQPPNRTNHNAIYGPIQDLF------IAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISA

Query:  SVLLRKLRWSTLGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGG
          LL KL WSTLG Q+ W+ R Y+    + + P  L +L +++A A     + +  EAA VN+++    +GGHLDD E +  KPI+S+S G  A+FL+G 
Subjt:  SVLLRKLRWSTLGLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGG

Query:  KSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYI--RTSRININIRQVF
        ++R   P+ +F+RSGD+V+M G +R C+HGV +I   E S ++  ++ +  +QD  + ++++  +  R+NIN RQVF
Subjt:  KSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYI--RTSRININIRQVF

Q9SA98 Alpha-ketoglutarate-dependent dioxygenase alkB1.9e-13967.22Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I +++  DG LP G++ +  D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        C+WI+ESL  FPQPPNRTNHNAIYGPI DLF +AKE KVLV+ D                  + + WKF EE  +     ++CKS+SASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKR+Y++SLPHN IP ALCQLAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE  LS++    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein1.4e-14067.22Show/hide
Query:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ
        MY S   +DD++RTAFRRAEKKYKLYY+   K S+KKKLPK +DLSE++DF  I +++  DG LP G++ +  D   PVFC++NRPGFYFIP ALSL+EQ
Subjt:  MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQ

Query:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL
        C+WI+ESL  FPQPPNRTNHNAIYGPI DLF +AKE KVLV+ D                  + + WKF EE  +     ++CKS+SASVLLRKLRWSTL
Subjt:  CQWIRESLMDFPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTL

Query:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL
        GLQFDWSKR+Y++SLPHN IP ALCQLAK  AA AMP GEEF+PE AIVNYF  GDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKS+ DPP AM+L
Subjt:  GLQFDWSKRSYNISLPHNKIPSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFL

Query:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF
        RSGDVVLMAGEARECFHG+PRIF  EE+ +I  LE  LS++    F EYI+TSRININIRQVF
Subjt:  RSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLSNQDDLHFLEYIRTSRININIRQVF

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein1.0e-0529.06Show/hide
Query:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHG
        P+  +VN++ S   LG H     D    D+ K                     PIVS S+G  A FL G +   D    + L SGDV++    +R  FHG
Subjt:  PEAAIVNYFASGDTLGGH----LDDMEADWSK---------------------PIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHG

Query:  V--------PRIFIDEE
        V        PR+F  ++
Subjt:  V--------PRIFIDEE

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein3.0e-1040.96Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRI
        P+  IVN+++S   LG H D  E++ S     P+VS S+G  A FL G +  +D    + L SGDV+L  G +R+ FHGV  I
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRI

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein6.7e-1034.71Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLS
        P+  IVN+++    LG H D  E++ S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L     
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLS

Query:  NQDDLHFLEYIRTSRININIR
                  +RT R+N+  R
Subjt:  NQDDLHFLEYIRTSRININIR

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein6.7e-1034.71Show/hide
Query:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLS
        P+  IVN+++    LG H D  E++ S     PIVS S+G  A FL G K   +    + L SGDV++  GE+R  FHGV  I     S  +S L     
Subjt:  PEAAIVNYFASGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEEISFLERHLS

Query:  NQDDLHFLEYIRTSRININIR
                  +RT R+N+  R
Subjt:  NQDDLHFLEYIRTSRININIR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGGATCAGACAAAGGCACCGACGATTCCGAGAGAACCGCTTTCAGAAGAGCAGAAAAGAAATACAAACTATACTACGATGACACCTACAAATCTTCCAAAAAGAA
AAAACTACCGAAACATGTCGATTTGTCCGAGGTTATCGATTTCAAGAACATCCTCGAATCTTACAAACAAGATGGTTCACTTCCTGTGGGCGTGAAGGCGACTACATGCG
ATCTAGATAGGCCAGTTTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCGTTGAGTTTACAAGAGCAATGCCAATGGATTAGGGAGAGTTTAATGGAT
TTCCCGCAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTTATTGCAGCTAAGGAAAAGAAAGTTTTAGTTGAACATGATGAAATCTC
TGATTTCAACCTTGATTCTGATGTTGAACCTTCCATTAGCAATGGAAGTACTCATAACTGGAAGTTTGTGGAGGAGAATACTGTTTCGTCCAGAAGAGGAACGGCCTGCA
AATCAATTTCTGCTTCAGTATTACTTAGAAAGTTGCGTTGGAGTACACTTGGCCTACAATTTGATTGGTCCAAGCGAAGCTATAACATATCTCTACCCCATAATAAGATA
CCCTCTGCACTATGTCAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCAACTGGGGAAGAATTCAAACCTGAAGCTGCAATAGTGAATTATTTTGCTTCAGGCGACAC
GCTCGGTGGTCACCTAGATGACATGGAAGCTGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTTTGGGTGGTAAATCAAGACAGGATC
CACCCATAGCCATGTTTCTTCGAAGCGGAGATGTCGTGTTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGTATCTTCATCGATGAAGAAAGTGAAGAA
ATTTCTTTTCTTGAAAGGCATTTATCAAATCAAGATGATTTGCACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACATTAGACAAGTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
AAAACTCCAAACGAAGAGAAAAAAGAGGTGTCTGACCAGTCAAAACCCTAGCCTCGATTATCCTTCTCCCGTCTTTCGCCAGAATTCAGAAGCTCAATCTTACGTTGATC
GAGCATCTTGCCGGCGACGGCAAAATGTACGGATCAGACAAAGGCACCGACGATTCCGAGAGAACCGCTTTCAGAAGAGCAGAAAAGAAATACAAACTATACTACGATGA
CACCTACAAATCTTCCAAAAAGAAAAAACTACCGAAACATGTCGATTTGTCCGAGGTTATCGATTTCAAGAACATCCTCGAATCTTACAAACAAGATGGTTCACTTCCTG
TGGGCGTGAAGGCGACTACATGCGATCTAGATAGGCCAGTTTTCTGCTTGGAGAATCGTCCTGGGTTTTATTTTATTCCTGGAGCGTTGAGTTTACAAGAGCAATGCCAA
TGGATTAGGGAGAGTTTAATGGATTTCCCGCAGCCTCCCAACAGAACCAACCACAATGCTATTTATGGACCAATTCAAGACCTGTTTATTGCAGCTAAGGAAAAGAAAGT
TTTAGTTGAACATGATGAAATCTCTGATTTCAACCTTGATTCTGATGTTGAACCTTCCATTAGCAATGGAAGTACTCATAACTGGAAGTTTGTGGAGGAGAATACTGTTT
CGTCCAGAAGAGGAACGGCCTGCAAATCAATTTCTGCTTCAGTATTACTTAGAAAGTTGCGTTGGAGTACACTTGGCCTACAATTTGATTGGTCCAAGCGAAGCTATAAC
ATATCTCTACCCCATAATAAGATACCCTCTGCACTATGTCAACTTGCCAAAAGAATGGCGGCAGCTGCAATGCCAACTGGGGAAGAATTCAAACCTGAAGCTGCAATAGT
GAATTATTTTGCTTCAGGCGACACGCTCGGTGGTCACCTAGATGACATGGAAGCTGACTGGAGCAAGCCAATTGTTAGCATGAGTTTGGGTTGCAAAGCTATTTTTCTTT
TGGGTGGTAAATCAAGACAGGATCCACCCATAGCCATGTTTCTTCGAAGCGGAGATGTCGTGTTAATGGCTGGAGAAGCAAGGGAATGTTTCCATGGTGTACCACGTATC
TTCATCGATGAAGAAAGTGAAGAAATTTCTTTTCTTGAAAGGCATTTATCAAATCAAGATGATTTGCACTTTCTGGAATACATAAGAACTTCAAGAATAAACATCAACAT
TAGACAAGTTTTCTGATTTATTGGACGTGTCTGTCATCTTTCAATCTTCAAGATGGTTTTCAGGCTGCTTCAAAATTTTGCCTGGATGACGGGGGAAGGGAGTGGCTTTG
CAAACTCATAGACAGGTGCAAAAAGAGATCGATGACAATCTGATTGCAGACCGAAGATCCATGAAGCTTAGGATCAAGCAATGGTTTTGGTTGGCAATGGGTGGGGATAT
TGCAGGACAGGCACAAATAAAGGTATTGTGCTGTGCAAGTTCTTCTAGATGATTTAATTTAAAATACGAGAGCTGTTTACGTCCGCTTGGTAACAATTTTTCTTTTTTTC
CTTTTTAAATAGTAAGTTTATTTTCTATCCAAAGTATCCTTTAGTTTTAGATAGTACTGATATTTACTACCTGAATTGACATG
Protein sequenceShow/hide protein sequence
MYGSDKGTDDSERTAFRRAEKKYKLYYDDTYKSSKKKKLPKHVDLSEVIDFKNILESYKQDGSLPVGVKATTCDLDRPVFCLENRPGFYFIPGALSLQEQCQWIRESLMD
FPQPPNRTNHNAIYGPIQDLFIAAKEKKVLVEHDEISDFNLDSDVEPSISNGSTHNWKFVEENTVSSRRGTACKSISASVLLRKLRWSTLGLQFDWSKRSYNISLPHNKI
PSALCQLAKRMAAAAMPTGEEFKPEAAIVNYFASGDTLGGHLDDMEADWSKPIVSMSLGCKAIFLLGGKSRQDPPIAMFLRSGDVVLMAGEARECFHGVPRIFIDEESEE
ISFLERHLSNQDDLHFLEYIRTSRININIRQVF