; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012887 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012887
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationscaffold1:16192348..16195076
RNA-Seq ExpressionSpg012887
SyntenySpg012887
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]1.7e-17770.27Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL
        ML IRTVP S  PWS+ L RLLFA S        RLL+FQRMDSFGSS    ALPDS   GSS GGNEE LHN DHNSNVIM G IPV LN +GN Q+SL
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL

Query:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV
        SRLSV +CDDF+L  DQKGIPAN+PSSYHDDEF PV RQNTK RSRIDLG ER LKNSTSS Q+ER                      N P      RS 
Subjt:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV

Query:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG
        DIG KNSL T NLPP   F+IC  ERRG +  R  WQ K + T +            V+RPGMVLLKHYI L EQ+NIV+T Q LG+GPGGFYQPGYKDG
Subjt:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG
        AKLRLQMMCLGLDWDPQTRKY  KR  DGN+PPD+P +F+ILV +AL DAHALIKN  +T+NIEDILP MSPDICIVNFY+T+GRLGLHQDRDES+ESL 
Subjt:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
         GLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

XP_016903166.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo]1.1e-17668.78Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSL---GSSG-GGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQK
        M  IRT+P+ PSP S+ L RLLF AS F G R F LL+FQRMDSF SSANSHA PDS     S G G ++EHL + D+ S+VI  GS  V LN +    K
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSL---GSSG-GGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQK

Query:  SLSRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTKRSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRS
        SL+ LS  +CD  E+G D+ GI +N P SYH DEF PVSRQNT+R+RIDLG +R LK++  SFQVER E FN+  Q  E SLP  FG+KN   +   R+S
Subjt:  SLSRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTKRSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRS

Query:  VDIGPKNSLVTD-NLPPSGPFNICSSERRGNANPRTYWQVKGKGT---NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQM
        +DIG K S+VTD +LP   PF+IC     GN   R +W+VK  GT    R+LRPGMVLLKHYIT PEQINIV+TCQ LG+GPGGFYQP YKDGAKLRL+M
Subjt:  VDIGPKNSLVTD-NLPPSGPFNICSSERRGNANPRTYWQVKGKGT---NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQM

Query:  MCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVS
        MCLGLDWDPQTR+Y+NKR VDGN+PPDIP  FS LVK ALKDAHA IKN+CN SN+EDILP MSPDICI NFYTT+GRLGLHQDRDESKESL SGLPVVS
Subjt:  MCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVS

Query:  FSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        FS+GN+AEFLYGD+RDV+KAEK+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  FSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

XP_022144035.1 uncharacterized protein LOC111013827 [Momordica charantia]2.2e-17257.68Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSLGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSLSR
        M  IRTVPI  SP S+ LHRLLFA+SRF G R  RLL+F+RMDS  +SA SH          G   E+ HN  H+S+++M G IPV LN +   ++S S 
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSLGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSLSR

Query:  LSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGL-----------------------------------------------
         SVN+ DDFELGR++K  PANVP+SYHDD+F PVSRQN K RSR+DLGLER +                                               
Subjt:  LSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGL-----------------------------------------------

Query:  ------------------------------------------------------------KNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVP
                                                                     N+TSSFQVE F L NN SQLDE S PNQFG+KN P YV 
Subjt:  ------------------------------------------------------------KNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVP

Query:  NRRSVDIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGT------------NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQP
          +S+DIG KNSLV DNL P  PF+IC  ERRGNA P  +WQ KG+ T             RVLRPGMVLLK+YITL EQ+NIV+TCQ LGVGPGGFY+P
Subjt:  NRRSVDIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGT------------NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQP

Query:  GYKDGAKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDES
        GYKDGAKLRLQMMCLGLDWDPQTRKY +KRAVDG++PP+IP KF+ILV +ALKDAHALIKN+CNT N+E ILP MSPDICIVNFYTT+GRLGLHQDRDES
Subjt:  GYKDGAKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDES

Query:  KESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        KESL SGLPVVS SLG+SAEFLYGD+RDVDKAEK++LESGDVLIFGG+SRH+FHGVSSIIP STPKFLL HTGLRPGRLNLTFRKY
Subjt:  KESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

XP_022924913.1 uncharacterized protein LOC111432318 [Cucurbita moschata]4.5e-17870.27Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL
        ML IRTVP S  PWS+ L RLLFA S        RLL+FQR+DSFGSS    ALPDS   GSS GGNEE LHN DHNSNVIM G IPV LN +GN Q+SL
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL

Query:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV
        SRLSV +CDDF+L  DQKGIPAN+PSSYHDDEF PV RQNTK RSRIDLG ER LKNSTSS Q+ER                      N P      RS 
Subjt:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV

Query:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG
        DIG KNSL T NLPP   F+IC  ERRG + PR  WQ K + T +            V+RPGMVLLKHYI L EQ+NIV+T Q LG+GPGGFYQPGYKDG
Subjt:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG
        AKLRLQMMCLGLDWDPQTRKY  KR  DGN+PPD+P +F+ILV +AL DAHALIKN  +T+NIEDILP MSPDICIVNFY+T+GRLGLHQDRDES+ESL 
Subjt:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
         GLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

XP_023517205.1 uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo]7.2e-17670.06Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL
        M  IRTVP S  P S+ L RLLFA S        RLL+FQRMDSFGSS    ALPDS   GSS GGNEE LHN DHNSNVIM G IPV LN +GN Q+SL
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL

Query:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV
        SRLSV +CDDF+L  DQK IPAN+PSSYHDDEF PV RQNTK RSRIDLG ER LKNSTSS Q+ER                      N P      RS 
Subjt:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV

Query:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG
        DIG KNSL T NLPP   F+IC  ERRG + PR  WQ K + T +            V+RPGMVLLKHYI L EQ+NIV+T Q LG+GPGGFYQPGYKDG
Subjt:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG
        AKLRLQMMCLGLDWDPQTRKY  KR  DGN+PPD+P +F+ILV +AL DAHALIKN  +T+NIEDILP MSPDICIVNFY+T+GRLGLHQDRDES+ESL 
Subjt:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        SGLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A0A0KY56 Fe2OG dioxygenase domain-containing protein1.7e-16265.61Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSS--GGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQK
        M  IRT+P+ PSP S+ L RLLF AS F   R FRLL+FQ MDSF +SANSHALPDS   GSS   G ++EHLH+ D++S+VI  GSIPV LN +     
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSS--GGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQK

Query:  SLSRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTKRSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRS
                                  P SY+ DE  PV RQNT+RSRIDLG +R LK++  S+QVER E  N++ Q  + SLP  FG+KN   +V   +S
Subjt:  SLSRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTKRSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRS

Query:  VDIGPKNSLVTDN-LPPSGPFNICSSERRGNANPRTYWQVKGKGT---NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQM
        +D GPK S+VTDN LP   PF+IC     GN   R  + VK  GT    R+LRPGMVLLKHYIT  EQINIV+TCQNLG+GPGGFYQPGYKDGAKLRL+M
Subjt:  VDIGPKNSLVTDN-LPPSGPFNICSSERRGNANPRTYWQVKGKGT---NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQM

Query:  MCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVS
        MCLGLDWDPQTR+YENKR VDGN+PPDIP +F+ LVK+ALKDAHA IKN CN SN+E+ILP MSPDICI NFYTT GRLGLHQDRDESKESL  GLPVVS
Subjt:  MCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVS

Query:  FSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        FS+GN+AEFLYGD+R+VDKAE + LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
Subjt:  FSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

A0A1S4E4K6 uncharacterized protein LOC1035021835.3e-17768.78Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSL---GSSG-GGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQK
        M  IRT+P+ PSP S+ L RLLF AS F G R F LL+FQRMDSF SSANSHA PDS     S G G ++EHL + D+ S+VI  GS  V LN +    K
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSL---GSSG-GGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQK

Query:  SLSRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTKRSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRS
        SL+ LS  +CD  E+G D+ GI +N P SYH DEF PVSRQNT+R+RIDLG +R LK++  SFQVER E FN+  Q  E SLP  FG+KN   +   R+S
Subjt:  SLSRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTKRSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRS

Query:  VDIGPKNSLVTD-NLPPSGPFNICSSERRGNANPRTYWQVKGKGT---NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQM
        +DIG K S+VTD +LP   PF+IC     GN   R +W+VK  GT    R+LRPGMVLLKHYIT PEQINIV+TCQ LG+GPGGFYQP YKDGAKLRL+M
Subjt:  VDIGPKNSLVTD-NLPPSGPFNICSSERRGNANPRTYWQVKGKGT---NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQM

Query:  MCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVS
        MCLGLDWDPQTR+Y+NKR VDGN+PPDIP  FS LVK ALKDAHA IKN+CN SN+EDILP MSPDICI NFYTT+GRLGLHQDRDESKESL SGLPVVS
Subjt:  MCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVS

Query:  FSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        FS+GN+AEFLYGD+RDV+KAEK+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Subjt:  FSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138271.0e-17257.68Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSLGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSLSR
        M  IRTVPI  SP S+ LHRLLFA+SRF G R  RLL+F+RMDS  +SA SH          G   E+ HN  H+S+++M G IPV LN +   ++S S 
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSLGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSLSR

Query:  LSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGL-----------------------------------------------
         SVN+ DDFELGR++K  PANVP+SYHDD+F PVSRQN K RSR+DLGLER +                                               
Subjt:  LSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGL-----------------------------------------------

Query:  ------------------------------------------------------------KNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVP
                                                                     N+TSSFQVE F L NN SQLDE S PNQFG+KN P YV 
Subjt:  ------------------------------------------------------------KNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVP

Query:  NRRSVDIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGT------------NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQP
          +S+DIG KNSLV DNL P  PF+IC  ERRGNA P  +WQ KG+ T             RVLRPGMVLLK+YITL EQ+NIV+TCQ LGVGPGGFY+P
Subjt:  NRRSVDIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGT------------NRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQP

Query:  GYKDGAKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDES
        GYKDGAKLRLQMMCLGLDWDPQTRKY +KRAVDG++PP+IP KF+ILV +ALKDAHALIKN+CNT N+E ILP MSPDICIVNFYTT+GRLGLHQDRDES
Subjt:  GYKDGAKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDES

Query:  KESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        KESL SGLPVVS SLG+SAEFLYGD+RDVDKAEK++LESGDVLIFGG+SRH+FHGVSSIIP STPKFLL HTGLRPGRLNLTFRKY
Subjt:  KESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323182.2e-17870.27Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL
        ML IRTVP S  PWS+ L RLLFA S        RLL+FQR+DSFGSS    ALPDS   GSS GGNEE LHN DHNSNVIM G IPV LN +GN Q+SL
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL

Query:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV
        SRLSV +CDDF+L  DQKGIPAN+PSSYHDDEF PV RQNTK RSRIDLG ER LKNSTSS Q+ER                      N P      RS 
Subjt:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV

Query:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG
        DIG KNSL T NLPP   F+IC  ERRG + PR  WQ K + T +            V+RPGMVLLKHYI L EQ+NIV+T Q LG+GPGGFYQPGYKDG
Subjt:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG
        AKLRLQMMCLGLDWDPQTRKY  KR  DGN+PPD+P +F+ILV +AL DAHALIKN  +T+NIEDILP MSPDICIVNFY+T+GRLGLHQDRDES+ESL 
Subjt:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
         GLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

A0A6J1HTF0 uncharacterized protein LOC1114660085.7e-17167.78Show/hide
Query:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL
        M  IRTVP S  PWS+ L +LLFA S        RLL+FQRMDSFGSS    ALP+S   GSS GGNEE LHN DHNSNVIM G IPV LN +GN Q+SL
Subjt:  MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDS--LGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSL

Query:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV
        SRLSV +CDDF+L  DQKGIPAN+PS YHDDEF PV RQNTK RSRID G ER LKNSTSS Q++R                      N P      RS 
Subjt:  SRLSVNQCDDFELGRDQKGIPANVPSSYHDDEFAPVSRQNTK-RSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSV

Query:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG
        DIG KNSL T NLPP   F+IC  ERRG + PR  WQ K + T +            V+RPGMVLLKHYI L EQ+NIV+T Q LG+GPGGFYQPGYKDG
Subjt:  DIGPKNSLVTDNLPPSGPFNICSSERRGNANPRTYWQVKGKGTNR------------VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG
        AKLRLQMMCLGLDWDPQTRKY  KR  DGN+PPD+P +F+ILV +AL DAHALIKN  + + IEDILP MSPDICIVNFY+T GRLGLHQDRDES+ESL 
Subjt:  AKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLG

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        SGLPVVSFSLGNSA FLYGD+R+VDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKS PKFLL HTG RPG LNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog3.3e-1441.74Show/hide
Query:  PLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHT
        P   PD C+VN Y    R+GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        +
Subjt:  PLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHT

Query:  GLRP--GRLNLTFRK
         L P  GR+NLT R+
Subjt:  GLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh19.5e-1431.68Show/hide
Query:  RLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGL
        +L+ + LG  +D  T++Y      D ++ P  P      V++ +K++   +                  +  IVNFY+    L  H   DES+E L   L
Subjt:  RLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGL

Query:  PVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        P++S S+G    +L G +   +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  PVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB1.6e-1335.2Show/hide
Query:  NECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSII
        N C  +      P   PD C++N Y    +L LHQD+DE         P+VS SLG  A F +G  +  D  ++++LE GDV+++GGESR  +HG+  + 
Subjt:  NECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSII

Query:  PKSTPKFLLHHTGLRPGRLNLTFRK
            P  +         R NLTFR+
Subjt:  PKSTPKFLLHHTGLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog3.3e-1441.74Show/hide
Query:  PLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHT
        P   PD C+VN Y    R+GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        +
Subjt:  PLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHT

Query:  GLRP--GRLNLTFRK
         L P  GR+NLT R+
Subjt:  GLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB1.6e-1339.45Show/hide
Query:  PDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRP
        PD C++N Y    +L LHQD+DE         P+VS SLG  A F +G  R  D  ++I+LE GD++++GGESR  +HG+        P     H     
Subjt:  PDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRP

Query:  GRLNLTFRK
         R NLTFR+
Subjt:  GRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein7.2e-0939.76Show/hide
Query:  PDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSI
        P+  IVN++     LG H D  E+  S     P+VS SLG  A FL G +   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein7.6e-6748.31Show/hide
Query:  GPKNSLVTDNLPPSGPFNICSSERRGNANP------RTYWQVKGKGTNRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMM
        GP NS    N   S PF+I   ++     P      R   +     +  V+RPGMVLLK+Y+++  Q+ IV  C+ LG+G GGFYQPG++DG  L L+MM
Subjt:  GPKNSLVTDNLPPSGPFNICSSERRGNANP------RTYWQVKGKGTNRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMM

Query:  CLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQ------------------
        CLG +WD QTR+Y   R +DG+ PP IP +FS LV++A+K++ +L+    N +   D +PL+ PDIC+VNFYT+ G+LGLHQ                  
Subjt:  CLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQ------------------

Query:  ---DRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSI
           D+ ESK+SL  GLP+VSFS+G+SAEFLYGDQ+DVDKA+ ++LESGDVLIFG  SR++FHGV SI
Subjt:  ---DRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein1.7e-7959.82Show/hide
Query:  VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNE
        V+RPGMVLLK+Y+++ +Q+ IV  C+ LG+G GGFYQPGY+D AKL L+MMCLG +WDP+T +Y   R  DG+  P IP++F+  V++A+K++ +L  + 
Subjt:  VLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNE

Query:  CNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPK
           +   D +P M PDICIVNFY++ GRLGLHQD+DES+ S+  GLPVVSFS+G+SAEFLYGDQRD DKAE + LESGDVL+FGG SR +FHGV SI   
Subjt:  CNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPK

Query:  STPKFLLHHTGLRPGRLNLTFRKY
        + PK LL  T LRPGRLNLTFR+Y
Subjt:  STPKFLLHHTGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein1.8e-8458.11Show/hide
Query:  PPSGPFNICSSERRGNANPRTYW---------QVKGKGTNRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMMCLGLDWDP
        PP  PF+ICSS    N      W          V+    ++V+RPGMVLLK ++T   Q++IV+TC+ LGV P GFYQPGY  G+KL LQMMCLG +WDP
Subjt:  PPSGPFNICSSERRGNANPRTYW---------QVKGKGTNRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMMCLGLDWDP

Query:  QTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEF
        QT KY     +D ++ P+IP  F++LV++A+++AHALI  E  T + E ILP+MSPDICIVNFY+  GRLGLHQDRDES+ES+  GLP+VSFS+G+SAEF
Subjt:  QTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEF

Query:  LYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        LYG++RDV++A+ ++LESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  LYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein1.8e-8458.11Show/hide
Query:  PPSGPFNICSSERRGNANPRTYW---------QVKGKGTNRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMMCLGLDWDP
        PP  PF+ICSS    N      W          V+    ++V+RPGMVLLK ++T   Q++IV+TC+ LGV P GFYQPGY  G+KL LQMMCLG +WDP
Subjt:  PPSGPFNICSSERRGNANPRTYW---------QVKGKGTNRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMMCLGLDWDP

Query:  QTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEF
        QT KY     +D ++ P+IP  F++LV++A+++AHALI  E  T + E ILP+MSPDICIVNFY+  GRLGLHQDRDES+ES+  GLP+VSFS+G+SAEF
Subjt:  QTRKYENKRAVDGNRPPDIPSKFSILVKQALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEF

Query:  LYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
        LYG++RDV++A+ ++LESGDVLIFGGESR IFHGV SIIP S P  LL+ + LR GRLNLTFR +
Subjt:  LYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGTCGATCCGTACAGTTCCGATTTCGCCGTCCCCGTGGTCGAGTCATCTTCATCGGCTTTTATTCGCCGCTTCTCGATTTACCGGCGAACGCAGATTTCGATTGCT
TCGATTTCAGCGGATGGATTCGTTTGGCAGTTCGGCAAATAGCCATGCATTACCTGATTCTCTTGGTAGTTCTGGTGGCGGCAACGAGGAACATTTGCATAATGGAGATC
ATAATTCAAATGTGATAATGTTTGGAAGTATTCCTGTGTGTCTAAATTGCAGGGGAAATGGACAGAAATCTTTATCTCGGTTGTCTGTTAATCAATGTGATGATTTTGAG
TTGGGAAGAGATCAAAAAGGGATTCCTGCAAATGTACCGAGTTCTTACCATGATGATGAGTTTGCACCTGTTTCTAGACAAAATACTAAAAGAAGCCGGATAGATTTAGG
GTTGGAAAGAGGTTTGAAGAATAGTACAAGCTCATTTCAAGTGGAGAGGTTTGAATTGTTCAACAATGCCAGTCAGCTGGATGAATTATCTCTTCCTAATCAATTTGGGC
GGAAAAATGGACCATGTTATGTCCCTAATCGCCGGTCTGTGGATATCGGTCCGAAAAATTCTCTAGTTACAGACAATTTGCCTCCCTCTGGACCATTCAACATATGTTCC
TCTGAAAGAAGAGGTAATGCAAACCCCAGAACTTATTGGCAAGTTAAAGGCAAGGGTACTAATAGAGTGCTGAGGCCTGGAATGGTTTTACTGAAGCATTACATTACTTT
ACCTGAACAGATCAATATAGTGGAAACTTGTCAAAATCTTGGTGTTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGCGCAAAACTTAGGCTTCAGATGATGT
GTCTTGGATTGGACTGGGATCCTCAAACGAGGAAATATGAAAATAAGCGGGCTGTCGATGGTAATAGACCACCAGATATACCTTCTAAATTTTCTATTCTGGTTAAACAA
GCACTTAAAGATGCACATGCCTTGATCAAGAACGAGTGCAATACAAGTAACATAGAAGACATACTTCCATTAATGTCTCCTGACATATGCATTGTAAATTTCTACACAAC
GAATGGAAGACTGGGTTTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCTCGGTAGCGGACTACCCGTCGTTTCCTTTTCTTTAGGCAATTCAGCAGAGTTCTTGTATG
GAGATCAAAGAGATGTAGATAAAGCAGAGAAGATTGTATTGGAATCAGGTGATGTTCTGATATTTGGTGGTGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATA
CCAAAATCGACACCTAAGTTTTTGCTTCATCATACCGGTCTTCGTCCTGGGCGTTTAAATCTTACCTTTAGAAAGTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGTCGATCCGTACAGTTCCGATTTCGCCGTCCCCGTGGTCGAGTCATCTTCATCGGCTTTTATTCGCCGCTTCTCGATTTACCGGCGAACGCAGATTTCGATTGCT
TCGATTTCAGCGGATGGATTCGTTTGGCAGTTCGGCAAATAGCCATGCATTACCTGATTCTCTTGGTAGTTCTGGTGGCGGCAACGAGGAACATTTGCATAATGGAGATC
ATAATTCAAATGTGATAATGTTTGGAAGTATTCCTGTGTGTCTAAATTGCAGGGGAAATGGACAGAAATCTTTATCTCGGTTGTCTGTTAATCAATGTGATGATTTTGAG
TTGGGAAGAGATCAAAAAGGGATTCCTGCAAATGTACCGAGTTCTTACCATGATGATGAGTTTGCACCTGTTTCTAGACAAAATACTAAAAGAAGCCGGATAGATTTAGG
GTTGGAAAGAGGTTTGAAGAATAGTACAAGCTCATTTCAAGTGGAGAGGTTTGAATTGTTCAACAATGCCAGTCAGCTGGATGAATTATCTCTTCCTAATCAATTTGGGC
GGAAAAATGGACCATGTTATGTCCCTAATCGCCGGTCTGTGGATATCGGTCCGAAAAATTCTCTAGTTACAGACAATTTGCCTCCCTCTGGACCATTCAACATATGTTCC
TCTGAAAGAAGAGGTAATGCAAACCCCAGAACTTATTGGCAAGTTAAAGGCAAGGGTACTAATAGAGTGCTGAGGCCTGGAATGGTTTTACTGAAGCATTACATTACTTT
ACCTGAACAGATCAATATAGTGGAAACTTGTCAAAATCTTGGTGTTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGCGCAAAACTTAGGCTTCAGATGATGT
GTCTTGGATTGGACTGGGATCCTCAAACGAGGAAATATGAAAATAAGCGGGCTGTCGATGGTAATAGACCACCAGATATACCTTCTAAATTTTCTATTCTGGTTAAACAA
GCACTTAAAGATGCACATGCCTTGATCAAGAACGAGTGCAATACAAGTAACATAGAAGACATACTTCCATTAATGTCTCCTGACATATGCATTGTAAATTTCTACACAAC
GAATGGAAGACTGGGTTTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCTCGGTAGCGGACTACCCGTCGTTTCCTTTTCTTTAGGCAATTCAGCAGAGTTCTTGTATG
GAGATCAAAGAGATGTAGATAAAGCAGAGAAGATTGTATTGGAATCAGGTGATGTTCTGATATTTGGTGGTGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATA
CCAAAATCGACACCTAAGTTTTTGCTTCATCATACCGGTCTTCGTCCTGGGCGTTTAAATCTTACCTTTAGAAAGTATTAG
Protein sequenceShow/hide protein sequence
MLSIRTVPISPSPWSSHLHRLLFAASRFTGERRFRLLRFQRMDSFGSSANSHALPDSLGSSGGGNEEHLHNGDHNSNVIMFGSIPVCLNCRGNGQKSLSRLSVNQCDDFE
LGRDQKGIPANVPSSYHDDEFAPVSRQNTKRSRIDLGLERGLKNSTSSFQVERFELFNNASQLDELSLPNQFGRKNGPCYVPNRRSVDIGPKNSLVTDNLPPSGPFNICS
SERRGNANPRTYWQVKGKGTNRVLRPGMVLLKHYITLPEQINIVETCQNLGVGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYENKRAVDGNRPPDIPSKFSILVKQ
ALKDAHALIKNECNTSNIEDILPLMSPDICIVNFYTTNGRLGLHQDRDESKESLGSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSII
PKSTPKFLLHHTGLRPGRLNLTFRKY