; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022140 (gene) of Snake gourd v1 genome

Gene IDTan0022140
OrganismTrichosanthes anguina (Snake gourd v1)
Description2-oxoglutarate-dependent dioxygenase family protein isoform 1
Genome locationLG03:63074425..63077215
RNA-Seq ExpressionTan0022140
SyntenyTan0022140
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0035513 - oxidative RNA demethylation (biological process)
GO:0035552 - oxidative single-stranded DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008198 - ferrous iron binding (molecular function)
GO:0035515 - oxidative RNA demethylase activity (molecular function)
GO:0035516 - oxidative DNA demethylase activity (molecular function)
InterPro domainsIPR004574 - Alkylated DNA repair protein AlkB
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR027450 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like
IPR037151 - Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595567.1 hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia]2.0e-19773.8Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL
        M LIRTVP     WSNLLR+LLFAE         RLL+FQRMDSFGSS    ALPDS CYG SCGG+EE LH+RDHNS+VIMIG+IPV LN KGNEQ+SL
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL

Query:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV
        SRLSV KCDDF++  DQKGIPAN+PSSYHDDEFPPV RQNT RR+RIDLG    LKN+T S QM+                      +N PF   K +  
Subjt:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV

Query:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG
        +IGS+NSL T NLPP E F+ICF ERRGK K R  WQ K RDT+KVMEH D+A N  V+RPGMVLLKHYI LHEQ+NIV   QKLGLGPGGFYQPGYKDG
Subjt:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV
        AKLRLQMMCLGLDWDPQTRKY  KR ADGNKPPD+PPEF +LV +AL DA A IKNNG+T NIEDILPTMSPDICIVNFY+TSGRLGLHQDRDES++SLV
Subjt:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
         GLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL+H+GLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

KAG7027547.1 alkB, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-18575.12Show/hide
Query:  RALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSLSRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG-
        +ALPDS CYG SCGG+EE LH+RDHNS+VIMIG+IPV LN KGNEQ+SLSRLSV KCDDF++  DQKGIPAN+PSSYHDDEFPPV RQNT RR+RIDLG 
Subjt:  RALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSLSRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG-

Query:  ---LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFVNIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVD
           LKN+T S QM+                      +N PF   K +  +IGS+NS+ T NLPP E F+ICF ERRGK KPR  WQ K R+T+KVMEH D
Subjt:  ---LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFVNIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVD

Query:  KAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQ
        +A N  V+RPGMVLLKHYI LHEQ+NIV   QKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKY  KR ADGNKPPD+PPEF +LV +AL DA 
Subjt:  KAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQ

Query:  ASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGV
        A IKNNG+T NIEDILPTMSPDICIVNFY+TSGRLGLHQDRDES++SLV GLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGV
Subjt:  ASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGV

Query:  SSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        SSIIPKSTPKFLL+H+GLRPGRLNLTFRKY
Subjt:  SSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

XP_022924913.1 uncharacterized protein LOC111432318 [Cucurbita moschata]5.2e-19873.8Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL
        M LIRTVP     WSNLLR+LLFAE         RLL+FQR+DSFGSS    ALPDS CYG SCGG+EE LH+RDHNS+VIMIG+IPV LN KGNEQ+SL
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL

Query:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV
        SRLSV KCDDF++  DQKGIPAN+PSSYHDDEFPPV RQNT RR+RIDLG    LKN+T S QM+                      +N PF   K +  
Subjt:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV

Query:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG
        +IGS+NSL T NLPP E F+ICF ERRGK KPR  WQ K RDT+KVMEH D+A N  V+RPGMVLLKHYI LHEQ+NIV   QKLGLGPGGFYQPGYKDG
Subjt:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV
        AKLRLQMMCLGLDWDPQTRKY  KR ADGNKPPD+PPEF +LV +AL DA A IKNNG+T NIEDILPTMSPDICIVNFY+TSGRLGLHQDRDES++SLV
Subjt:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
         GLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL+H+GLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

XP_022966314.1 uncharacterized protein LOC111466008 [Cucurbita maxima]5.6e-19272.14Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL
        M LIRTVP     WSNLLRQLLFAE         RLL+FQRMDSFGSS    ALP+S CYG S GG+EE LH+RDHNS+VIMIG+IPV LN KGNEQ+SL
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL

Query:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV
        SRLSV KCDDF++  DQKGIPAN+PS YHDDEFPPV RQNT RR+RID G    LKN+T S QM                      K+N PF  +K +  
Subjt:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV

Query:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG
        +IGS+NSL T NLPP E F+ICF ERRGK KPR  WQ K RDT+KVMEHVD+A N  V+RPGMVLLKHYI LHEQ+NIV   QKLGLGPGGFYQPGYKDG
Subjt:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV
        AKLRLQMMCLGLDWDPQTRKY  KR ADGNKPPD+PPEF +LV +AL DA A IKNNG+   IEDILPTMSPDICIVNFY+T GRLGLHQDRDES++SLV
Subjt:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        SGLPVVSFSLGNSA FLYGD+R+VDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKS PKFLL+H+G RPG LNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

XP_023517205.1 uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo]1.3e-19673.8Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL
        M LIRTVP      SNLLR+LLFAE         RLL+FQRMDSFGSS    ALPDS CYG SCGG+EE LH+RDHNS+VIMIG+IPV LN KGNEQ+SL
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL

Query:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV
        SRLSV KCDDF++  DQK IPAN+PSSYHDDEFPPV RQNT RR+RIDLG    LKN+T S QM+                      +N PF  +K +  
Subjt:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV

Query:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG
        +IGS+NSL T NLPP E F+ICF ERRGK KPR  WQ K RDT+KVMEH D+A N  V+RPGMVLLKHYI LHEQ+NIV   QKLGLGPGGFYQPGYKDG
Subjt:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV
        AKLRLQMMCLGLDWDPQTRKY  KR ADGNKPPD+PPEF +LV +AL DA A IKNNG+T NIEDILPTMSPDICIVNFY+TSGRLGLHQDRDES++SLV
Subjt:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        SGLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL+H+GLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

TrEMBL top hitse value%identityAlignment
A0A1S4E4K6 uncharacterized protein LOC1035021832.2e-17367.77Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYG--CSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQK
        MF IRT+P+ PS  SN LR+LLF    FPG R   LL+FQRMDSF SSAN  A PDS C G  C CG  +EHL DRD+ SDVI +G   V LN K  E K
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYG--CSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQK

Query:  SLSRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQ
        SL+ LS  KCD  E+G D+ GI +N P SYH DEF PVSRQNT RRNRIDLG    LK+N RSFQ++  +  N+  +  E SLP  FGKKN  F+S KRQ
Subjt:  SLSRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQ

Query:  FVNIGSRNSLVTDNLPPFE-PFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGY
         ++IGS+ S+VTD+  PFE PF+ICF    G  K R  W+VK   TVK         + R+LRPGMVLLKHYIT  EQINIV  CQKLGLGPGGFYQP Y
Subjt:  FVNIGSRNSLVTDNLPPFE-PFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGY

Query:  KDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQ
        KDGAKLRL+MMCLGLDWDPQTR+Y++KR  DGNKPPDIPP F+ LVK ALKDA A IKN  N  N+EDILP+MSPDICI NFYTTSGRLGLHQDRDESK+
Subjt:  KDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQ

Query:  SLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        SL SGLPVVSFS+GN+AEFLYGD+RDV+KAEK+ LESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL H+GLRPGRLNLTFRKY
Subjt:  SLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

A0A5D3BFV0 2-oxoglutarate-dependent dioxygenase family protein isoform 12.7e-16069.68Show/hide
Query:  ALPDSPCYG--CSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSLSRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG
        A PDS C G  C CG  +EHL DRD+ SDVI +G   V LN K  E KSL+ LSV KCD  E+G D+ GI +N P SYH DE  PVSRQNT RRNRIDLG
Subjt:  ALPDSPCYG--CSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSLSRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG

Query:  ----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFVNIGSRNSLVTDNLPPFE-PFNICFSERRGKGKPRAHWQVKSRDTVKVMEH
            LK+N RSFQ++  + LN+  +  E SLP  FGKKN  F+S KRQ ++IGS+ S+VTD+ PPFE PF+ICF    G  K R  W+VK   TVK    
Subjt:  ----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFVNIGSRNSLVTDNLPPFE-PFNICFSERRGKGKPRAHWQVKSRDTVKVMEH

Query:  VDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKD
             + R+LRPGMVLLKHYIT  EQINIV  CQKLGLGPGGFYQPGYKDGAKLRL+MMCLGLDWDPQTR+YE+KR  DGNKPPDIPP F+ LVK ALKD
Subjt:  VDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKD

Query:  AQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFH
        A A IKN  N  N+EDILP+MSPDICI NFYTTSGRLGLHQDRDESK+SL SGLPVVSFS+GN+AEFLYGD+RDVDKAEK+ LESGDVLIFGGESRH+FH
Subjt:  AQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFH

Query:  GVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        GVSSIIPKSTPKFLL H+GLRPGRLNLTFRKY
Subjt:  GVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

A0A6J1CQI1 uncharacterized protein LOC1110138279.0e-18057.99Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL
        M +IRTVP+ P   SN L +LLFA  RFPGGR  RLL+F+RMDS  +SA               G   E+ H+R H+SD++M+G+IPV LN K  E++S 
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL

Query:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHD----------------------------------------------------------------------
        S  SV+K DDFE+GR++K  PANVP+SYHD                                                                      
Subjt:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHD----------------------------------------------------------------------

Query:  ---------------------------------------DEFPPVSRQNTNRRNRIDLGLK--NNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFY
                                               DEF PVSRQNT RRNR+DLG +  NNT SFQ++ F LLNN ++L+E S PNQFGKKN PFY
Subjt:  ---------------------------------------DEFPPVSRQNTNRRNRIDLGLK--NNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFY

Query:  SHKRQFVNIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFY
          K Q ++IGS+NSLV DNL PFEPF+IC  ERRG  KP AHWQ K RDTVKVMEHV +A+N RVLRPGMVLLK+YITLHEQ+NIV  CQ+LG+GPGGFY
Subjt:  SHKRQFVNIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFY

Query:  QPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRD
        +PGYKDGAKLRLQMMCLGLDWDPQTRKY  KRA DG+KPP+IPP+F +LV EALKDA A IKN  NT N+E ILP+MSPDICIVNFYTTSGRLGLHQDRD
Subjt:  QPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRD

Query:  ESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        ESK+SLVSGLPVVS SLG+SAEFLYGD+RDVDKAEK++LESGDVLIFGG+SRH+FHGVSSIIP STPKFLL+H+GLRPGRLNLTFRKY
Subjt:  ESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

A0A6J1EDT3 uncharacterized protein LOC1114323182.5e-19873.8Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL
        M LIRTVP     WSNLLR+LLFAE         RLL+FQR+DSFGSS    ALPDS CYG SCGG+EE LH+RDHNS+VIMIG+IPV LN KGNEQ+SL
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL

Query:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV
        SRLSV KCDDF++  DQKGIPAN+PSSYHDDEFPPV RQNT RR+RIDLG    LKN+T S QM+                      +N PF   K +  
Subjt:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV

Query:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG
        +IGS+NSL T NLPP E F+ICF ERRGK KPR  WQ K RDT+KVMEH D+A N  V+RPGMVLLKHYI LHEQ+NIV   QKLGLGPGGFYQPGYKDG
Subjt:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV
        AKLRLQMMCLGLDWDPQTRKY  KR ADGNKPPD+PPEF +LV +AL DA A IKNNG+T NIEDILPTMSPDICIVNFY+TSGRLGLHQDRDES++SLV
Subjt:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
         GLPVVSFSLGNSAEFLYGDQRDVDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL+H+GLRPGRLNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

A0A6J1HTF0 uncharacterized protein LOC1114660082.7e-19272.14Show/hide
Query:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL
        M LIRTVP     WSNLLRQLLFAE         RLL+FQRMDSFGSS    ALP+S CYG S GG+EE LH+RDHNS+VIMIG+IPV LN KGNEQ+SL
Subjt:  MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSL

Query:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV
        SRLSV KCDDF++  DQKGIPAN+PS YHDDEFPPV RQNT RR+RID G    LKN+T S QM                      K+N PF  +K +  
Subjt:  SRLSVDKCDDFEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLG----LKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFV

Query:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG
        +IGS+NSL T NLPP E F+ICF ERRGK KPR  WQ K RDT+KVMEHVD+A N  V+RPGMVLLKHYI LHEQ+NIV   QKLGLGPGGFYQPGYKDG
Subjt:  NIGSRNSLVTDNLPPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDG

Query:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV
        AKLRLQMMCLGLDWDPQTRKY  KR ADGNKPPD+PPEF +LV +AL DA A IKNNG+   IEDILPTMSPDICIVNFY+T GRLGLHQDRDES++SLV
Subjt:  AKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLV

Query:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        SGLPVVSFSLGNSA FLYGD+R+VDKA KI+LESGDVLIFGGESRHIFHGVSSIIPKS PKFLL+H+G RPG LNLTFRKY
Subjt:  SGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

SwissProt top hitse value%identityAlignment
B8GWW6 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog7.2e-1734.52Show/hide
Query:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGR
        P   Y+  Y  G  + + M  LG L W    R  +Y  +    G   PD+PP        AL D    + +           P   PD C+VN Y    R
Subjt:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGR

Query:  LGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        S L P  GR+NLT R+
Subjt:  LGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRP--GRLNLTFRK

O60066 Alpha-ketoglutarate-dependent dioxygenase abh19.8e-1432.92Show/hide
Query:  RLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGL
        +L+ + LG  +D  T++Y      D +K P  P +    V++ +K++              D L     +  IVNFY+    L  H   DES++ L   L
Subjt:  RLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGL

Query:  PVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL
        P++S S+G    +L G +   +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Subjt:  PVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL

P05050 Alpha-ketoglutarate-dependent dioxygenase AlkB5.7e-1437.17Show/hide
Query:  PTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHS
        P   PD C++N Y    +L LHQD+DE         P+VS SLG  A F +G  +  D  ++++LE GDV+++GGESR  +HG+  +     P  +    
Subjt:  PTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHS

Query:  GLRPGRLNLTFRK
             R NLTFR+
Subjt:  GLRPGRLNLTFRK

P0CAT7 Alpha-ketoglutarate-dependent dioxygenase AlkB homolog7.2e-1734.52Show/hide
Query:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGR
        P   Y+  Y  G  + + M  LG L W    R  +Y  +    G   PD+PP        AL D    + +           P   PD C+VN Y    R
Subjt:  PGGFYQPGYKDGAKLRLQMMCLG-LDWDPQTR--KYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGR

Query:  LGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRP--GRLNLTFRK
        +GLHQDRDE+        PV+S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S        S L P  GR+NLT R+
Subjt:  LGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRP--GRLNLTFRK

P37462 Alpha-ketoglutarate-dependent dioxygenase AlkB5.7e-1437.07Show/hide
Query:  TMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSG
        +  PD C++N Y    +L LHQD+DE         P+VS SLG  A F +G  R  D  ++I+LE GD++++GGESR  +HG+  +            +G
Subjt:  TMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSG

Query:  LRP----GRLNLTFRK
          P     R NLTFR+
Subjt:  LRP----GRLNLTFRK

Arabidopsis top hitse value%identityAlignment
AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein5.7e-0939.76Show/hide
Query:  PDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSI
        P+  IVN++     LG H D  E+  S     P+VS SLG  A FL G +   D    + L SGDV++  GE+R  FHG+  I
Subjt:  PDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSI

AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein2.7e-6753.57Show/hide
Query:  KAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQ
        K  +  V+RPGMVLLK+Y++++ Q+ IVN C++LGLG GGFYQPG++DG  L L+MMCLG +WD QTR+Y   R  DG+ PP IP EF+ LV++A+K+++
Subjt:  KAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQ

Query:  ASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQ---------------------DRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKI
        + +  N N     D +P + PDIC+VNFYT++G+LGLHQ                     D+ ESK+SL  GLP+VSFS+G+SAEFLYGDQ+DVDKA+ +
Subjt:  ASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQ---------------------DRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKI

Query:  VLESGDVLIFGGESRHIFHGVSSI
        +LESGDVLIFG  SR++FHGV SI
Subjt:  VLESGDVLIFGGESRHIFHGVSSI

AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein4.3e-8160Show/hide
Query:  KAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQ
        K  +  V+RPGMVLLK+Y+++++Q+ IVN C++LGLG GGFYQPGY+D AKL L+MMCLG +WDP+T +Y   R  DG+  P IP EF   V++A+K++Q
Subjt:  KAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQ

Query:  ASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGV
        +   +N       D +P M PDICIVNFY+++GRLGLHQD+DES+ S+  GLPVVSFS+G+SAEFLYGDQRD DKAE + LESGDVL+FGG SR +FHGV
Subjt:  ASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGV

Query:  SSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
         SI   + PK LL+ + LRPGRLNLTFR+Y
Subjt:  SSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

AT5G01780.1 2-oxoglutarate-dependent dioxygenase family protein3.6e-8055.97Show/hide
Query:  PPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLD
        PP  PF+IC S           W +         E V+ +  ++V+RPGMVLLK ++T   Q++IV  C++LG+ P GFYQPGY  G+KL LQMMCLG +
Subjt:  PPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLD

Query:  WDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNS
        WDPQT KY      D +K P+IP  F VLV++A+++A A I     T + E ILP MSPDICIVNFY+ +GRLGLHQDRDES++S+  GLP+VSFS+G+S
Subjt:  WDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNS

Query:  AEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        AEFLYG++RDV++A+ ++LESGDVLIFGGESR IFHGV SIIP S P  LL  S LR GRLNLTFR +
Subjt:  AEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY

AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein3.6e-8055.97Show/hide
Query:  PPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLD
        PP  PF+IC S           W +         E V+ +  ++V+RPGMVLLK ++T   Q++IV  C++LG+ P GFYQPGY  G+KL LQMMCLG +
Subjt:  PPFEPFNICFSERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLD

Query:  WDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNS
        WDPQT KY      D +K P+IP  F VLV++A+++A A I     T + E ILP MSPDICIVNFY+ +GRLGLHQDRDES++S+  GLP+VSFS+G+S
Subjt:  WDPQTRKYESKRAADGNKPPDIPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNS

Query:  AEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY
        AEFLYG++RDV++A+ ++LESGDVLIFGGESR IFHGV SIIP S P  LL  S LR GRLNLTFR +
Subjt:  AEFLYGDQRDVDKAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTGATCCGTACAGTTCCTGTCTTGCCGTCGCTGTGGTCGAATCTTCTTCGTCAGCTTTTGTTCGCCGAGTTTCGATTTCCCGGCGGACGCCGTCCTCGATTGCT
TCGATTTCAGCGGATGGATTCTTTTGGCAGTTCGGCAAATGGCCGTGCATTACCTGATTCACCATGTTATGGTTGTTCATGTGGCGGCAGCGAGGAACATTTACATGATA
GAGATCATAATTCAGATGTGATAATGATAGGAAAGATTCCTGTGTGTCTAAATTGCAAGGGAAATGAACAGAAATCTTTATCTCGGTTGTCTGTTGATAAATGTGATGAT
TTCGAGATGGGAAGAGATCAAAAAGGGATTCCTGCAAATGTACCAAGTTCTTACCATGATGATGAGTTTCCACCTGTTTCTAGACAAAACACCAACAGAAGAAACCGCAT
AGATCTAGGTTTGAAGAATAATACAAGGTCGTTTCAAATGGATATGTTTCAATTGTTAAACAATACCAATAAGCTGGAAGAATTATCTTTGCCTAATCAGTTTGGGAAGA
AAAATGGACCATTTTATTCCCATAAACGCCAGTTTGTGAATATCGGTTCCAGAAATTCTCTAGTTACTGACAATTTACCTCCCTTTGAACCATTCAATATATGTTTCTCT
GAAAGAAGAGGTAAAGGAAAACCCCGAGCTCATTGGCAGGTTAAAAGCAGGGACACTGTGAAAGTCATGGAGCATGTCGATAAAGCTGCAAATAATAGAGTGTTGAGGCC
TGGAATGGTTTTACTGAAGCATTACATTACTCTACATGAACAGATCAATATAGTGAACATGTGTCAAAAGCTTGGTCTTGGTCCAGGGGGCTTTTACCAGCCTGGTTATA
AAGATGGTGCAAAACTTAGGCTTCAGATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGGAAATATGAAAGTAAGCGGGCTGCCGATGGTAATAAACCACCAGAT
ATACCTCCTGAATTTACAGTTCTTGTTAAAGAAGCACTTAAAGATGCACAGGCCTCGATCAAGAACAATGGCAATACAATTAATATAGAAGACATACTTCCAACAATGTC
TCCTGACATATGCATTGTGAATTTCTACACAACGAGTGGAAGACTGGGTCTGCATCAGGATCGTGATGAAAGCAAACAGAGTCTCGTCAGTGGACTACCCGTTGTTTCCT
TTTCTTTAGGCAATTCAGCAGAATTCTTGTATGGAGATCAAAGAGATGTAGATAAAGCAGAGAAGATTGTATTGGAATCAGGTGATGTTCTGATATTTGGTGGAGAATCT
AGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCAACACCTAAGTTTTTGCTTGAGCATTCTGGTCTTCGTCCTGGGCGTCTAAATCTTACCTTTAGAAAGTA
TTAA
mRNA sequenceShow/hide mRNA sequence
GGGCTCCATAATAGGTCGGACTAGAAAAATTCCTATAATTTGGTGATAAACTCGACTTCCAAGTTCCAATTGAAAACACATCACTTTCATCGCAGGAGATGGTCACTCAC
AGCTGTGAAAATGGCCGAAATCGTTACTCCTTCCGCTTCCCGGTGCAGTCTCCTCGGCCAACACGGCGGCGACCGAAACTTGCGGATTAAGGTTACAAAATCATCATTTT
GTCAATCCGATCCTCCTCCGACGGTTCCATTTCCTTCTCCTTTTCCGACTTTTGGATTCTTTCTCAAATTTTGAAACCTATAAGTATCTCCGTAGCATCCAAAGTTCCCA
ATGTTTTTGATCCGTACAGTTCCTGTCTTGCCGTCGCTGTGGTCGAATCTTCTTCGTCAGCTTTTGTTCGCCGAGTTTCGATTTCCCGGCGGACGCCGTCCTCGATTGCT
TCGATTTCAGCGGATGGATTCTTTTGGCAGTTCGGCAAATGGCCGTGCATTACCTGATTCACCATGTTATGGTTGTTCATGTGGCGGCAGCGAGGAACATTTACATGATA
GAGATCATAATTCAGATGTGATAATGATAGGAAAGATTCCTGTGTGTCTAAATTGCAAGGGAAATGAACAGAAATCTTTATCTCGGTTGTCTGTTGATAAATGTGATGAT
TTCGAGATGGGAAGAGATCAAAAAGGGATTCCTGCAAATGTACCAAGTTCTTACCATGATGATGAGTTTCCACCTGTTTCTAGACAAAACACCAACAGAAGAAACCGCAT
AGATCTAGGTTTGAAGAATAATACAAGGTCGTTTCAAATGGATATGTTTCAATTGTTAAACAATACCAATAAGCTGGAAGAATTATCTTTGCCTAATCAGTTTGGGAAGA
AAAATGGACCATTTTATTCCCATAAACGCCAGTTTGTGAATATCGGTTCCAGAAATTCTCTAGTTACTGACAATTTACCTCCCTTTGAACCATTCAATATATGTTTCTCT
GAAAGAAGAGGTAAAGGAAAACCCCGAGCTCATTGGCAGGTTAAAAGCAGGGACACTGTGAAAGTCATGGAGCATGTCGATAAAGCTGCAAATAATAGAGTGTTGAGGCC
TGGAATGGTTTTACTGAAGCATTACATTACTCTACATGAACAGATCAATATAGTGAACATGTGTCAAAAGCTTGGTCTTGGTCCAGGGGGCTTTTACCAGCCTGGTTATA
AAGATGGTGCAAAACTTAGGCTTCAGATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGGAAATATGAAAGTAAGCGGGCTGCCGATGGTAATAAACCACCAGAT
ATACCTCCTGAATTTACAGTTCTTGTTAAAGAAGCACTTAAAGATGCACAGGCCTCGATCAAGAACAATGGCAATACAATTAATATAGAAGACATACTTCCAACAATGTC
TCCTGACATATGCATTGTGAATTTCTACACAACGAGTGGAAGACTGGGTCTGCATCAGGATCGTGATGAAAGCAAACAGAGTCTCGTCAGTGGACTACCCGTTGTTTCCT
TTTCTTTAGGCAATTCAGCAGAATTCTTGTATGGAGATCAAAGAGATGTAGATAAAGCAGAGAAGATTGTATTGGAATCAGGTGATGTTCTGATATTTGGTGGAGAATCT
AGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCAACACCTAAGTTTTTGCTTGAGCATTCTGGTCTTCGTCCTGGGCGTCTAAATCTTACCTTTAGAAAGTA
TTAAAATGCTGCTTGTCCGTTTTCTTTTTTTATTTTTGTCTCATCGTTTGTCCTGTACAAGCGAATGGCTGTTGTATTTGTTCATTACTTGATTATCTAAATGGATATTA
TAATTCTAGAATGTAATTATTATGTTACTAACTATCCTAACTATTAGTAGTCAATAAGGGCCAAGAAAATAAAAGAGCTAAGAGGGAATGAGCTCAAATCATAGCCACCT
ACCTATGATTTAATATCTCTAGTTTTTTTGTTGTCCCTGAGAATGGTTGAGGTGGTTGAGGTGCGCG
Protein sequenceShow/hide protein sequence
MFLIRTVPVLPSLWSNLLRQLLFAEFRFPGGRRPRLLRFQRMDSFGSSANGRALPDSPCYGCSCGGSEEHLHDRDHNSDVIMIGKIPVCLNCKGNEQKSLSRLSVDKCDD
FEMGRDQKGIPANVPSSYHDDEFPPVSRQNTNRRNRIDLGLKNNTRSFQMDMFQLLNNTNKLEELSLPNQFGKKNGPFYSHKRQFVNIGSRNSLVTDNLPPFEPFNICFS
ERRGKGKPRAHWQVKSRDTVKVMEHVDKAANNRVLRPGMVLLKHYITLHEQINIVNMCQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRKYESKRAADGNKPPD
IPPEFTVLVKEALKDAQASIKNNGNTINIEDILPTMSPDICIVNFYTTSGRLGLHQDRDESKQSLVSGLPVVSFSLGNSAEFLYGDQRDVDKAEKIVLESGDVLIFGGES
RHIFHGVSSIIPKSTPKFLLEHSGLRPGRLNLTFRKY