CuGenDBv2

Moc10g14100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc10g14100
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Transpos_assoc domain-containing protein
Genome location: chr10:10831375..10838684
RNA-Seq Expression: Moc10g14100
Synteny: Moc10g14100
Gene Ontology terms: NA
InterPro domains:
IPR015931 - Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha, subdomain 1/3
IPR029480 - Transposase-associated domain
IPR036008 - Aconitase, iron-sulfur domain


Homology
GenBank top hits | e value | %identity | Alignment
XP_020094140.1 uncharacterized protein LOC109714114 [Ananas comosus] | 1.2e-31 | 48.63
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGENDNDSGEEDIFEILED
        +DKSWM I NR   EY+ GV  FLDFAF  T+ + I CPCK+CNN + K R++VEADL++ G+V  YTRW +HGEE  +  +  +D DS  E + +ILE 
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGENDNDSGEEDIFEILED

Query:  HFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYPG
        HFG  N   W  +   +    +EEPNE A+KF++LL D  ++LYPG
Subjt:  HFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYPG

XP_023893229.1 uncharacterized protein LOC112005206 [Quercus suber] | 1.2e-33 | 29.87
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEE-------SL-SYGLGENDNDSGE
        M+ +W+++ NR   EYR GV+ FLDFAF HTT  N+I CPCKRCNN   KTRDDVE DLL  GI+PSYTRW  HGEE       SL S G  + D +S E
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEE-------SL-SYGLGENDNDSGE

Query:  EDIFEILEDHFGVF---------NTNNWTKKGES--SKHGYNEEPNEEASKFYR---------------------LLNDA-----EKELYP-GYHEWRLQ
        + + E++ED+   F         ++ N T KG+    K G     N + SK +R                     L+N+      E++  P  + +W+  
Subjt:  EDIFEILEDHFGVF---------NTNNWTKKGES--SKHGYNEEPNEEASKFYR---------------------LLNDA-----EKELYP-GYHEWRLQ

Query:  PE-----LFD--------------------------------------GHVEEE----LPP---------------------------EQLSGSDILEQS
        P+     LFD                                      G   EE    +PP                              S +DI+  S
Subjt:  PE-----LFD--------------------------------------GHVEEE----LPP---------------------------EQLSGSDILEQS

Query:  ELKSY-----------------------------------------------ASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKV
          KS+                                                +Q+ +G L M+ D++F +VFGPE+HGRVRGYGAG+TP++L GSSS +
Subjt:  ELKSY-----------------------------------------------ASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKV

Query:  -RDLERRLNESEHRLQESERQRKVDVQGLKDQMIELENRFE----------HRFQVMMAEMI
          DLE+RL ESE +  E++ +    V+ LK ++ +L++  E           RF+ MMA+++
Subjt:  -RDLERRLNESEHRLQESERQRKVDVQGLKDQMIELENRFE----------HRFQVMMAEMI

XP_023907200.1 uncharacterized protein LOC112018907 [Quercus suber] | 2.1e-33 | 29.87
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEE-------SL-SYGLGENDNDSGE
        M+ +W+++ NR   EYR GV+ FLDFAF HTT  N+I CPCKRCNN   KTRDDVE DLL  GI+PSYTRW  HGEE       SL S G  + D +S E
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEE-------SL-SYGLGENDNDSGE

Query:  EDIFEILEDHFGVF---------NTNNWTKKGES--SKHGYNEEPNEEASKFYR---------------------LLNDA-----EKELYP-GYHEWRLQ
        + + E++ED+   F         ++ N T KG+    K G     N + SK +R                     L+N+      E++  P  + +W+  
Subjt:  EDIFEILEDHFGVF---------NTNNWTKKGES--SKHGYNEEPNEEASKFYR---------------------LLNDA-----EKELYP-GYHEWRLQ

Query:  PE-----LFD--------------------------------------GHVEEE----LPP---------------------------EQLSGSDILEQS
        P+     LFD                                      G   EE    +PP                              S +DI+  S
Subjt:  PE-----LFD--------------------------------------GHVEEE----LPP---------------------------EQLSGSDILEQS

Query:  ELKSY-----------------------------------------------ASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKV
          KS+                                                +Q+ +G L M+ D++F +VFGPE+HGRVRGYGAG+TP++L GSSS +
Subjt:  ELKSY-----------------------------------------------ASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKV

Query:  -RDLERRLNESEHRLQESERQRKVDVQGLKDQMIELENRFE----------HRFQVMMAEMI
          DLE+RL ESE +  E++ +    V+ LK ++ +L++  E           RF+ MMA+++
Subjt:  -RDLERRLNESEHRLQESERQRKVDVQGLKDQMIELENRFE----------HRFQVMMAEMI

XP_038882387.1 uncharacterized protein LOC120073655 [Benincasa hispida] | 1.8e-61 | 76.71
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGENDNDSGEEDIFEILED
        MD+SWM+IGNR LQEYRNGVRSFLDFAFMH T+ QISCPCKRCNN +LKTRD+VE DLLMFGIVPSY RWTMHGEES +Y +GENDND+ +ED+FEILED
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGENDNDSGEEDIFEILED

Query:  HFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYPG
        HFG  +T+NW  K ES+KHGY+EEPNE AS+FY +LN AEKELY G
Subjt:  HFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYPG

XP_038904040.1 uncharacterized protein LOC120090445 [Benincasa hispida] | 1.5e-31 | 68.97
Query:  EQSELKSYASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEHRLQESERQRK-------VDVQGLKDQMIELEN
        + S+LKSYASQ++DGTLDMN D+ FV+VFGPEKHG V GYG GVTPSELFGSSS +RD ERRLNESE RLQE E+QRK       ++V+ LK Q+ ELEN
Subjt:  EQSELKSYASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEHRLQESERQRK-------VDVQGLKDQMIELEN

Query:  RFEHRFQVMMAEMIRK
        RFE RFQ MMAEM +K
Subjt:  RFEHRFQVMMAEMIRK

TrEMBL top hits | e value | %identity | Alignment
A0A2N9GUI6 Uncharacterized protein | 2.4e-35 | 53.38
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGE--NDNDSGEEDIFEI
        M+ +W++I NR L EYR GV+ FLDFAF HTT  ++I CPCKRCN+   K+RDDVEADL+  GIVP+YTRW  HGEE+ S       +D++S   D+ E+
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGE--NDNDSGEEDIFEI

Query:  LEDHFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYP
        +ED+FG  N  +W   GE S +G  EEPN++A+KF+RLL D E++LYP
Subjt:  LEDHFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYP

A0A2N9HCP8 Uncharacterized protein | 1.2e-34 | 52.7
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGE--NDNDSGEEDIFEI
        M+ +W++I NR L EYR GV+ FLDFAF HTT  ++I CPCKRCN+   K+RDDVEADL+   IVP+YTRW  HGEE+ S       +D++S   D+ E+
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTT-SNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGE--NDNDSGEEDIFEI

Query:  LEDHFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYP
        +ED+FG  N  +W   GE S +G  EEPN++A+KF+RLL D E++LYP
Subjt:  LEDHFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYP

A0A2N9HCP8 Uncharacterized protein | 7.4e-16 | 48.57
Query:  LKSYASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFG-SSSKVRDLERRLNESEHRLQESERQRKVDVQGLKDQMIELENRFEHRFQVMM
        L+S  +Q+ +G+L M+ D++FV+VFGPE+HGRVRGYGAGVT ++L+G SSSK+ DLE+RL ESE    E+  +    V+ L++Q+I+L++  E R   M 
Subjt:  LKSYASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFG-SSSKVRDLERRLNESEHRLQESERQRKVDVQGLKDQMIELENRFEHRFQVMM

Query:  AEMIR
         + IR
Subjt:  AEMIR

A0A2N9HCP8 Uncharacterized protein | 5.6e-32 | 48.63
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGENDNDSGEEDIFEILED
        +DKSWM I NR   EY+ GV  FLDFAF  T+ + I CPCK+CNN + K R++VEADL++ G+V  YTRW +HGEE  +  +  +D DS  E + +ILE 
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGLGENDNDSGEEDIFEILED

Query:  HFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYPG
        HFG  N   W  +   +    +EEPNE A+KF++LL D  ++LYPG
Subjt:  HFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYPG

A0A6P5MWG2 uncharacterized protein LOC107470678 isoform X1 | 2.0e-29 | 45.96
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHG-----EESLSYGLGENDNDSGEEDIF
        MDK WM+I NR L +YR GV  FLDFAF HT  ++I CPC +CNN + K+R+ VE DLL  GIV +YT W  HG     E   S     +D+   ++D+ 
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHG-----EESLSYGLGENDNDSGEEDIF

Query:  EILEDHFGVFNTNNWTKK----------GESSKHGYNEEPNEEASKFYRLLNDAEKELYPG
         +L DHFGV++     ++          GE  +  + EEPNE+A+KFY+LL+D+EKELYPG
Subjt:  EILEDHFGVFNTNNWTKK----------GESSKHGYNEEPNEEASKFYRLLNDAEKELYPG

A0A6P5NRP6 uncharacterized protein LOC107489344 | 2.0e-29 | 45.96
Query:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHG-----EESLSYGLGENDNDSGEEDIF
        MDK WM+I NR L +YR GV  FLDFAF HT  ++I CPC +CNN + K+R+ VE DLL  GIV +YT W  HG     E   S     +D+   ++D+ 
Subjt:  MDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHG-----EESLSYGLGENDNDSGEEDIF

Query:  EILEDHFGVFNTNNWTKK----------GESSKHGYNEEPNEEASKFYRLLNDAEKELYPG
         +L DHFGV++     ++          GE  +  + EEPNE+A+KFY+LL+D+EKELYPG
Subjt:  EILEDHFGVFNTNNWTKK----------GESSKHGYNEEPNEEASKFYRLLNDAEKELYPG

A0A6P5NRP6 uncharacterized protein LOC107489344 | 6.1e-10 | 40.19
Query:  ELPPEQLSGSDILEQSE-----LKSYASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKV-RDLERRLNESEHRLQESERQRKVDV
        EL   +++G    E+++     LK  +SQ+ +G L+MN  ++ V+VFGPE+HGRVRGYGAGVTP++L+G  S +  DL+ +L  +E + ++S    K++V
Subjt:  ELPPEQLSGSDILEQSE-----LKSYASQLMDGTLDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKV-RDLERRLNESEHRLQESERQRKVDV

Query:  QGLKDQM
        + LK ++
Subjt:  QGLKDQM

SwissProt top hits | e value | %identity | Alignment
B9L6Y8 3-isopropylmalate dehydratase large subunit | 6.2e-04 | 44.44
Query:  VLDSTILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV
        V+   +++G DSHTC  GA G FS+G+G+T+  F + T KS  K+
Subjt:  VLDSTILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV

O28316 3-isopropylmalate dehydratase large subunit 1 | 3.7e-04 | 52.5
Query:  ILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV
        +++G DSHTC+ GA G F++GIG+T+ GFVL   K   KV
Subjt:  ILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV

P81291 Isopropylmalate/citramalate isomerase large subunit | 4.8e-04 | 46.15
Query:  VLDSTILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV--VFYFN
        V    +++G DSHTC  GAFG F++GIG+T+   V  T K   KV    YFN
Subjt:  VLDSTILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV--VFYFN

Q94AR8 3-isopropylmalate dehydratase large subunit, chloroplastic | 5.3e-11 | 82.5
Query:  ILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV
        +LLGTDSHTC AGAFGQF++GIGNT+AGFVLGT K LLKV
Subjt:  ILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV

Arabidopsis top hits | e value | %identity | Alignment
AT4G13430.1 isopropyl malate isomerase large subunit 1 | 3.7e-12 | 82.5
Query:  ILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV
        +LLGTDSHTC AGAFGQF++GIGNT+AGFVLGT K LLKV
Subjt:  ILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLGTRKSLLKV


Sequences
CDS sequence
ATGCACATGGCTACACAAGGGGTGTGTGAAGAGGGTAGGGACGTGAGCAGTGGGGTGCAGGTGACACTCGGCAATGGGCACTGGATAGCGCACATGGCTGACGTAATCAA
TGGGCAAGCGTTGATTACGAGCAAGCGTTGGAGTGCGTGGGAAAAGGACTCTAGCACATTCTTTGCTCGTGCCAAGAATCAGAGATTTAAGCTCTCCGCCTACTTGTTCG
TTCTTGATTCGACAATCTTGCTCGGTACAGACTCTCATACCTGTATAGCTGGTGCATTTGGTCAATTTTCTAGTGGAATTGGTAACACTAATGCAGGATTTGTACTAGGC
ACTCGGAAATCACTGCTCAAGGTAGTCTTTTACTTTAACAGAGAAGTGTGTCACATTATTATCATCACAAGCTTGTTGTATAACATGGATAAAAGTTGGATGGAAATTGG
AAATCGAACATTGCAAGAATATAGAAATGGAGTAAGAAGCTTTCTAGATTTTGCATTTATGCATACGACATCCAATCAGATTTCTTGTCCATGTAAGAGATGCAATAATG
CAATACTTAAAACTCGAGATGACGTTGAAGCAGATTTGTTAATGTTTGGGATAGTTCCAAGTTACACTCGATGGACAATGCATGGTGAAGAAAGTTTATCATATGGATTA
GGAGAAAATGATAATGACTCTGGTGAAGAAGATATTTTTGAAATATTAGAGGATCACTTTGGTGTTTTTAACACCAACAATTGGACCAAGAAAGGAGAATCAAGTAAACA
TGGTTATAATGAAGAACCAAATGAGGAAGCTTCTAAGTTTTATAGATTGTTAAATGACGCAGAAAAGGAACTTTATCCTGGGTACCACGAATGGAGACTACAACCAGAGT
TGTTTGATGGACATGTTGAAGAAGAATTACCTCCTGAACAATTGTCTGGAAGTGATATACTTGAACAATCGGAACTCAAATCATATGCCTCTCAACTGATGGATGGTACA
CTTGACATGAACAACGACAAGATTTTTGTGCAAGTCTTTGGACCAGAGAAACATGGGCGTGTTCGAGGTTATGGAGCCGGTGTTACTCCTTCTGAGTTGTTTGGATCATC
TTCCAAAGTTCGTGATCTTGAGCGACGCCTTAACGAGTCAGAACATCGTCTTCAAGAATCTGAACGACAAAGAAAAGTTGACGTACAAGGACTCAAAGATCAAATGATTG
AATTAGAAAATCGTTTTGAGCATCGATTTCAAGTAATGATGGCTGAGATGATTCGAAAAAAATCCTAA
mRNA sequence
ATGCACATGGCTACACAAGGGGTGTGTGAAGAGGGTAGGGACGTGAGCAGTGGGGTGCAGGTGACACTCGGCAATGGGCACTGGATAGCGCACATGGCTGACGTAATCAA
TGGGCAAGCGTTGATTACGAGCAAGCGTTGGAGTGCGTGGGAAAAGGACTCTAGCACATTCTTTGCTCGTGCCAAGAATCAGAGATTTAAGCTCTCCGCCTACTTGTTCG
TTCTTGATTCGACAATCTTGCTCGGTACAGACTCTCATACCTGTATAGCTGGTGCATTTGGTCAATTTTCTAGTGGAATTGGTAACACTAATGCAGGATTTGTACTAGGC
ACTCGGAAATCACTGCTCAAGGTAGTCTTTTACTTTAACAGAGAAGTGTGTCACATTATTATCATCACAAGCTTGTTGTATAACATGGATAAAAGTTGGATGGAAATTGG
AAATCGAACATTGCAAGAATATAGAAATGGAGTAAGAAGCTTTCTAGATTTTGCATTTATGCATACGACATCCAATCAGATTTCTTGTCCATGTAAGAGATGCAATAATG
CAATACTTAAAACTCGAGATGACGTTGAAGCAGATTTGTTAATGTTTGGGATAGTTCCAAGTTACACTCGATGGACAATGCATGGTGAAGAAAGTTTATCATATGGATTA
GGAGAAAATGATAATGACTCTGGTGAAGAAGATATTTTTGAAATATTAGAGGATCACTTTGGTGTTTTTAACACCAACAATTGGACCAAGAAAGGAGAATCAAGTAAACA
TGGTTATAATGAAGAACCAAATGAGGAAGCTTCTAAGTTTTATAGATTGTTAAATGACGCAGAAAAGGAACTTTATCCTGGGTACCACGAATGGAGACTACAACCAGAGT
TGTTTGATGGACATGTTGAAGAAGAATTACCTCCTGAACAATTGTCTGGAAGTGATATACTTGAACAATCGGAACTCAAATCATATGCCTCTCAACTGATGGATGGTACA
CTTGACATGAACAACGACAAGATTTTTGTGCAAGTCTTTGGACCAGAGAAACATGGGCGTGTTCGAGGTTATGGAGCCGGTGTTACTCCTTCTGAGTTGTTTGGATCATC
TTCCAAAGTTCGTGATCTTGAGCGACGCCTTAACGAGTCAGAACATCGTCTTCAAGAATCTGAACGACAAAGAAAAGTTGACGTACAAGGACTCAAAGATCAAATGATTG
AATTAGAAAATCGTTTTGAGCATCGATTTCAAGTAATGATGGCTGAGATGATTCGAAAAAAATCCTAA
Protein sequence
MHMATQGVCEEGRDVSSGVQVTLGNGHWIAHMADVINGQALITSKRWSAWEKDSSTFFARAKNQRFKLSAYLFVLDSTILLGTDSHTCIAGAFGQFSSGIGNTNAGFVLG
TRKSLLKVVFYFNREVCHIIIITSLLYNMDKSWMEIGNRTLQEYRNGVRSFLDFAFMHTTSNQISCPCKRCNNAILKTRDDVEADLLMFGIVPSYTRWTMHGEESLSYGL
GENDNDSGEEDIFEILEDHFGVFNTNNWTKKGESSKHGYNEEPNEEASKFYRLLNDAEKELYPGYHEWRLQPELFDGHVEEELPPEQLSGSDILEQSELKSYASQLMDGT
LDMNNDKIFVQVFGPEKHGRVRGYGAGVTPSELFGSSSKVRDLERRLNESEHRLQESERQRKVDVQGLKDQMIELENRFEHRFQVMMAEMIRKKS