; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g36500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g36500
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCCHC-type domain-containing protein
Genome locationchr8:27100779..27101483
RNA-Seq ExpressionMoc08g36500
SyntenyMoc08g36500
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025314 - Domain of unknown function DUF4219


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037350.1 Integrase, catalytic core [Cucumis melo var. makuwa]2.6e-7870.42Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND
        MGDLQ+ G IKKLNTQNYKTWS+CM+SYLQGQDLW+VVGG+EV PPEDA  LKKW  KAGK  FAIK TIDEEMLEHI   ETPK  WDT  SL  K+ND
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND

Query:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT
        ARL+ L+N+LLS+ QR+ TINQYF KVK LC EISELDP + ISES MRR+IIHGLK +YRS IA +Q W +QPSL DLE++LA+QEA+AK+I E  +K+
Subjt:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT

Query:  NNEGTLFSGQRRG
        NN   LFSGQRRG
Subjt:  NNEGTLFSGQRRG

XP_021821223.1 uncharacterized protein LOC110762835, partial [Prunus avium]2.8e-7262.83Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPP--EDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR
        MGDL +VG I KLN +NY TW++CMESYLQGQDLW+VVGG++V  P  +++ AL+KWK KAGK MFA+K TI+EEMLEHIRKA+TPK AWDT A+L  KR
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPP--EDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR

Query:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL
        ND RL+LL+N+LLSVAQRD TI QYF+KVKS+CREIS+LDP A I ES ++R+IIHGL+P+YR  +A VQ WP QPSL + ENLLA+QEAMAK++    L
Subjt:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL

Query:  KTNNEGTLFSGQRRGQTKVGFKGDER
        K   E  L+S + +G  K  F   E+
Subjt:  KTNNEGTLFSGQRRGQTKVGFKGDER

XP_023732023.1 uncharacterized protein LOC111879824 [Lactuca sativa]3.1e-7163.55Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDAT--ALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR
        MGD+Q+VG IKKLN  NYKTW +CM+SYLQGQ LWEVVGG+E TPPE+    AL+KWK KAGK MFA+K TI+EEMLEHIR   TPK AWDT  +L  K+
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDAT--ALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR

Query:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL
        ND RL+LL+N+LLS++QRD TI QYF+KVKS+CREI+ELDP + I+E+ M+R+IIHGL+P+YRS +  VQ WP QPSL + ENLLA+QEAMAK++    L
Subjt:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL

Query:  KTNNEGTLFSGQRR
        K+  E    S  RR
Subjt:  KTNNEGTLFSGQRR

XP_023749568.1 uncharacterized protein LOC111897860 [Lactuca sativa]4.4e-7363.11Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDAT--ALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR
        MGDLQ+VG IKKLN  NYKTW +CM+SYLQGQDLWEVVGGSE TPPE+    AL+KWK KAGK MFA+K TI+EEMLEHIR   TPK AWDT  +L  K+
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDAT--ALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR

Query:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL
        ND RL+LL+N+LLS++QRD TI QYF+KVKS+CREI+ELDP + I+E+ M+R+IIHGL+P+YRS +  VQ WP QPSL + ENLLA+QEAMAK++    L
Subjt:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL

Query:  KTNNEGTLFSGQRRG---QTKVGFK
        K+  E    S  RR     +K G+K
Subjt:  KTNNEGTLFSGQRRG---QTKVGFK

XP_023756729.1 uncharacterized protein LOC111905270 [Lactuca sativa]2.4e-7159.57Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDAT--ALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR
        MGDLQ+VG IKKLN  NYKTW +CM+SYLQGQDLWEVVGGS+ TP E+     L+KWK KAGK MFA+K TI+EEMLE IR   TPK AWDT   L  K+
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDAT--ALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKR

Query:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL
        ND RL+LL+N+LLS++QRD TI QYF+KVKS+CREI+ELDP + I E+ M+R+IIHGL+P+YRS +  +Q WP+QPSL + ENLLANQEAMAK++    L
Subjt:  NDARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALL

Query:  KTNNEGTLFSGQRR---GQTKVGFKGDERTRKKQG
        K+  E    S  RR     +K G+K  ++ + +QG
Subjt:  KTNNEGTLFSGQRR---GQTKVGFKGDERTRKKQG

TrEMBL top hitse value%identityAlignment
A0A2N9ETQ8 Uncharacterized protein6.6e-7562.01Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND
        M +LQ VG +KKLN QNY TWS+CMESYLQGQDLWE+V GSE TPP++  AL+KWK KAGK MF IK +I+EEMLEHIR+A+TPK AWDT A+L  K+N+
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND

Query:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT
         RL+LL+N+L+S+AQR+ TI QYF KVKSLCREISELDP + ISES +RR+IIHGL+P+YRS I  VQ WP+QPSL +LENLLA+QEAM K+++   LK+
Subjt:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT

Query:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ
          E  LFS +   + K  F    + R  +
Subjt:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ

A0A2N9GDR3 CCHC-type domain-containing protein6.0e-7663.76Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND
        M +LQ VG +KKLN QNY TWS+CMESYLQ QDLWE+V GSE TP E+  AL+KWK KAGK MFAIK +I+EEMLEHIR+A+TPK AWDT A+L  K+N+
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND

Query:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT
         RL+LL+NKL+S+AQR+ TI QYF KVKSLCREISELDP + ISES +RR+IIHGLKP+YRS I  VQ WP+QPSL +LENLLANQ+AM K+++   LK+
Subjt:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT

Query:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ
          E  LFSG+  G+ K  F    + R  +
Subjt:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ

A0A2N9GV75 CCHC-type domain-containing protein3.9e-7562.45Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND
        M +LQ VG +KKLN QNY TWS+CMESYLQGQDLWE+V GSE TPP++  AL+KWK KAGK MFAIK +I+EEMLEHIR+A+TPK AWDT A+L  K+N+
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND

Query:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT
         RL+LL+N+L+S+AQ + TI QYF KVKSLCREISELDP + ISES +RR+IIHGL+P+YRS I  VQ WP+QPSL +LENLLA+QEAM K+++   LK+
Subjt:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT

Query:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ
          E  LFS +  G+ K  F    + R  +
Subjt:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ

A0A2N9H6C2 CCHC-type domain-containing protein4.1e-7763.76Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND
        M +LQ VG +KKLN QNY TWSSCMESYLQGQDLWE+V GSE TPP++  AL+KWK KAGK MFAIK +I+EEMLEHIR+A+TPK AWDT A+L  K+N+
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND

Query:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT
         RL+LL+N+L+S+AQ++ TI QYF KVKSLCREISELD  + ISES +RR+IIHGL+P+YRS IA VQ WP+QPSL +LENLLANQEAM K+++   LK+
Subjt:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT

Query:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ
          E  LFSG+  G+ K  F    + R  +
Subjt:  NNEGTLFSGQRRGQTKVGFKGDERTRKKQ

A0A5A7T1U9 Integrase, catalytic core1.3e-7870.42Show/hide
Query:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND
        MGDLQ+ G IKKLNTQNYKTWS+CM+SYLQGQDLW+VVGG+EV PPEDA  LKKW  KAGK  FAIK TIDEEMLEHI   ETPK  WDT  SL  K+ND
Subjt:  MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRND

Query:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT
        ARL+ L+N+LLS+ QR+ TINQYF KVK LC EISELDP + ISES MRR+IIHGLK +YRS IA +Q W +QPSL DLE++LA+QEA+AK+I E  +K+
Subjt:  ARLRLLKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKT

Query:  NNEGTLFSGQRRG
        NN   LFSGQRRG
Subjt:  NNEGTLFSGQRRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.0e-0723.31Show/hide
Query:  IKKL--NTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRNDARLRLLK
        I+KL  +  NY  W     S+L+    +  + G+   P   +   + W+     VM+ +  ++ +++LE +  AET    W+ L  + +   D ++  L+
Subjt:  IKKL--NTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRNDARLRLLK

Query:  NKLLSVAQRDTTINQYFNKVKSLCREISELDPV
         +L ++ Q   ++ +YF K+  +  E+SE  P+
Subjt:  NKLLSVAQRDTTINQYFNKVKSLCREISELDPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACCTTCAAATTGTTGGAAGAATCAAAAAGCTCAACACTCAAAACTACAAGACATGGTCTAGTTGCATGGAATCTTATCTCCAAGGCCAAGACTTATGG
GAAGTTGTGGGAGGCAGTGAAGTCACACCGCCTGAAGATGCTACAGCCTTGAAGAAATGGAAGACTAAGGCAGGGAAGGTTATGTTTGCTATTAAAATTACTATT
GATGAAGAAATGTTAGAGCATATTAGAAAAGCAGAGACTCCTAAAGTGGCATGGGATACGTTGGCCTCACTCGTCTTAAAGAGAAATGATGCAAGATTACGGCTT
TTGAAGAACAAGCTTCTATCAGTTGCTCAAAGAGATACGACAATTAATCAATATTTCAACAAGGTAAAATCTCTTTGCCGTGAAATTTCTGAATTAGATCCTGTT
GCTACTATTTCAGAATCAATAATGAGAAGAATGATTATTCATGGACTTAAACCTAAGTATAGAAGCATTATCGCTACTGTTCAAGATTGGCCAATCCAACCATCC
CTTACTGACCTAGAAAATTTACTTGCCAATCAAGAAGCAATGGCAAAGAAAATATTAGAGGCCTTGTTAAAGACGAATAATGAAGGGACACTCTTTAGTGGCCAA
AGAAGAGGTCAAACGAAAGTAGGATTTAAAGGAGATGAAAGAACTCGGAAAAAACAAGGAAGAGAGTCTACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACCTTCAAATTGTTGGAAGAATCAAAAAGCTCAACACTCAAAACTACAAGACATGGTCTAGTTGCATGGAATCTTATCTCCAAGGCCAAGACTTATGG
GAAGTTGTGGGAGGCAGTGAAGTCACACCGCCTGAAGATGCTACAGCCTTGAAGAAATGGAAGACTAAGGCAGGGAAGGTTATGTTTGCTATTAAAATTACTATT
GATGAAGAAATGTTAGAGCATATTAGAAAAGCAGAGACTCCTAAAGTGGCATGGGATACGTTGGCCTCACTCGTCTTAAAGAGAAATGATGCAAGATTACGGCTT
TTGAAGAACAAGCTTCTATCAGTTGCTCAAAGAGATACGACAATTAATCAATATTTCAACAAGGTAAAATCTCTTTGCCGTGAAATTTCTGAATTAGATCCTGTT
GCTACTATTTCAGAATCAATAATGAGAAGAATGATTATTCATGGACTTAAACCTAAGTATAGAAGCATTATCGCTACTGTTCAAGATTGGCCAATCCAACCATCC
CTTACTGACCTAGAAAATTTACTTGCCAATCAAGAAGCAATGGCAAAGAAAATATTAGAGGCCTTGTTAAAGACGAATAATGAAGGGACACTCTTTAGTGGCCAA
AGAAGAGGTCAAACGAAAGTAGGATTTAAAGGAGATGAAAGAACTCGGAAAAAACAAGGAAGAGAGTCTACATAA
Protein sequenceShow/hide protein sequence
MGDLQIVGRIKKLNTQNYKTWSSCMESYLQGQDLWEVVGGSEVTPPEDATALKKWKTKAGKVMFAIKITIDEEMLEHIRKAETPKVAWDTLASLVLKRNDARLRL
LKNKLLSVAQRDTTINQYFNKVKSLCREISELDPVATISESIMRRMIIHGLKPKYRSIIATVQDWPIQPSLTDLENLLANQEAMAKKILEALLKTNNEGTLFSGQ
RRGQTKVGFKGDERTRKKQGREST