; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g03840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g03840
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:2447774..2449876
RNA-Seq ExpressionMoc01g03840
SyntenyMoc01g03840
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5553390.1 hypothetical protein RHGRI_011316 [Rhododendron griersonianum]3.3e-1029.37Show/hide
Query:  AGPSSHGNKRHKCKHQGSF-KGKIDHKNKKPKQSQNNKGKRKRPMIRLKNK------KANIKCYNCHQKGHYARECTELKK-------------------
        A P S  N       Q S  K K    + K   +     KR + M R++ K      KA + CYNC+++GH+AR+CTE KK                   
Subjt:  AGPSSHGNKRHKCKHQGSF-KGKIDHKNKKPKQSQNNKGKRKRPMIRLKNK------KANIKCYNCHQKGHYARECTELKK-------------------

Query:  -----------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-------------
                                                       GIG+YKL++  GR L+LHD+L+APDIRRNLLS+  LL+LG             
Subjt:  -----------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-------------

Query:  ------------NGFIVLDVAEIISYNSASFSLITSSVDANIDMIMWHARLG
                    +GFI++D+ E   +N+ S +L+T+S + + D I+WHARLG
Subjt:  ------------NGFIVLDVAEIISYNSASFSLITSSVDANIDMIMWHARLG

KAG7548157.1 Integrase catalytic core [Arabidopsis suecica]2.9e-1425Show/hide
Query:  LNGDNYEIWAMKIQ----------------------------RDKEAYEAWKKENSLARITLLNSMDNDVMA----------------------------
        L GDNY+IW  K+Q                            RD+EAY AWK++NS+ARITLL+ M +D+M                             
Subjt:  LNGDNYEIWAMKIQ----------------------------RDKEAYEAWKKENSLARITLLNSMDNDVMA----------------------------

Query:  ------------------------TRLDNAEVY---------------VAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKK
                                T  DN + +                A P +  N           +G  D+K  +    +  K   KR     +  K
Subjt:  ------------------------TRLDNAEVY---------------VAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKK

Query:  ANIKCYNCHQKGHYARECTELKK-------------------------------------------------------------------GIGSYKLNMW
          +KCYNC   GH+A ECTE KK                                                                   GIG+ +L+M 
Subjt:  ANIKCYNCHQKGHYARECTELKK-------------------------------------------------------------------GIGSYKLNMW

Query:  NGRILLLHDMLYAPDIRRNLLSITVLLKLG-------------------------NGFIVLDV-AEIISYNSASFSLITSSVDANIDMIMWHARLG
         G+ L+LHD+LYAP+IRR+L+S+  LL+LG                         +GFIVLD      S N+  FS +TSS   NI++ +WHARLG
Subjt:  NGRILLLHDMLYAPDIRRNLLSITVLLKLG-------------------------NGFIVLDV-AEIISYNSASFSLITSSVDANIDMIMWHARLG

KAG7595359.1 Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis thaliana x Arabidopsis arenosa]3.7e-0931.54Show/hide
Query:  GNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK--------------------------------
        GNK +K  +Q   K     +NK  K     +GKR       K  K N+KCYNC   GH+ARECTE KK                                
Subjt:  GNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK--------------------------------

Query:  ----------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-------------------------N
                                          GIG+ +L+M  GR L+LHD+L AP+IRR+L+S+  LLKLG                         +
Subjt:  ----------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-------------------------N

Query:  GFIVLDVAEII--SYNSASFSLITSSVDANIDMIMWHARLG
        GFIVLD    I  S +   FS ITSS   N+DM +WH+RLG
Subjt:  GFIVLDVAEII--SYNSASFSLITSSVDANIDMIMWHARLG

OMO58188.1 Integrase, catalytic core [Corchorus capsularis]3.3e-1030.4Show/hide
Query:  VAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK-------------------------
        VA  +S      K K  G F GK D     PK+++    KR R     K  KA + CYNC ++GH+AR+CTE KK                         
Subjt:  VAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK-------------------------

Query:  -------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-----------------
                                                   GI +YKL M  GR LLLHD+LYAP+IRRNL S+  +L LG                 
Subjt:  -------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-----------------

Query:  --------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG
                NG +VLD+ +  SYN    S SL+ +  D   D + WHARLG
Subjt:  --------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG

OMO92476.1 hypothetical protein CCACVL1_06836 [Corchorus capsularis]1.8e-1627.51Show/hide
Query:  LNGDNYEIWAMKI----------------------------QRDKEAYEAWKKENSLARITLLNSMDNDVMAT---------------------------
        L+GDNY+IW  KI                            +RD  AY+ W+K++  AR T+L+SM ND++ +                           
Subjt:  LNGDNYEIWAMKI----------------------------QRDKEAYEAWKKENSLARITLLNSMDNDVMAT---------------------------

Query:  ------RLDNAEVYVAGPSSHG-------NKRH----KCKHQGSFKGKIDHKNKKPKQ---SQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTE
               L++  +  A P++ G       N R     K K  G F GK D     PK+   ++ +KGKR       K  KA + CYNC ++GH+AR+CTE
Subjt:  ------RLDNAEVYVAGPSSHG-------NKRH----KCKHQGSFKGKIDHKNKKPKQ---SQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTE

Query:  LKK--------------------------------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRN
         KK                                                                    GIG+YKL M  GR LLLHD+LYAP+IRRN
Subjt:  LKK--------------------------------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRN

Query:  LLSITVLLKLG-------------------------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG
        L S+  +L LG                         NG +VLD+ +  SYN    S SL+ +  D   D + WHARLG
Subjt:  LLSITVLLKLG-------------------------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG

TrEMBL top hitse value%identityAlignment
A0A1R3GJB7 Integrase, catalytic core1.6e-1030.4Show/hide
Query:  VAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK-------------------------
        VA  +S      K K  G F GK D     PK+++    KR R     K  KA + CYNC ++GH+AR+CTE KK                         
Subjt:  VAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK-------------------------

Query:  -------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-----------------
                                                   GI +YKL M  GR LLLHD+LYAP+IRRNL S+  +L LG                 
Subjt:  -------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-----------------

Query:  --------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG
                NG +VLD+ +  SYN    S SL+ +  D   D + WHARLG
Subjt:  --------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG

A0A1R3JCG0 Uncharacterized protein8.8e-1727.51Show/hide
Query:  LNGDNYEIWAMKI----------------------------QRDKEAYEAWKKENSLARITLLNSMDNDVMAT---------------------------
        L+GDNY+IW  KI                            +RD  AY+ W+K++  AR T+L+SM ND++ +                           
Subjt:  LNGDNYEIWAMKI----------------------------QRDKEAYEAWKKENSLARITLLNSMDNDVMAT---------------------------

Query:  ------RLDNAEVYVAGPSSHG-------NKRH----KCKHQGSFKGKIDHKNKKPKQ---SQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTE
               L++  +  A P++ G       N R     K K  G F GK D     PK+   ++ +KGKR       K  KA + CYNC ++GH+AR+CTE
Subjt:  ------RLDNAEVYVAGPSSHG-------NKRH----KCKHQGSFKGKIDHKNKKPKQ---SQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTE

Query:  LKK--------------------------------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRN
         KK                                                                    GIG+YKL M  GR LLLHD+LYAP+IRRN
Subjt:  LKK--------------------------------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRN

Query:  LLSITVLLKLG-------------------------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG
        L S+  +L LG                         NG +VLD+ +  SYN    S SL+ +  D   D + WHARLG
Subjt:  LLSITVLLKLG-------------------------NGFIVLDVAEIISYNSA--SFSLITSSVDANIDMIMWHARLG

A0A2N9HMX8 Uncharacterized protein3.6e-1028.69Show/hide
Query:  NAEVYVAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK--------------------
        N  VY+A  +S    R K K      G++      PK+++  K +  R     K  K+ + CYNC +KGH+A ECT+ KK                    
Subjt:  NAEVYVAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK--------------------

Query:  ----------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG--------------
                                                      GIG+YKL++  GR LLL+D+LY P+IRRNLLS+ VLL+LG              
Subjt:  ----------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG--------------

Query:  -----------NGFIVLDVAEIISYNSASFSLITSSVDANIDMIMWHARLG
                   +GF+VL+V      N  S  L+TSS + +   ++WHARLG
Subjt:  -----------NGFIVLDVAEIISYNSASFSLITSSVDANIDMIMWHARLG

A0A2N9IXR2 Uncharacterized protein4.1e-1433.48Show/hide
Query:  NAEVYVAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK--------------------
        N  VY+A  +SH   R K K      G++      PK+++  K +  R     K  K+ + CYNC +K H+ RECTE KK                    
Subjt:  NAEVYVAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK--------------------

Query:  ----------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLGNGFIVLDVAEIISY
                                                      GIG+YKL++  GR LLLHD+LYAP+IRRNLLS+ VLL+L  GF+VL+   I  Y
Subjt:  ----------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLGNGFIVLDVAEIISY

Query:  N-SASFSLITSSVDANIDMIMWHARLG
        N   S  L+TS  D +   I+WHARLG
Subjt:  N-SASFSLITSSVDANIDMIMWHARLG

A0A7N2KXD4 Uncharacterized protein3.8e-1229.53Show/hide
Query:  AEVYVAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQS----QNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK-----------------
        + V++A  SS    R K K      G+       PK++    +N +GK      R K  K+ + CYNC +KGH+AREC E KK                 
Subjt:  AEVYVAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQS----QNNKGKRKRPMIRLKNKKANIKCYNCHQKGHYARECTELKK-----------------

Query:  -------------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-----------
                                                         GIG+YKL +  GR LLLHD+LYAP+IRRNLLS+  LL+LG           
Subjt:  -------------------------------------------------GIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLG-----------

Query:  --------------NGFIVLDVAEIISYNSASFSLITSSVDANIDMIMWHARLG
                      N F++LD+     YN+ S   +TSS +A+ +  +WHARLG
Subjt:  --------------NGFIVLDVAEIISYNSASFSLITSSVDANIDMIMWHARLG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAATGGGGATAATTATGAGATCTGGGCCATGAAAATCCAGAGGGACAAAGAGGCTTATGAAGCCTGGAAGAAGGAAAACTCACTTGCTCGCATAACTTTG
CTAAACAGCATGGACAATGATGTCATGGCAACAAGGCTTGACAATGCTGAAGTTTATGTTGCTGGCCCTAGTTCACATGGGAACAAAAGGCACAAGTGTAAACAT
CAGGGAAGTTTTAAAGGGAAAATAGATCACAAGAATAAGAAACCCAAGCAAAGTCAGAATAACAAGGGTAAGAGGAAACGTCCCATGATTCGTCTTAAAAATAAG
AAAGCAAATATCAAGTGCTACAATTGTCACCAGAAGGGTCACTATGCTCGCGAGTGTACTGAACTCAAGAAGGGCATTGGTTCCTACAAGTTGAACATGTGGAAT
GGGCGCATTCTACTTCTGCATGATATGTTGTATGCACCTGATATTCGGCGAAATTTGCTTTCTATTACTGTCCTTCTCAAACTTGGTAATGGTTTTATTGTATTG
GATGTTGCTGAAATAATATCTTATAATTCTGCAAGTTTTTCCCTTATAACATCATCTGTTGATGCCAATATTGACATGATTATGTGGCATGCTAGATTAGGAATT
GCGCCTGGCGCAATTTTGGAGGACAATTTTGCCAAAACGTGCGTTTTGATTAATGAAGGGCAAGGAGGACATTTCCATGTGTTTGAATCCAAGTATAACCATGAG
TTTAAGTTAAATGGGAGTTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTAATGGGGATAATTATGAGATCTGGGCCATGAAAATCCAGAGGGACAAAGAGGCTTATGAAGCCTGGAAGAAGGAAAACTCACTTGCTCGCATAACTTTG
CTAAACAGCATGGACAATGATGTCATGGCAACAAGGCTTGACAATGCTGAAGTTTATGTTGCTGGCCCTAGTTCACATGGGAACAAAAGGCACAAGTGTAAACAT
CAGGGAAGTTTTAAAGGGAAAATAGATCACAAGAATAAGAAACCCAAGCAAAGTCAGAATAACAAGGGTAAGAGGAAACGTCCCATGATTCGTCTTAAAAATAAG
AAAGCAAATATCAAGTGCTACAATTGTCACCAGAAGGGTCACTATGCTCGCGAGTGTACTGAACTCAAGAAGGGCATTGGTTCCTACAAGTTGAACATGTGGAAT
GGGCGCATTCTACTTCTGCATGATATGTTGTATGCACCTGATATTCGGCGAAATTTGCTTTCTATTACTGTCCTTCTCAAACTTGGTAATGGTTTTATTGTATTG
GATGTTGCTGAAATAATATCTTATAATTCTGCAAGTTTTTCCCTTATAACATCATCTGTTGATGCCAATATTGACATGATTATGTGGCATGCTAGATTAGGAATT
GCGCCTGGCGCAATTTTGGAGGACAATTTTGCCAAAACGTGCGTTTTGATTAATGAAGGGCAAGGAGGACATTTCCATGTGTTTGAATCCAAGTATAACCATGAG
TTTAAGTTAAATGGGAGTTGTTAG
Protein sequenceShow/hide protein sequence
MLNGDNYEIWAMKIQRDKEAYEAWKKENSLARITLLNSMDNDVMATRLDNAEVYVAGPSSHGNKRHKCKHQGSFKGKIDHKNKKPKQSQNNKGKRKRPMIRLKNK
KANIKCYNCHQKGHYARECTELKKGIGSYKLNMWNGRILLLHDMLYAPDIRRNLLSITVLLKLGNGFIVLDVAEIISYNSASFSLITSSVDANIDMIMWHARLGI
APGAILEDNFAKTCVLINEGQGGHFHVFESKYNHEFKLNGSC