; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g37580 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g37580
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr6:28887712..28890373
RNA-Seq ExpressionMoc06g37580
SyntenyMoc06g37580
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia]7.0e-0928.62Show/hide
Query:  KRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRSSKASTDEASTSYLT--------------------
        K HGYP  Y+     SP   +  P A  SK+++ S++  +H++    S   +L QLLQSQL+T +   A TD  +TSY                      
Subjt:  KRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRSSKASTDEASTSYLT--------------------

Query:  ------------VFVSLPNQLRLLVEYSGSVSIIEDITLHNDMY------------------------------------TSKMIDNGSLLNGLYFHASS
                    V V+LPN+ R +VEYSG V + + +++H  +Y                                     SK ID G L +GLY   ++
Subjt:  ------------VFVSLPNQLRLLVEYSGSVSIIEDITLHNDMY------------------------------------TSKMIDNGSLLNGLYFHASS

Query:  AGDTTTASLVGINCHSTFNVHKPSFDICHNCFGHSSFKRISVLRTVL-----SFEDLF---------------LHVDPFPQLVLPMATDF
        +  ++T   +   C S F +   SFD+ HN  GH SF R+  L++VL     S ED                 + V+PFP LVLP   DF
Subjt:  AGDTTTASLVGINCHSTFNVHKPSFDICHNCFGHSSFKRISVLRTVL-----SFEDLF---------------LHVDPFPQLVLPMATDF

XP_030970454.1 uncharacterized protein LOC115990812 [Quercus lobata]4.0e-0429.02Show/hide
Query:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATD----NSSVNDAHSSYDVSSQCHELIQLL---QSQLA-------TSRSSKASTDEAS---
        GHT++  YK HG+P  +KF+           PS +H  +++     SS++  +S++ +  QC +L+ L     S LA       TS ++ AS++ AS   
Subjt:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATD----NSSVNDAHSSYDVSSQCHELIQLL---QSQLA-------TSRSSKASTDEAS---

Query:  -----TSYLTV---FVSLPNQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLY-FHASSAGDTTTASLVGINCHSTFN-----------VHKP
             TS +T     V LPN     V + G+V +   +TL ND+ + K I  G  ++GLY     S   T T+SL     +  FN           V   
Subjt:  -----TSYLTV---FVSLPNQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLY-FHASSAGDTTTASLVGINCHSTFN-----------VHKP

Query:  SFDICHNCFGHSSFKRISVLRTVL
        S  + H   GH S  ++ VL  V+
Subjt:  SFDICHNCFGHSSFKRISVLRTVL

XP_031259419.1 uncharacterized protein LOC116117549 [Pistacia vera]3.1e-0425.2Show/hide
Query:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRSSKASTDEASTSYLT------------
        GHT++  YK HGYPL YKF+ + + +      S S  ++  ++S      + + S+Q  +L+ +L + L++S     + D + T+ LT            
Subjt:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRSSKASTDEASTSYLT------------

Query:  ---------------------VFVSLP----------NQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHST
                              FVSL           N  ++LV ++G V +  D+ L + +Y  +   N   +N L          T   L  +N  S 
Subjt:  ---------------------VFVSLP----------NQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHST

Query:  FNVHKPSFDICHNCFGHSSFKRISVLRTVLSFEDLFLHVDPFPQLVLPMA
          V+  S  I HN  GH SFKR+  L+  L  +   LH    P  + P+A
Subjt:  FNVHKPSFDICHNCFGHSSFKRISVLRTVLSFEDLFLHVDPFPQLVLPMA

TrEMBL top hitse value%identityAlignment
A0A2N9EL72 Reverse transcriptase Ty1/copia-type domain-containing protein5.1e-0528.7Show/hide
Query:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQ---------------LATSRSSKASTDE----
        GHT+E  Y+ HG+P  YK +   S +   T   +++    DN   N A        Q  +L+  L SQ               L+ S +   +TD     
Subjt:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQ---------------LATSRSSKASTDE----

Query:  -----ASTSYLTVFVSLPNQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVHKPS-------FDICH
               TS L   V LPN    LV ++G++ + + +TL +D+   KMI  G    GLYF  SS  D+  +S +  N  S   V++P+        D+ H
Subjt:  -----ASTSYLTVFVSLPNQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVHKPS-------FDICH

Query:  NCFGHSSFKRISVLRTVLSFEDLFLHVDPF
           GH+S+  +  ++      DLFL V PF
Subjt:  NCFGHSSFKRISVLRTVLSFEDLFLHVDPF

A0A2N9GMU9 Integrase catalytic domain-containing protein3.9e-0527.11Show/hide
Query:  PLASNFS-PGHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRS--------------SKA
        PL S+    GHT++  YK HGYP  YKF+            +  HS    +++  + H  +    QC +L+ +L SQ + + S              + +
Subjt:  PLASNFS-PGHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRS--------------SKA

Query:  STDEASTSYLTVFVSLPNQLRLLVEYSGSVSIIED-ITLHN-------------DMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVHKP
        ST   + S ++ F++  + +  L ++S  +S I   I L N             D+   K I  G   NGLYF   S   T   S   +  H+  N + P
Subjt:  STDEASTSYLTVFVSLPNQLRLLVEYSGSVSIIED-ITLHN-------------DMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVHKP

Query:  SFDICHNCFGHSSFKRISVLRTVLS
        +FD+ H+  GH S  R+S+L+ V++
Subjt:  SFDICHNCFGHSSFKRISVLRTVLS

A0A2N9HKE6 Uncharacterized protein4.8e-1128.19Show/hide
Query:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQ--LATSRSSK---------------ASTDEAS
        GHT++  YK HGYP  YKF+            +  HS    ++ V D H  +   +QC +L+ +L SQ  LA+ +SS+               +ST   +
Subjt:  GHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQ--LATSRSSK---------------ASTDEAS

Query:  ----------------------TSYLTVFVSLPNQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVH
                              TS +  ++ LPN  ++L  + G+V +   + L +D+ T K I  G   NGLYF   S     ++S   +  H+  N +
Subjt:  ----------------------TSYLTVFVSLPNQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVH

Query:  KPSFDICHNCFGHSSFKRISVLRTVLS
         P FD+ H+  GH S  R+S+L+ V+S
Subjt:  KPSFDICHNCFGHSSFKRISVLRTVLS

A0A2N9HKX8 Integrase catalytic domain-containing protein3.9e-0527.11Show/hide
Query:  PLASNFS-PGHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRS--------------SKA
        PL S+    GHT++  YK HGYP  YKF+            +  HS    +++  + H  +    QC +L+ +L SQ + + S              + +
Subjt:  PLASNFS-PGHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRS--------------SKA

Query:  STDEASTSYLTVFVSLPNQLRLLVEYSGSVSIIED-ITLHN-------------DMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVHKP
        ST   + S ++ F++  + +  L ++S  +S I   I L N             D+   K I  G   NGLYF   S   T   S   +  H+  N + P
Subjt:  STDEASTSYLTVFVSLPNQLRLLVEYSGSVSIIED-ITLHN-------------DMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVHKP

Query:  SFDICHNCFGHSSFKRISVLRTVLS
        +FD+ H+  GH S  R+S+L+ V++
Subjt:  SFDICHNCFGHSSFKRISVLRTVLS

A0A6J1CR17 uncharacterized protein LOC1110134413.4e-0928.62Show/hide
Query:  KRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRSSKASTDEASTSYLT--------------------
        K HGYP  Y+     SP   +  P A  SK+++ S++  +H++    S   +L QLLQSQL+T +   A TD  +TSY                      
Subjt:  KRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRSSKASTDEASTSYLT--------------------

Query:  ------------VFVSLPNQLRLLVEYSGSVSIIEDITLHNDMY------------------------------------TSKMIDNGSLLNGLYFHASS
                    V V+LPN+ R +VEYSG V + + +++H  +Y                                     SK ID G L +GLY   ++
Subjt:  ------------VFVSLPNQLRLLVEYSGSVSIIEDITLHNDMY------------------------------------TSKMIDNGSLLNGLYFHASS

Query:  AGDTTTASLVGINCHSTFNVHKPSFDICHNCFGHSSFKRISVLRTVL-----SFEDLF---------------LHVDPFPQLVLPMATDF
        +  ++T   +   C S F +   SFD+ HN  GH SF R+  L++VL     S ED                 + V+PFP LVLP   DF
Subjt:  AGDTTTASLVGINCHSTFNVHKPSFDICHNCFGHSSFKRISVLRTVL-----SFEDLF---------------LHVDPFPQLVLPMATDF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATTCCGTCTTCTTTCTCCAGTTGATGAATCTCCGGTTGATAATTCGATTTCATCTGTTCTTCCACCTCTTGCTTCGAATTTTTCACCGGGACACACCATT
GAACTTTATTATAAACGACATGGTTATCCTCTAAGGTATAAGTTTCAAGGCTCGCGTTCGCCCTCTGTTTATACCACTACGCCTTCGGCCTCTCACTCGAAGGCA
ACTGATAATTCGTCAGTGAATGATGCTCATTCAAGTTATGATGTTTCCAGCCAATGCCACGAATTGATTCAATTACTGCAATCTCAATTGGCCACCTCTAGGTCT
TCGAAAGCCTCTACTGATGAGGCTTCCACATCTTATTTGACAGTATTTGTGAGTTTACCAAATCAATTGCGATTGTTGGTTGAATACAGTGGTTCAGTTTCCATC
ATAGAGGACATCACCCTACATAATGACATGTACACTTCGAAGATGATTGACAATGGTAGTTTACTCAATGGGCTGTATTTCCATGCTTCGTCAGCTGGGGACACA
ACTACTGCATCTTTAGTTGGTATCAATTGTCATTCTACCTTTAATGTACATAAGCCTTCTTTTGACATTTGTCATAATTGTTTTGGCCACTCATCCTTTAAACGC
ATTAGTGTTCTACGTACTGTTTTATCTTTTGAGGATCTCTTTCTACATGTTGATCCATTTCCACAATTGGTTCTCCCCATGGCTACTGATTTTATATCTATTGCT
GTTCTTATGTCTCGCACTTTCGATACTACTGGTACTATGAATTTTTCTGGTGATGCTACTGATGGCACCTCTGATTCTATCATTCTTGATATTGGTCCTTCCGAT
AATACCTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTATTCCGTCTTCTTTCTCCAGTTGATGAATCTCCGGTTGATAATTCGATTTCATCTGTTCTTCCACCTCTTGCTTCGAATTTTTCACCGGGACACACCATT
GAACTTTATTATAAACGACATGGTTATCCTCTAAGGTATAAGTTTCAAGGCTCGCGTTCGCCCTCTGTTTATACCACTACGCCTTCGGCCTCTCACTCGAAGGCA
ACTGATAATTCGTCAGTGAATGATGCTCATTCAAGTTATGATGTTTCCAGCCAATGCCACGAATTGATTCAATTACTGCAATCTCAATTGGCCACCTCTAGGTCT
TCGAAAGCCTCTACTGATGAGGCTTCCACATCTTATTTGACAGTATTTGTGAGTTTACCAAATCAATTGCGATTGTTGGTTGAATACAGTGGTTCAGTTTCCATC
ATAGAGGACATCACCCTACATAATGACATGTACACTTCGAAGATGATTGACAATGGTAGTTTACTCAATGGGCTGTATTTCCATGCTTCGTCAGCTGGGGACACA
ACTACTGCATCTTTAGTTGGTATCAATTGTCATTCTACCTTTAATGTACATAAGCCTTCTTTTGACATTTGTCATAATTGTTTTGGCCACTCATCCTTTAAACGC
ATTAGTGTTCTACGTACTGTTTTATCTTTTGAGGATCTCTTTCTACATGTTGATCCATTTCCACAATTGGTTCTCCCCATGGCTACTGATTTTATATCTATTGCT
GTTCTTATGTCTCGCACTTTCGATACTACTGGTACTATGAATTTTTCTGGTGATGCTACTGATGGCACCTCTGATTCTATCATTCTTGATATTGGTCCTTCCGAT
AATACCTTCTAG
Protein sequenceShow/hide protein sequence
MLFRLLSPVDESPVDNSISSVLPPLASNFSPGHTIELYYKRHGYPLRYKFQGSRSPSVYTTTPSASHSKATDNSSVNDAHSSYDVSSQCHELIQLLQSQLATSRS
SKASTDEASTSYLTVFVSLPNQLRLLVEYSGSVSIIEDITLHNDMYTSKMIDNGSLLNGLYFHASSAGDTTTASLVGINCHSTFNVHKPSFDICHNCFGHSSFKR
ISVLRTVLSFEDLFLHVDPFPQLVLPMATDFISIAVLMSRTFDTTGTMNFSGDATDGTSDSIILDIGPSDNTF