; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moctig00544g140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoctig00544g140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationtig00000544_pilon:63975..65318
RNA-Seq ExpressionMoctig00544g140
SyntenyMoctig00544g140
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7013176.1 unnamed protein product [Microthlaspi erraticum]1.9e-7294.59Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH
        EMLRGVENKRRSGDSRIGEAVEC TLDGESPVAESITSL SDPSSMGH
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH

CAD5336140.1 unnamed protein product [Arabidopsis thaliana]2.9e-7380.65Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
        EMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

CAD5336141.1 unnamed protein product [Arabidopsis thaliana]2.9e-7380.65Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
        EMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

CAD5336145.1 unnamed protein product [Arabidopsis thaliana]2.9e-7380.65Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
        EMLRGVENKRRSGDSRIG AV                    D +S G    RSASET+GDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHG-SRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica]1.9e-7294.59Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH
        EMLRGVENKRRSGDSRIGEAVEC TLDGESPVAESITSL SDPSSMGH
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH

TrEMBL top hitse value%identityAlignment
A0A2N9F5N4 Uncharacterized protein ycf686.2e-9072.22Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------
        EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH                                                    
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------

Query:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                       GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A2N9HJU4 Uncharacterized protein ycf686.2e-9072.22Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------
        EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH                                                    
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------

Query:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                       GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A2N9HJZ2 Uncharacterized protein ycf686.2e-9072.22Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------
        EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH                                                    
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------

Query:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                       GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A2N9HP93 Uncharacterized protein ycf686.2e-9072.22Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------
        EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH                                                    
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------

Query:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                       GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

A0A2N9I678 Uncharacterized protein ycf686.2e-9072.22Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------
        EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH                                                    
Subjt:  EMLRGVENKRRSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGH----------------------------------------------------

Query:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV
                       GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKE+
Subjt:  ---------------GSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07706.1 unknown protein2.6e-0856.45Show/hide
Query:  SETMGDKLHRR----------EGNSPDHQLRPLNDRSVIKEVGVQRQPGGL------PRSSH
        SET G ++ R           EGNSPDHQLRP N RSVIKEVGVQRQP  L      PRS H
Subjt:  SETMGDKLHRR----------EGNSPDHQLRPLNDRSVIKEVGVQRQPGGL------PRSSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATA
TGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAG
TTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGT
AGATCCGGAGATTCCCGAATAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGG
GCACGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGG
AGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATA
TGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAG
TTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGT
AGATCCGGAGATTCCCGAATAGGCGAAGCGGTGGAGTGCTGCACCCTAGATGGCGAGAGTCCAGTAGCCGAAAGCATCACTAGCTTACGCTCTGACCCGAGTAGCATGGG
GCACGGGTCAAGGTCGGCCAGTGAGACGATGGGGGATAAGCTTCATCGTCGAGAGGGAAACAGCCCGGATCACCAGCTAAGGCCCCTAAATGACCGCTCAGTGATAAAGG
AGGTAGGGGTGCAGAGACAGCCAGGAGGTTTGCCTAGAAGCAGCCACCCTTGA
Protein sequenceShow/hide protein sequence
MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKR
RSGDSRIGEAVECCTLDGESPVAESITSLRSDPSSMGHGSRSASETMGDKLHRREGNSPDHQLRPLNDRSVIKEVGVQRQPGGLPRSSHP