; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04560
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:3000575..3006811
RNA-Seq ExpressionMoc01g04560
SyntenyMoc01g04560
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-3247.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR
        KK   G+ +K + AAA                              A+K K K                                    GISSW Q++  
Subjt:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR

Query:  EMTLKVGTREVVSAVAVGELKLFTNKN
        EMT++VGT  VVSA+AVG L+L+  K+
Subjt:  EMTLKVGTREVVSAVAVGELKLFTNKN

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-3247.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR
        KK   G+ +K + AAA                              A+K K K                                    GISSW Q++  
Subjt:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR

Query:  EMTLKVGTREVVSAVAVGELKLFTNKN
        EMT++VGT  VVSA+AVG L+L   K+
Subjt:  EMTLKVGTREVVSAVAVGELKLFTNKN

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-3247.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR
        KK   G+ +K + AAA                              A+K K K                                    GISSW Q++  
Subjt:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR

Query:  EMTLKVGTREVVSAVAVGELKLFTNKN
        EMT++VGT  VVSA+AVG L+L   K+
Subjt:  EMTLKVGTREVVSAVAVGELKLFTNKN

TYJ96755.1 gag/pol protein [Cucumis melo var. makuwa]3.2e-3358.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFN+AE N  VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F RGS S T+S P SS +K  K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAAAQKGKVK----------------GISSWIQVDAREMTLKVGTREVVSAVAVGELKL
        KK   G+ +K + AAA    K K                GISSW Q++  EMT++VGT  VVSA+AVG L+L
Subjt:  KKNTAGKRSKPDSAAAAQKGKVK----------------GISSWIQVDAREMTLKVGTREVVSAVAVGELKL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-3247.35Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKN---------TAGKRSKPDSAA--------------------AAQKGKVK------------------------------------GISSWIQVDARE
        KK           A K +K   AA                     A+K K K                                    GISSW Q++  E
Subjt:  KKN---------TAGKRSKPDSAA--------------------AAQKGKVK------------------------------------GISSWIQVDARE

Query:  MTLKVGTREVVSAVAVGELKLFTNKN
        MT++VGT  VVSA+AVG L+L   K+
Subjt:  MTLKVGTREVVSAVAVGELKLFTNKN

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.3e-3247.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR
        KK   G+ +K + AAA                              A+K K K                                    GISSW Q++  
Subjt:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR

Query:  EMTLKVGTREVVSAVAVGELKLFTNKN
        EMT++VGT  VVSA+AVG L+L   K+
Subjt:  EMTLKVGTREVVSAVAVGELKLFTNKN

A0A5A7TU93 Gag/pol protein6.0e-3347.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR
        KK   G+ +K + AAA                              A+K K K                                    GISSW Q++  
Subjt:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR

Query:  EMTLKVGTREVVSAVAVGELKLFTNKN
        EMT++VGT  VVSA+AVG L+L+  K+
Subjt:  EMTLKVGTREVVSAVAVGELKLFTNKN

A0A5A7V4M1 Gag/pol protein2.3e-3247.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR
        KK   G+ +K + AAA                              A+K K K                                    GISSW Q++  
Subjt:  KKNTAGKRSKPDSAAA------------------------------AQKGKVK------------------------------------GISSWIQVDAR

Query:  EMTLKVGTREVVSAVAVGELKLFTNKN
        EMT++VGT  VVSA+AVG L+L   K+
Subjt:  EMTLKVGTREVVSAVAVGELKLFTNKN

A0A5D3BE74 Gag/pol protein1.6e-3358.14Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFN+AE N  VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F RGS S T+S P SS +K  K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKNTAGKRSKPDSAAAAQKGKVK----------------GISSWIQVDAREMTLKVGTREVVSAVAVGELKL
        KK   G+ +K + AAA    K K                GISSW Q++  EMT++VGT  VVSA+AVG L+L
Subjt:  KKNTAGKRSKPDSAAAAQKGKVK----------------GISSWIQVDAREMTLKVGTREVVSAVAVGELKL

A0A5D3CPJ6 Gag/pol protein2.3e-3247.35Show/hide
Query:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK
        MVHFNVAE NG VI E SQVSFILESLP+SFL FRSNA MNK+ YT TTLLNELQT++SLMK KGQ+GE NVATS ++F+RGS+S T+S PSSSG+K +K
Subjt:  MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATS-KRFNRGSSSRTRSAPSSSGSKTFK

Query:  KKN---------TAGKRSKPDSAA--------------------AAQKGKVK------------------------------------GISSWIQVDARE
        KK           A K +K   AA                     A+K K K                                    GISSW Q++  E
Subjt:  KKN---------TAGKRSKPDSAA--------------------AAQKGKVK------------------------------------GISSWIQVDARE

Query:  MTLKVGTREVVSAVAVGELKLFTNKN
        MT++VGT  VVSA+AVG L+L   K+
Subjt:  MTLKVGTREVVSAVAVGELKLFTNKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCACTTCAACGTGGCTGAGTCGAACGGGACCGTCATAGTCGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTACCATTCCGCAGCAA
TGCATTTATGAATAAGCTGGAGTACACTTATACCACGCTCCTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGACAAATGTTGCCA
CCTCAAAGAGGTTCAACAGAGGATCGTCCTCTAGAACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAATACTGCTGGTAAGAGGTCTAAACCT
GACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGGGAATTAGTTCCTGGATACAGGTTGACGCCAGAGAGATGACTCTCAAGGTCGGAACGAGAGAGGTCGTCTCAGC
TGTGGCGGTAGGGGAACTCAAGTTGTTTACAAACAAGAATATTCCAATACACAGCGAGTTGGAACGCTTTGAGGTGGAGCTGACGGTGGATGATGTCTCCACGTTGTTGG
CTCGACTCTCAGTGGAACCTAGCCTGAGACAGAGGATCATTGTTGCCCAAAAGGAAGACCCTAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGGGGATTTCACTCTC
TCGGATGAGTCCTTGTGCTATGAGGAGGTACCCGTCCAGATTTTTGCAAAAGAAACCAAGTTGTTGAGGAACCGGGCAATTCGCCTGGTTAAGCTTCTCCGACGCCTCGA
CGCAGCACCGGTGCTGCTAGCCACTCCGAACGGGACGACGACCCGTAAGAGCGGCGGACGAAGACACCACGAGCGGCGGCCGGCGACTCTTGACGCTCCAACGGCGGCGT
TTCATAACAGGTACAGCGGCGGCGCGACCCTCCTCGCAGCGGCGCACGGCGGCTGCTACAGCGACGGACCGGCAGCGCGACTCCCCGACGTTCCGACGGCCTGCAGCAGC
GGCGGCGCGCCCCTACGATCTACCACGAACGAGCACGGCCTCCCCCTCACGGCGGCGCAGGGCGAATGGAGGTACAGCAGCGGAATAGTGGCAGAATTGTACGTTACAGT
GGTGTTTAGGACGTTTGGCGGTGACCCACATCCGTTCGAAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCACTTCAACGTGGCTGAGTCGAACGGGACCGTCATAGTCGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTACCATTCCGCAGCAA
TGCATTTATGAATAAGCTGGAGTACACTTATACCACGCTCCTAAACGAGCTGCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGACAAATGTTGCCA
CCTCAAAGAGGTTCAACAGAGGATCGTCCTCTAGAACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAATACTGCTGGTAAGAGGTCTAAACCT
GACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGGGAATTAGTTCCTGGATACAGGTTGACGCCAGAGAGATGACTCTCAAGGTCGGAACGAGAGAGGTCGTCTCAGC
TGTGGCGGTAGGGGAACTCAAGTTGTTTACAAACAAGAATATTCCAATACACAGCGAGTTGGAACGCTTTGAGGTGGAGCTGACGGTGGATGATGTCTCCACGTTGTTGG
CTCGACTCTCAGTGGAACCTAGCCTGAGACAGAGGATCATTGTTGCCCAAAAGGAAGACCCTAGCTTGGCCAAAGGCTTTAGTATGGTGGGCCATGGGGATTTCACTCTC
TCGGATGAGTCCTTGTGCTATGAGGAGGTACCCGTCCAGATTTTTGCAAAAGAAACCAAGTTGTTGAGGAACCGGGCAATTCGCCTGGTTAAGCTTCTCCGACGCCTCGA
CGCAGCACCGGTGCTGCTAGCCACTCCGAACGGGACGACGACCCGTAAGAGCGGCGGACGAAGACACCACGAGCGGCGGCCGGCGACTCTTGACGCTCCAACGGCGGCGT
TTCATAACAGGTACAGCGGCGGCGCGACCCTCCTCGCAGCGGCGCACGGCGGCTGCTACAGCGACGGACCGGCAGCGCGACTCCCCGACGTTCCGACGGCCTGCAGCAGC
GGCGGCGCGCCCCTACGATCTACCACGAACGAGCACGGCCTCCCCCTCACGGCGGCGCAGGGCGAATGGAGGTACAGCAGCGGAATAGTGGCAGAATTGTACGTTACAGT
GGTGTTTAGGACGTTTGGCGGTGACCCACATCCGTTCGAAGCTTGA
Protein sequenceShow/hide protein sequence
MVHFNVAESNGTVIVEQSQVSFILESLPKSFLPFRSNAFMNKLEYTYTTLLNELQTYQSLMKCKGQEGETNVATSKRFNRGSSSRTRSAPSSSGSKTFKKKNTAGKRSKP
DSAAAAQKGKVKGISSWIQVDAREMTLKVGTREVVSAVAVGELKLFTNKNIPIHSELERFEVELTVDDVSTLLARLSVEPSLRQRIIVAQKEDPSLAKGFSMVGHGDFTL
SDESLCYEEVPVQIFAKETKLLRNRAIRLVKLLRRLDAAPVLLATPNGTTTRKSGGRRHHERRPATLDAPTAAFHNRYSGGATLLAAAHGGCYSDGPAARLPDVPTACSS
GGAPLRSTTNEHGLPLTAAQGEWRYSSGIVAELYVTVVFRTFGGDPHPFEA