; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g26590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g26590
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr4:19469299..19469993
RNA-Seq ExpressionMoc04g26590
SyntenyMoc04g26590
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022159413.1 uncharacterized protein LOC111025834 [Momordica charantia]1.3e-8474.89Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP GYQPPKFQHFDGKGNPKQHIAHFVETCENA                        DLE ETIESWEQLE+EFLNRFYSTK+TVSMMEL +S+QRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTSNESFGVNTTSPKLF
        VVEYINRWRALSLD KDRLTELS +ELCTQGM+  L+YILQ IKP  FEEL TRA+DMELSIA+RGSKD LV DLEKE+KSVE+TSNE FGVNT SPKLF
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTSNESFGVNTTSPKLF

Query:  SRIKGKRIEKQQENEKRWLSLIKRQEK
        S+IKGKRIEKQ+ N K WLSL +RQEK
Subjt:  SRIKGKRIEKQQENEKRWLSLIKRQEK

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]3.1e-7063.4Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP+GYQPPKFQ FDGKGNPKQH+AHFVETCENA                        DLE E+IESWEQLE+EFLNRFYST++TVSMMEL ++KQRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN
        V++YINRWRALSLD KDRLTELS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIASRG+KDFLV +++K+ K        V+ TS ES  VN
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN

Query:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK
        TT P  FS+ K  R+EK+ + +E+R L+L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]3.1e-7063.4Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP+GYQPPKFQ FDGKGNPKQH+AHFVETCENA                        DLE E+IESWEQLE+EFLNRFYST++TVSMMEL ++KQRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN
        V++YINRWRALSLD KDRLTELS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIASRG+KDFLV +++K+ K        V+ TS ES  VN
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN

Query:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK
        TT P  FS+ K  R+EK+ + +E+R L+L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]3.1e-7063.4Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP+GYQPPKFQ FDGKGNPKQH+AHFVETCENA                        DLE E+IESWEQLE+EFLNRFYST++TVSMMEL ++KQRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN
        V++YINRWRALSLD KDRLTELS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIASRG+KDFLV +++K+ K        V+ TS ES  VN
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN

Query:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK
        TT P  FS+ K  R+EK+ + +E+R L+L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK

XP_031742390.1 uncharacterized protein LOC116401672 [Cucumis sativus]6.9e-7062.98Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP+GYQPPKFQ FDGKGNPKQH+AHFVETCENA                        DLE E+IESWEQLE+EFLNRFYST++TVSMMEL ++KQRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN
        V++YINRWRALSLD KDRLTELS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIASRG+KDFLV +++K+ K        V+ T+ ES  VN
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN

Query:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK
        TT P  FS+ K  R+EK+ + +E+R L+L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQE-NEKRWLSLIKRQEK

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H5.5e-6560Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP GYQPPKFQ FDGKGNPKQH+AHF+ETCE A                        DLE E+I+SWEQLER+FLNRFYST++ VSM+EL ++KQRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERT-------SNESFGVN
        V++YINRWRALSLD KDRLTELS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIA+RG+ D LV ++ KE K V+ T       + E+  V+
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERT-------SNESFGVN

Query:  TTSPKLFSRIKGKRIEKQQ-ENEKRWLSLIKRQEK
        TT  KL S  K K++EK+Q E EKR  +L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQ-ENEKRWLSLIKRQEK

A0A5A7U8E4 Retrotransposon gag protein1.3e-6660.26Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP+GYQPPKFQ FD KGNPKQHIAHFVETCENA                        DLE E I+SW+QLE+EFLNRFYST++TVSMMEL ++KQRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN
        V++YINRWRALSLD KDRLTELS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIASRG+KDF V ++ K+ K        V+ +  ES  VN
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKS-------VERTSNESFGVN

Query:  TTSPKLFSRIKGKRIEKQQENEKRWLSLIKRQEK
        TT  K   R +G+  +K   +E+R L+L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQENEKRWLSLIKRQEK

A0A5A7V8K5 Ty3-gypsy retrotransposon protein2.9e-6659.4Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP+GYQPPKFQ FDGKGNPKQHIAHFVETCENA                        DLE + I+SWEQLE+EFLNRFYST++T+SMMEL ++KQ KGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTS-------NESFGVN
        V++YINRWRALSLD KDRLT+LS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIASRG+KDFLV +++K+ K    T         ES  VN
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTS-------NESFGVN

Query:  TTSPKLFSRIKGKRIEKQQENEKRWLSLIKRQEK
        T   K   R +G+  +K   +E+R L+L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQENEKRWLSLIKRQEK

A0A5D3C0W6 Ty3-gypsy retrotransposon protein1.5e-6558.97Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP+GYQPPKFQ FDGKGNPKQHIAHFVETCENA                        +LEL+ I+SWEQLE+EFLNRFYST++T+SMMEL ++KQ KGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTS-------NESFGVN
        V++YINRWRAL LD KDRLT+LS +E+CTQGM+ GL+YILQGIKP  FEEL TRA+DMELSIASRG+KDF V +++K+ K    T         ES  VN
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTS-------NESFGVN

Query:  TTSPKLFSRIKGKRIEKQQENEKRWLSLIKRQEK
        T   K   R +G+  +K   +EKR L+L +RQEK
Subjt:  TTSPKLFSRIKGKRIEKQQENEKRWLSLIKRQEK

A0A6J1DYQ8 uncharacterized protein LOC1110258346.3e-8574.89Show/hide
Query:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP
        MP GYQPPKFQHFDGKGNPKQHIAHFVETCENA                        DLE ETIESWEQLE+EFLNRFYSTK+TVSMMEL +S+QRKGEP
Subjt:  MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENA----------------------VYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEP

Query:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTSNESFGVNTTSPKLF
        VVEYINRWRALSLD KDRLTELS +ELCTQGM+  L+YILQ IKP  FEEL TRA+DMELSIA+RGSKD LV DLEKE+KSVE+TSNE FGVNT SPKLF
Subjt:  VVEYINRWRALSLDYKDRLTELSVIELCTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTSNESFGVNTTSPKLF

Query:  SRIKGKRIEKQQENEKRWLSLIKRQEK
        S+IKGKRIEKQ+ N K WLSL +RQEK
Subjt:  SRIKGKRIEKQQENEKRWLSLIKRQEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGTTGGGTACCAACCACCAAAGTTTCAGCATTTCGACGGAAAAGGTAATCCCAAACAACATATTGCTCACTTCGTCGAAACTTGTGAGAATGCTGTATAC
GATCTGGAGCTTGAAACGATCGAGAGTTGGGAACAGCTTGAAAGAGAATTTCTAAATCGCTTTTACAGTACGAAGAAAACCGTCAGCATGATGGAGCTCATGAGC
TCCAAACAAAGAAAAGGTGAACCGGTTGTCGAATATATCAACCGATGGAGAGCTTTAAGTCTTGATTACAAAGATAGGCTTACTGAACTATCTGTTATCGAATTA
TGCACTCAAGGTATGTACCGAGGACTTATTTACATTCTTCAAGGTATAAAACCTCACATCTTTGAAGAGCTAACAACTCGTGCTTATGATATGGAGCTAAGCATT
GCTAGTCGAGGAAGTAAGGATTTCTTAGTGTCGGACTTAGAGAAAGAAAACAAAAGTGTCGAGAGAACTTCAAATGAATCTTTTGGTGTAAACACGACTTCTCCC
AAGTTGTTTTCAAGAATAAAAGGAAAGAGAATTGAGAAGCAACAAGAAAATGAGAAAAGATGGCTAAGCTTGATAAAAAGGCAAGAAAAGTCTATTCTTTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTGTTGGGTACCAACCACCAAAGTTTCAGCATTTCGACGGAAAAGGTAATCCCAAACAACATATTGCTCACTTCGTCGAAACTTGTGAGAATGCTGTATAC
GATCTGGAGCTTGAAACGATCGAGAGTTGGGAACAGCTTGAAAGAGAATTTCTAAATCGCTTTTACAGTACGAAGAAAACCGTCAGCATGATGGAGCTCATGAGC
TCCAAACAAAGAAAAGGTGAACCGGTTGTCGAATATATCAACCGATGGAGAGCTTTAAGTCTTGATTACAAAGATAGGCTTACTGAACTATCTGTTATCGAATTA
TGCACTCAAGGTATGTACCGAGGACTTATTTACATTCTTCAAGGTATAAAACCTCACATCTTTGAAGAGCTAACAACTCGTGCTTATGATATGGAGCTAAGCATT
GCTAGTCGAGGAAGTAAGGATTTCTTAGTGTCGGACTTAGAGAAAGAAAACAAAAGTGTCGAGAGAACTTCAAATGAATCTTTTGGTGTAAACACGACTTCTCCC
AAGTTGTTTTCAAGAATAAAAGGAAAGAGAATTGAGAAGCAACAAGAAAATGAGAAAAGATGGCTAAGCTTGATAAAAAGGCAAGAAAAGTCTATTCTTTTGTAG
Protein sequenceShow/hide protein sequence
MPVGYQPPKFQHFDGKGNPKQHIAHFVETCENAVYDLELETIESWEQLEREFLNRFYSTKKTVSMMELMSSKQRKGEPVVEYINRWRALSLDYKDRLTELSVIEL
CTQGMYRGLIYILQGIKPHIFEELTTRAYDMELSIASRGSKDFLVSDLEKENKSVERTSNESFGVNTTSPKLFSRIKGKRIEKQQENEKRWLSLIKRQEKSILL