; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g19910 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g19910
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr7:14299099..14311025
RNA-Seq ExpressionMoc07g19910
SyntenyMoc07g19910
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]1.4e-4443.53Show/hide
Query:  IEDAVDNSPKESYSGILRNEVPNHAYSLGLLRDDINREVIAYAASTFYTFNLVITEPEIEASKFELK-----------LVIF------------------
        IE+ VD  P  +      +EV   + ++ LL   I+RE+ AYAA TFY FN VITE EI A KFELK           L +F                  
Subjt:  IEDAVDNSPKESYSGILRNEVPNHAYSLGLLRDDINREVIAYAASTFYTFNLVITEPEIEASKFELK-----------LVIF------------------

Query:  ------------------------------------------------QMLQT-----------IETHYKGLNHATRLVIDASANGALLTKLYAEAFNIL
                                                        +++Q            IE +Y GL+ ATRLV   S N ALL K YAEAFNIL
Subjt:  ------------------------------------------------QMLQT-----------IETHYKGLNHATRLVIDASANGALLTKLYAEAFNIL

Query:  ERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVNQIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNR
        ERISSN HS SD RA+QG  +K L ESKSY   NSKIEN+ DLV RSMTQQS++GA TGKAN +  QG S SF  G HHYNNCP N E VY  GN  ++R
Subjt:  ERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVNQIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNR

Query:  NNPYSNTYNPDWRNHPN
        NN YSNTYNP  RNHPN
Subjt:  NNPYSNTYNPDWRNHPN

XP_022156571.1 uncharacterized protein LOC111023448 [Momordica charantia]4.9e-6185.62Show/hide
Query:  MNPDESHRCFEEKSLKEAADHLDEARPSPDRQLNPTTLDGGEPVSGLNKATDLLGKARASPGYRLNPTTLDKGEPVAGLIKVTDLSGEARSSPGHRLNSI
        MNPDESHRCFEEKSLK+AADHLDEARPSPDRQLNPTTLDGGEPVSGLNKATDL GKAR SPGYRLNPTTLDKG PVAGLIK TDLSG+AR SPGHRLNS 
Subjt:  MNPDESHRCFEEKSLKEAADHLDEARPSPDRQLNPTTLDGGEPVSGLNKATDLLGKARASPGYRLNPTTLDKGEPVAGLIKVTDLSGEARSSPGHRLNSI

Query:  PLDRGEPAEEVESVPLTAEDRRVNIGTKLGIFD----LNFQTANGD
         LDRGEPAEEVESVPLTAEDRRVNIGTKLG  +    +NF  +N D
Subjt:  PLDRGEPAEEVESVPLTAEDRRVNIGTKLGIFD----LNFQTANGD

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]1.1e-7378.14Show/hide
Query:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN
        IE +YKGL+ ATRLVIDAS NGALL K YAEAFNILERISSNNHSWSDPRA+QG   KGL ES+SY ALNSK+ENLT+LVMRSMTQQ+++GAS GKANV+
Subjt:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN

Query:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF
         IQGISCSFCEG+HHYNN P N E VYY GN Q+N  N YSNTYNP WRNHPNFSWSGNQGG+NAGTSNAP +QQK SYP  F
Subjt:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]1.5e-5768.31Show/hide
Query:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN
        IET+YK LN ATRL                                 DPRAVQG SSKGLVES+SY  LNS IENLT LVMRSM QQSS+GA TG ANVN
Subjt:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN

Query:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF
        QIQGISCSFCEGDHHYNNCP N E VYY GNPQ+NRNN YSNTYNP WRNHPNFSWSG+QGGHNAGTS+AP FQ KVSYP GF
Subjt:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF

XP_022159060.1 uncharacterized protein LOC111025500 [Momordica charantia]3.3e-4974.47Show/hide
Query:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN
        I+T+Y GL+ ATRLVIDASANGALL K YAEAFNILERISSNN SWSDPRA+ G  SKG  ES+S+ ALN KIENLTDLVMRSMT QS++GAS GKANV+
Subjt:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN

Query:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYS
         IQGISCSFC G++ YNNCP N E V+Y GN Q+N NNPYS
Subjt:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYS

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189106.9e-4543.53Show/hide
Query:  IEDAVDNSPKESYSGILRNEVPNHAYSLGLLRDDINREVIAYAASTFYTFNLVITEPEIEASKFELK-----------LVIF------------------
        IE+ VD  P  +      +EV   + ++ LL   I+RE+ AYAA TFY FN VITE EI A KFELK           L +F                  
Subjt:  IEDAVDNSPKESYSGILRNEVPNHAYSLGLLRDDINREVIAYAASTFYTFNLVITEPEIEASKFELK-----------LVIF------------------

Query:  ------------------------------------------------QMLQT-----------IETHYKGLNHATRLVIDASANGALLTKLYAEAFNIL
                                                        +++Q            IE +Y GL+ ATRLV   S N ALL K YAEAFNIL
Subjt:  ------------------------------------------------QMLQT-----------IETHYKGLNHATRLVIDASANGALLTKLYAEAFNIL

Query:  ERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVNQIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNR
        ERISSN HS SD RA+QG  +K L ESKSY   NSKIEN+ DLV RSMTQQS++GA TGKAN +  QG S SF  G HHYNNCP N E VY  GN  ++R
Subjt:  ERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVNQIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNR

Query:  NNPYSNTYNPDWRNHPN
        NN YSNTYNP  RNHPN
Subjt:  NNPYSNTYNPDWRNHPN

A0A6J1DRG1 uncharacterized protein LOC1110236695.4e-7478.14Show/hide
Query:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN
        IE +YKGL+ ATRLVIDAS NGALL K YAEAFNILERISSNNHSWSDPRA+QG   KGL ES+SY ALNSK+ENLT+LVMRSMTQQ+++GAS GKANV+
Subjt:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN

Query:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF
         IQGISCSFCEG+HHYNN P N E VYY GN Q+N  N YSNTYNP WRNHPNFSWSGNQGG+NAGTSNAP +QQK SYP  F
Subjt:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF

A0A6J1DVC6 Ribonuclease H2.4e-6185.62Show/hide
Query:  MNPDESHRCFEEKSLKEAADHLDEARPSPDRQLNPTTLDGGEPVSGLNKATDLLGKARASPGYRLNPTTLDKGEPVAGLIKVTDLSGEARSSPGHRLNSI
        MNPDESHRCFEEKSLK+AADHLDEARPSPDRQLNPTTLDGGEPVSGLNKATDL GKAR SPGYRLNPTTLDKG PVAGLIK TDLSG+AR SPGHRLNS 
Subjt:  MNPDESHRCFEEKSLKEAADHLDEARPSPDRQLNPTTLDGGEPVSGLNKATDLLGKARASPGYRLNPTTLDKGEPVAGLIKVTDLSGEARSSPGHRLNSI

Query:  PLDRGEPAEEVESVPLTAEDRRVNIGTKLGIFD----LNFQTANGD
         LDRGEPAEEVESVPLTAEDRRVNIGTKLG  +    +NF  +N D
Subjt:  PLDRGEPAEEVESVPLTAEDRRVNIGTKLGIFD----LNFQTANGD

A0A6J1DXK5 uncharacterized protein LOC1110255001.6e-4974.47Show/hide
Query:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN
        I+T+Y GL+ ATRLVIDASANGALL K YAEAFNILERISSNN SWSDPRA+ G  SKG  ES+S+ ALN KIENLTDLVMRSMT QS++GAS GKANV+
Subjt:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN

Query:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYS
         IQGISCSFC G++ YNNCP N E V+Y GN Q+N NNPYS
Subjt:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYS

A0A6J1E1F3 uncharacterized protein LOC1110250657.1e-5868.31Show/hide
Query:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN
        IET+YK LN ATRL                                 DPRAVQG SSKGLVES+SY  LNS IENLT LVMRSM QQSS+GA TG ANVN
Subjt:  IETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLTDLVMRSMTQQSSIGASTGKANVN

Query:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF
        QIQGISCSFCEGDHHYNNCP N E VYY GNPQ+NRNN YSNTYNP WRNHPNFSWSG+QGGHNAGTS+AP FQ KVSYP GF
Subjt:  QIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCTGATGAGAGCCATCGGTGCTTCGAAGAGAAGAGCCTGAAAGAAGCGGCCGATCACTTAGATGAGGCAAGACCGAGCCCTGATCGTCAACTTAATCCGACCAC
CCTTGACGGGGGAGAACCAGTGTCGGGCCTGAATAAAGCGACAGACCTCTTGGGAAAGGCGAGGGCGAGCCCTGGTTATCGACTTAATCCGACCACCCTTGATAAGGGAG
AACCAGTGGCGGGCCTGATTAAAGTGACAGACCTTTCGGGGGAGGCAAGGTCGAGCCCTGGTCATCGACTTAATTCGATCCCCCTTGACAGGGGAGAACCGGCTGAGGAG
GTAGAATCTGTCCCTCTGACAGCCGAAGATCGACGGGTCAACATCGGAACCAAGTTGGGGATCTTCGATCTTAATTTTCAGACGGCCAACGGTGATCAGTTGATTCAGTT
GGCTTATTTTAATTTTCAAACGGACGGCAAGCGGTGCTCTGCTTTTGGCAAACTTGGTCGGATTAAATTTCACCTAACACAGGCGATGGTTAAAGACAGTGGCCCTTTTA
GAATGATGTGTCATTCTCCAGTGTCGACTGTGGTGTCTCGCAGCGTCGTAATCGGTTTGGCGGTGGGGAACAAACCTAGGTCAACTTTGCATGGGCCGGATGATGTAGCA
GTGGTTCGAACTAGTTCTGATCCTTCTAGGTCCGAGAATTTGATTGTTCCTATTGGTTCGGGGGTTGGGGTTTCCCTTTCTTTTGGTGGTGATGTTGCGGAAAAGAGGAA
GGGTTTTCTAGTTAACTTTTCTAATATACCGCTTGAGCTGTGGACACCGCGGGGTCTTAGTACTATTGCTAGTGTGCTTAGGACCCTGCTATGGCTTGATAAGGCTACGG
AGGAGCGTAGTCGGCTTTCTTTTGCTAGGGTATGTAATGAGATGTGGGCTGCTTCTTCTTTCCCGCTTATGTTAAAGTCCGAGTGTGTGATGTTTTATTTCCTATTTCTA
TTGAGTGTGTGTTCTAAAGATTCCTCAGGTGTCGGCTCCAAAGTCTACGAGCCTTCTTTGGCTTCTCTAGTGCCGGCTCCTCTTGTTCCGACTGGGGAGCTTGTGGTTCT
AGATAAGGAGGCTATGGTTTCGATTTCGGGATTGATTCGGCCTGTTGTGGGTGGAAATTGTTTTGCTACTTTGGCGTCTAGTGAGGATGTTATAGAGGATGCTGTTGATA
ACTCTCCAAAAGAGAGTTATTCTGGGATACTCAGGAATGAGGTCCCTAATCATGCCTATAGTTTGGGTCTTTTGCGAGACGACATCAATAGGGAAGTCATAGCATATGCA
GCCTCGACATTCTACACTTTCAACCTAGTTATCACGGAGCCAGAAATTGAAGCTTCCAAATTTGAGCTGAAACTAGTGATATTTCAGATGCTCCAGACAATAGAAACACA
TTACAAAGGTCTGAATCATGCCACACGCTTAGTAATTGATGCATCCGCAAATGGGGCTTTGCTAACAAAACTATATGCTGAAGCATTCAATATTTTAGAAAGAATATCAT
CAAACAATCACTCATGGTCTGATCCTAGAGCTGTTCAAGGAAACTCAAGTAAGGGGCTAGTTGAGTCTAAATCATACATTGCATTAAATTCGAAGATTGAGAATCTGACG
GACTTGGTAATGAGAAGTATGACGCAACAAAGTTCAATTGGAGCGTCAACTGGTAAGGCTAATGTCAATCAAATTCAAGGGATTTCATGTTCTTTCTGCGAGGGAGATCA
CCATTACAACAACTGCCCTAGAAATTCGGAGTTAGTTTATTATTCGGGGAACCCGCAACATAATAGAAACAATCCATATTCGAATACGTACAATCCTGACTGGAGGAATC
ACCCCAATTTTAGTTGGAGTGGCAATCAAGGAGGACATAACGCTGGAACATCCAATGCTCCAACTTTTCAACAGAAAGTAAGTTATCCTACTGGTTTTCGAATCAAGGAC
AAATGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCCTGATGAGAGCCATCGGTGCTTCGAAGAGAAGAGCCTGAAAGAAGCGGCCGATCACTTAGATGAGGCAAGACCGAGCCCTGATCGTCAACTTAATCCGACCAC
CCTTGACGGGGGAGAACCAGTGTCGGGCCTGAATAAAGCGACAGACCTCTTGGGAAAGGCGAGGGCGAGCCCTGGTTATCGACTTAATCCGACCACCCTTGATAAGGGAG
AACCAGTGGCGGGCCTGATTAAAGTGACAGACCTTTCGGGGGAGGCAAGGTCGAGCCCTGGTCATCGACTTAATTCGATCCCCCTTGACAGGGGAGAACCGGCTGAGGAG
GTAGAATCTGTCCCTCTGACAGCCGAAGATCGACGGGTCAACATCGGAACCAAGTTGGGGATCTTCGATCTTAATTTTCAGACGGCCAACGGTGATCAGTTGATTCAGTT
GGCTTATTTTAATTTTCAAACGGACGGCAAGCGGTGCTCTGCTTTTGGCAAACTTGGTCGGATTAAATTTCACCTAACACAGGCGATGGTTAAAGACAGTGGCCCTTTTA
GAATGATGTGTCATTCTCCAGTGTCGACTGTGGTGTCTCGCAGCGTCGTAATCGGTTTGGCGGTGGGGAACAAACCTAGGTCAACTTTGCATGGGCCGGATGATGTAGCA
GTGGTTCGAACTAGTTCTGATCCTTCTAGGTCCGAGAATTTGATTGTTCCTATTGGTTCGGGGGTTGGGGTTTCCCTTTCTTTTGGTGGTGATGTTGCGGAAAAGAGGAA
GGGTTTTCTAGTTAACTTTTCTAATATACCGCTTGAGCTGTGGACACCGCGGGGTCTTAGTACTATTGCTAGTGTGCTTAGGACCCTGCTATGGCTTGATAAGGCTACGG
AGGAGCGTAGTCGGCTTTCTTTTGCTAGGGTATGTAATGAGATGTGGGCTGCTTCTTCTTTCCCGCTTATGTTAAAGTCCGAGTGTGTGATGTTTTATTTCCTATTTCTA
TTGAGTGTGTGTTCTAAAGATTCCTCAGGTGTCGGCTCCAAAGTCTACGAGCCTTCTTTGGCTTCTCTAGTGCCGGCTCCTCTTGTTCCGACTGGGGAGCTTGTGGTTCT
AGATAAGGAGGCTATGGTTTCGATTTCGGGATTGATTCGGCCTGTTGTGGGTGGAAATTGTTTTGCTACTTTGGCGTCTAGTGAGGATGTTATAGAGGATGCTGTTGATA
ACTCTCCAAAAGAGAGTTATTCTGGGATACTCAGGAATGAGGTCCCTAATCATGCCTATAGTTTGGGTCTTTTGCGAGACGACATCAATAGGGAAGTCATAGCATATGCA
GCCTCGACATTCTACACTTTCAACCTAGTTATCACGGAGCCAGAAATTGAAGCTTCCAAATTTGAGCTGAAACTAGTGATATTTCAGATGCTCCAGACAATAGAAACACA
TTACAAAGGTCTGAATCATGCCACACGCTTAGTAATTGATGCATCCGCAAATGGGGCTTTGCTAACAAAACTATATGCTGAAGCATTCAATATTTTAGAAAGAATATCAT
CAAACAATCACTCATGGTCTGATCCTAGAGCTGTTCAAGGAAACTCAAGTAAGGGGCTAGTTGAGTCTAAATCATACATTGCATTAAATTCGAAGATTGAGAATCTGACG
GACTTGGTAATGAGAAGTATGACGCAACAAAGTTCAATTGGAGCGTCAACTGGTAAGGCTAATGTCAATCAAATTCAAGGGATTTCATGTTCTTTCTGCGAGGGAGATCA
CCATTACAACAACTGCCCTAGAAATTCGGAGTTAGTTTATTATTCGGGGAACCCGCAACATAATAGAAACAATCCATATTCGAATACGTACAATCCTGACTGGAGGAATC
ACCCCAATTTTAGTTGGAGTGGCAATCAAGGAGGACATAACGCTGGAACATCCAATGCTCCAACTTTTCAACAGAAAGTAAGTTATCCTACTGGTTTTCGAATCAAGGAC
AAATGGTAG
Protein sequenceShow/hide protein sequence
MNPDESHRCFEEKSLKEAADHLDEARPSPDRQLNPTTLDGGEPVSGLNKATDLLGKARASPGYRLNPTTLDKGEPVAGLIKVTDLSGEARSSPGHRLNSIPLDRGEPAEE
VESVPLTAEDRRVNIGTKLGIFDLNFQTANGDQLIQLAYFNFQTDGKRCSAFGKLGRIKFHLTQAMVKDSGPFRMMCHSPVSTVVSRSVVIGLAVGNKPRSTLHGPDDVA
VVRTSSDPSRSENLIVPIGSGVGVSLSFGGDVAEKRKGFLVNFSNIPLELWTPRGLSTIASVLRTLLWLDKATEERSRLSFARVCNEMWAASSFPLMLKSECVMFYFLFL
LSVCSKDSSGVGSKVYEPSLASLVPAPLVPTGELVVLDKEAMVSISGLIRPVVGGNCFATLASSEDVIEDAVDNSPKESYSGILRNEVPNHAYSLGLLRDDINREVIAYA
ASTFYTFNLVITEPEIEASKFELKLVIFQMLQTIETHYKGLNHATRLVIDASANGALLTKLYAEAFNILERISSNNHSWSDPRAVQGNSSKGLVESKSYIALNSKIENLT
DLVMRSMTQQSSIGASTGKANVNQIQGISCSFCEGDHHYNNCPRNSELVYYSGNPQHNRNNPYSNTYNPDWRNHPNFSWSGNQGGHNAGTSNAPTFQQKVSYPTGFRIKD
KW