; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g17250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g17250
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationchr4:12724922..12725302
RNA-Seq ExpressionMoc04g17250
SyntenyMoc04g17250
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia]5.7e-1946.56Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEP---SLVLQISD
        MMLN A NG+  +K+ NEIVDIL  +   N+    +  +  PKKQ  AGV  LD   S Q +   MNQMLK+  +E+  K+A     +P   S V QI++
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEP---SLVLQISD

Query:  ISCVYCGDNHLYENRPANPASIFYVGQGAQR
        I C YC DNH+YEN P NPAS +YVG G  R
Subjt:  ISCVYCGDNHLYENRPANPASIFYVGQGAQR

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]7.4e-2780Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLV
        MMLNTA NGSLLEKS+NEIVDILNKM DINDQGE GRSL KKQVSAG+FELDTVA  QAQ AAMNQMLKQ TMEKETKT TS+ +   L+
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLV

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]5.3e-4984.92Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY
        MMLNTA N SL EKSI+EI+DILNKMTD NDQGEIGRSLPKKQVSA VFELDTVAS QAQ A +NQMLKQ TMEKETKTATS MLEPSL LQISDISCVY
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY

Query:  CGDNHLYENRPANPASIFYVGQGAQR
        CGDN LYEN PANP S+FYVGQ AQR
Subjt:  CGDNHLYENRPANPASIFYVGQGAQR

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]4.6e-3775.63Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY
        MMLNT  NGSLLEKS+NEIVD+LNKMTDINDQGE+GRSLPKKQVS G+FELDTVAS QAQ AAMNQMLKQ TMEKETKT TS + E S +LQISDISCVY
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY

Query:  CGDNHLYENRPANPASIFY
        CG       R  NP S  Y
Subjt:  CGDNHLYENRPANPASIFY

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]2.8e-4276.38Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSV-MLEPSLVLQISDISCV
        MM +TA N SLLEKS+NEI+DILNKM DINDQ E+GRSLPKKQ SAG+FELDTV S QAQ +AM+QMLKQ TM+K  K ATSV +LEPS +LQISDISCV
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSV-MLEPSLVLQISDISCV

Query:  YCGDNHLYENRPANPASIFYVGQGAQR
        YC DNHLYEN  ANPA IFYVGQG QR
Subjt:  YCGDNHLYENRPANPASIFYVGQGAQR

TrEMBL top hitse value%identityAlignment
A0A6J1DAE9 uncharacterized protein LOC1110185142.8e-1946.56Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEP---SLVLQISD
        MMLN A NG+  +K+ NEIVDIL  +   N+    +  +  PKKQ  AGV  LD   S Q +   MNQMLK+  +E+  K+A     +P   S V QI++
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEP---SLVLQISD

Query:  ISCVYCGDNHLYENRPANPASIFYVGQGAQR
        I C YC DNH+YEN P NPAS +YVG G  R
Subjt:  ISCVYCGDNHLYENRPANPASIFYVGQGAQR

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220073.6e-2780Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLV
        MMLNTA NGSLLEKS+NEIVDILNKM DINDQGE GRSL KKQVSAG+FELDTVA  QAQ AAMNQMLKQ TMEKETKT TS+ +   L+
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLV

A0A6J1DYY9 uncharacterized protein LOC1110255571.4e-4276.38Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSV-MLEPSLVLQISDISCV
        MM +TA N SLLEKS+NEI+DILNKM DINDQ E+GRSLPKKQ SAG+FELDTV S QAQ +AM+QMLKQ TM+K  K ATSV +LEPS +LQISDISCV
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSV-MLEPSLVLQISDISCV

Query:  YCGDNHLYENRPANPASIFYVGQGAQR
        YC DNHLYEN  ANPA IFYVGQG QR
Subjt:  YCGDNHLYENRPANPASIFYVGQGAQR

A0A6J1DZ19 uncharacterized protein LOC1110248242.6e-4984.92Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY
        MMLNTA N SL EKSI+EI+DILNKMTD NDQGEIGRSLPKKQVSA VFELDTVAS QAQ A +NQMLKQ TMEKETKTATS MLEPSL LQISDISCVY
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY

Query:  CGDNHLYENRPANPASIFYVGQGAQR
        CGDN LYEN PANP S+FYVGQ AQR
Subjt:  CGDNHLYENRPANPASIFYVGQGAQR

A0A6J1E251 uncharacterized protein LOC1110253022.2e-3775.63Show/hide
Query:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY
        MMLNT  NGSLLEKS+NEIVD+LNKMTDINDQGE+GRSLPKKQVS G+FELDTVAS QAQ AAMNQMLKQ TMEKETKT TS + E S +LQISDISCVY
Subjt:  MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVY

Query:  CGDNHLYENRPANPASIFY
        CG       R  NP S  Y
Subjt:  CGDNHLYENRPANPASIFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTTGAACACTGCAACCAATGGCTCATTGTTAGAAAAGTCGATAAATGAGATCGTTGATATCTTAAACAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAG
GTCATTGCCAAAGAAGCAAGTATCAGCCGGAGTCTTTGAGTTGGACACAGTAGCTTCAACGCAAGCCCAAACGGCGGCTATGAACCAGATGTTAAAGCAGCCGACAATGG
AGAAGGAAACCAAAACCGCCACTTCGGTGATGCTTGAACCGTCTCTTGTTTTACAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTGTATGAAAACCGT
CCAGCTAATCCAGCGTCTATTTTCTATGTAGGTCAAGGTGCCCAGCGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGTTGAACACTGCAACCAATGGCTCATTGTTAGAAAAGTCGATAAATGAGATCGTTGATATCTTAAACAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAG
GTCATTGCCAAAGAAGCAAGTATCAGCCGGAGTCTTTGAGTTGGACACAGTAGCTTCAACGCAAGCCCAAACGGCGGCTATGAACCAGATGTTAAAGCAGCCGACAATGG
AGAAGGAAACCAAAACCGCCACTTCGGTGATGCTTGAACCGTCTCTTGTTTTACAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTGTATGAAAACCGT
CCAGCTAATCCAGCGTCTATTTTCTATGTAGGTCAAGGTGCCCAGCGGTAA
Protein sequenceShow/hide protein sequence
MMLNTATNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGVFELDTVASTQAQTAAMNQMLKQPTMEKETKTATSVMLEPSLVLQISDISCVYCGDNHLYENR
PANPASIFYVGQGAQR