; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g00100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g00100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:67033..70319
RNA-Seq ExpressionMoc08g00100
SyntenyMoc08g00100
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]1.9e-2361.05Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE
        M+ C TA+EVW +L  L++SRNLARVMQLKSKLEN KKGNL LK+YF K+K +VDSL A GKKV+ EDH+MH+L GL ++++ TV VI A+ + +
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]1.9e-2361.05Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE
        M+ C TA+EVW +L  L++SRNLARVMQLKSKLEN KKGNL LK+YF K+K +VDSL A GKKV+ EDH+MH+L GL ++++ TV VI A+ + +
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]5.8e-2565.93Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK
        ML+C++A+E+W+VL  +F+SR LARVMQLK KLEN KKGNLSLK+YFLKIKN+VDSL   GKK+S EDH+MH+LAGLG ++D  + VI A+
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]2.1e-2734.69Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEEEEIDQ
        ML+C   +E+W++L   F+SRNLARVMQLKSKLEN KKG+++LK YFLKIKN+VDSL   GK++  +DH+MH+LA LG ++D  V VI  +   +   + 
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEEEEIDQ

Query:  IVAVRIITLEFNVSYVDGLDIQCYDAQQQ--------APLNALMVAPDLNRDTNWYPGSG-------------------------------LKVHHIGSS
                    V    G       AQ            + A+MVA D NRD  WYP SG                               L + HIGS+
Subjt:  IVAVRIITLEFNVSYVDGLDIQCYDAQQQ--------APLNALMVAPDLNRDTNWYPGSG-------------------------------LKVHHIGSS

Query:  VLQPSDTASSSHK-FLLTNLLH--------------------------------DIKTGQVLLEGKVSDGL
        +LQ    ++SS   F L NLLH                                D+ TGQ+L +G V D L
Subjt:  VLQPSDTASSSHK-FLLTNLLH--------------------------------DIKTGQVLLEGKVSDGL

XP_022158089.1 uncharacterized protein LOC111024658 [Momordica charantia]2.5e-2054.95Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK
        +L C TA+E+W  L  +F+++NL +VMQLK++L+N +KG LSLKEY  +IKN+VDSL A GK ++ EDH+MH+L+GLG++Y+ TV VI  K
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK

TrEMBL top hitse value%identityAlignment
A0A6J1C6N9 dr1-associated corepressor homolog isoform X19.1e-2461.05Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE
        M+ C TA+EVW +L  L++SRNLARVMQLKSKLEN KKGNL LK+YF K+K +VDSL A GKKV+ EDH+MH+L GL ++++ TV VI A+ + +
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE

A0A6J1C8R2 dr1-associated corepressor homolog isoform X29.1e-2461.05Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE
        M+ C TA+EVW +L  L++SRNLARVMQLKSKLEN KKGNL LK+YF K+K +VDSL A GKKV+ EDH+MH+L GL ++++ TV VI A+ + +
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEE

A0A6J1DLT9 uncharacterized protein LOC1110217572.8e-2565.93Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK
        ML+C++A+E+W+VL  +F+SR LARVMQLK KLEN KKGNLSLK+YFLKIKN+VDSL   GKK+S EDH+MH+LAGLG ++D  + VI A+
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK

A0A6J1DSS1 uncharacterized protein LOC1110235861.0e-2734.69Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEEEEIDQ
        ML+C   +E+W++L   F+SRNLARVMQLKSKLEN KKG+++LK YFLKIKN+VDSL   GK++  +DH+MH+LA LG ++D  V VI  +   +   + 
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEEEEIDQ

Query:  IVAVRIITLEFNVSYVDGLDIQCYDAQQQ--------APLNALMVAPDLNRDTNWYPGSG-------------------------------LKVHHIGSS
                    V    G       AQ            + A+MVA D NRD  WYP SG                               L + HIGS+
Subjt:  IVAVRIITLEFNVSYVDGLDIQCYDAQQQ--------APLNALMVAPDLNRDTNWYPGSG-------------------------------LKVHHIGSS

Query:  VLQPSDTASSSHK-FLLTNLLH--------------------------------DIKTGQVLLEGKVSDGL
        +LQ    ++SS   F L NLLH                                D+ TGQ+L +G V D L
Subjt:  VLQPSDTASSSHK-FLLTNLLH--------------------------------DIKTGQVLLEGKVSDGL

A0A6J1DYD5 uncharacterized protein LOC1110246581.2e-2054.95Show/hide
Query:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK
        +L C TA+E+W  L  +F+++NL +VMQLK++L+N +KG LSLKEY  +IKN+VDSL A GK ++ EDH+MH+L+GLG++Y+ TV VI  K
Subjt:  MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.2e-0528.92Show/hide
Query:  TAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVI
        T++++W  +   F +   AR ++L S+L     G++ + +Y+ K+K + DSL      V+  + VM++L GL   +D  + VI
Subjt:  TAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGAGTGTGAAACTGCACAAGAAGTCTGGTCAGTCCTTAATACTCTTTTCTCTTCGAGAAATCTAGCCAGAGTTATGCAATTAAAATCTAAGTTAGAGAATACAAA
GAAAGGAAATCTTAGCCTCAAAGAGTATTTCCTGAAAATAAAGAATATTGTGGATTCTTTAAATGCGACTGGAAAGAAAGTGTCCAAAGAAGATCATGTTATGCATTTGT
TAGCAGGATTAGGAACAGATTATGATCCTACTGTATATGTAATTGTTGCTAAAATCGAGGAAGAGGAGGAAATCGATCAAATCGTAGCCGTCAGAATAATAACTCTAGAG
TTCAATGTCAGCTATGTGGACGGTTTGGACATACAGTGCTACGATGCTCAACAACAAGCACCATTGAATGCTCTGATGGTTGCTCCGGATCTTAATAGAGATACCAACTG
GTATCCCGGCTCAGGTCTGAAAGTTCATCATATTGGATCATCTGTTCTTCAACCTTCCGATACTGCATCTTCTTCTCATAAATTTTTACTTACAAATCTTCTTCATGACA
TCAAAACTGGACAAGTACTTCTCGAGGGCAAGGTTTCTGATGGACTCCCTGCTCTTCTTCCATTACAGATAACTCTTTTCATCCTATTTCTACATCTCAGGTTTGAAATG
TCCGCGAGGATTGGGTATTGTTTATGCAAAGATGTGCACAACAGTGTGTTCCTGATTGTAGCTCGAACTCGGCCTCTGTACCGACCTGAACCTTTGGGCGGACCTGCACA
AAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCAGACCTGGGCCAGGTTCACCTCGGCCCTCATACTTAAGCTTTCGTCGGCGTTGGTGGCTGCCATCTCGGCTC
TCCGAGCTGGGCATGACTCGTGGGCCACCTTGGAGTACCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTAGAGTGTGAAACTGCACAAGAAGTCTGGTCAGTCCTTAATACTCTTTTCTCTTCGAGAAATCTAGCCAGAGTTATGCAATTAAAATCTAAGTTAGAGAATACAAA
GAAAGGAAATCTTAGCCTCAAAGAGTATTTCCTGAAAATAAAGAATATTGTGGATTCTTTAAATGCGACTGGAAAGAAAGTGTCCAAAGAAGATCATGTTATGCATTTGT
TAGCAGGATTAGGAACAGATTATGATCCTACTGTATATGTAATTGTTGCTAAAATCGAGGAAGAGGAGGAAATCGATCAAATCGTAGCCGTCAGAATAATAACTCTAGAG
TTCAATGTCAGCTATGTGGACGGTTTGGACATACAGTGCTACGATGCTCAACAACAAGCACCATTGAATGCTCTGATGGTTGCTCCGGATCTTAATAGAGATACCAACTG
GTATCCCGGCTCAGGTCTGAAAGTTCATCATATTGGATCATCTGTTCTTCAACCTTCCGATACTGCATCTTCTTCTCATAAATTTTTACTTACAAATCTTCTTCATGACA
TCAAAACTGGACAAGTACTTCTCGAGGGCAAGGTTTCTGATGGACTCCCTGCTCTTCTTCCATTACAGATAACTCTTTTCATCCTATTTCTACATCTCAGGTTTGAAATG
TCCGCGAGGATTGGGTATTGTTTATGCAAAGATGTGCACAACAGTGTGTTCCTGATTGTAGCTCGAACTCGGCCTCTGTACCGACCTGAACCTTTGGGCGGACCTGCACA
AAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCAGACCTGGGCCAGGTTCACCTCGGCCCTCATACTTAAGCTTTCGTCGGCGTTGGTGGCTGCCATCTCGGCTC
TCCGAGCTGGGCATGACTCGTGGGCCACCTTGGAGTACCAATAG
Protein sequenceShow/hide protein sequence
MLECETAQEVWSVLNTLFSSRNLARVMQLKSKLENTKKGNLSLKEYFLKIKNIVDSLNATGKKVSKEDHVMHLLAGLGTDYDPTVYVIVAKIEEEEEIDQIVAVRIITLE
FNVSYVDGLDIQCYDAQQQAPLNALMVAPDLNRDTNWYPGSGLKVHHIGSSVLQPSDTASSSHKFLLTNLLHDIKTGQVLLEGKVSDGLPALLPLQITLFILFLHLRFEM
SARIGYCLCKDVHNSVFLIVARTRPLYRPEPLGGPAQKGEHSDDQVSIGQTWARFTSALILKLSSALVAAISALRAGHDSWATLEYQ