; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g29250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g29250
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr9:21992632..22008980
RNA-Seq ExpressionMoc09g29250
SyntenyMoc09g29250
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_009798319.1 PREDICTED: uncharacterized protein LOC104244566 [Nicotiana sylvestris]4.3e-2755.93Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        +  L++ LYNV  +A +TS++LW+AL+KKYK+E A  +KF+V KFLDYKM+DNK V  Q+++LQ+I  D  +EG+V+NE +FQVA +I+KLPP+ R+FK 
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMEL
        YLKHKRKE+ +ENL + L
Subjt:  YLKHKRKELSMENLTMEL

XP_010544457.1 PREDICTED: uncharacterized protein LOC104817078 [Tarenaya hassleriana]1.5e-2760Show/hide
Query:  LDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKCYLK
        LD+ LYNV C A  TSR+LWEAL+KKYK E A  +KF+  KFL++KMVDNKLV  Q+++LQ+IV +L +EG++INE  FQVA  I+KLPP+ ++FK YLK
Subjt:  LDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKCYLK

Query:  HKRKELSMENLTMEL
        HKRKE+S+E+L + L
Subjt:  HKRKELSMENLTMEL

XP_022147763.1 uncharacterized protein LOC111016620 [Momordica charantia]5.1e-4480.17Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        + C+D+ LYNV CNAFDTSRQLWEALDKKYKLE A T+KFLVGKFLDYKMVD KLVVN LE+LQII+SDLQSEGLVINEP FQV VVI+KL PA REFKC
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMELHKK
        YLKHK+KELS+ENLT++L  K
Subjt:  YLKHKRKELSMENLTMELHKK

XP_022148559.1 uncharacterized protein LOC111017193 [Momordica charantia]1.4e-4177.97Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        + CLD+ L+NV CNAFDTSRQLWEALDKKYKLE A T+KFLV KFLDYK++D KLV+NQLE+LQII SDLQSE LVINEP FQ+  VI+KLPPA REFK 
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMEL
        YLKHKRKELSMENLT++L
Subjt:  YLKHKRKELSMENLTMEL

XP_022156727.1 uncharacterized protein LOC111023572 [Momordica charantia]1.0e-3977.97Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        + CLD  L NV CNAFDTSRQLW+ LDKKYKLE   T+KFLVGKFLDYKMV+ KLVVNQLE+LQII SDLQSEGLVINE LFQVA VI+ LP   REFKC
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMEL
        YLKHKRK+LSMENLT++L
Subjt:  YLKHKRKELSMENLTMEL

TrEMBL top hitse value%identityAlignment
A0A1U7YGF8 uncharacterized protein LOC1042445662.1e-2755.93Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        +  L++ LYNV  +A +TS++LW+AL+KKYK+E A  +KF+V KFLDYKM+DNK V  Q+++LQ+I  D  +EG+V+NE +FQVA +I+KLPP+ R+FK 
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMEL
        YLKHKRKE+ +ENL + L
Subjt:  YLKHKRKELSMENLTMEL

A0A6J1D271 uncharacterized protein LOC1110166202.5e-4480.17Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        + C+D+ LYNV CNAFDTSRQLWEALDKKYKLE A T+KFLVGKFLDYKMVD KLVVN LE+LQII+SDLQSEGLVINEP FQV VVI+KL PA REFKC
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMELHKK
        YLKHK+KELS+ENLT++L  K
Subjt:  YLKHKRKELSMENLTMELHKK

A0A6J1D4C8 uncharacterized protein LOC1110171936.7e-4277.97Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        + CLD+ L+NV CNAFDTSRQLWEALDKKYKLE A T+KFLV KFLDYK++D KLV+NQLE+LQII SDLQSE LVINEP FQ+  VI+KLPPA REFK 
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMEL
        YLKHKRKELSMENLT++L
Subjt:  YLKHKRKELSMENLTMEL

A0A6J1DSQ3 uncharacterized protein LOC1110235724.8e-4077.97Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        + CLD  L NV CNAFDTSRQLW+ LDKKYKLE   T+KFLVGKFLDYKMV+ KLVVNQLE+LQII SDLQSEGLVINE LFQVA VI+ LP   REFKC
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMEL
        YLKHKRK+LSMENLT++L
Subjt:  YLKHKRKELSMENLTMEL

A0A6P6UTU4 uncharacterized protein LOC1137141392.7e-2755.08Show/hide
Query:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC
        M CL + LYNV      T++ LWE+LD+KYK+E A  +KF+VGKFLDYKMVD+K V++Q++++QII+ ++ +EG++++E  FQVA VI+KLPP  ++FK 
Subjt:  MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKC

Query:  YLKHKRKELSMENLTMEL
        YLKHKRKE+SME+L + L
Subjt:  YLKHKRKELSMENLTMEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGCCTAGATGAAAAATTGTATAATGTCGATTGCAATGCCTTTGATACTTCAAGGCAATTGTGGGAGGCTTTAGACAAGAAGTATAAGCTGGAATATGCTAGTAC
TAGGAAATTCCTTGTTGGAAAGTTCTTAGATTACAAAATGGTTGATAACAAGTTGGTAGTTAATCAGTTGGAAAAATTGCAAATTATCGTTAGTGATTTACAAAGTGAAG
GATTGGTCATTAATGAACCATTATTCCAAGTTGCTGTTGTGATTGATAAATTGCCTCCTGCTTTGAGAGAATTCAAATGTTATCTCAAACACAAGCGAAAGGAGTTATCC
ATGGAGAATCTTACTATGGAACTCCACAAAAAATTGAAGCTAATGCACATATTGCTGAATCTTCAAGGTGTCATCCCAAGAAGCAACATTCCAAAACGAAGAATGTCAAT
CTTGGGCCAAGAAATGACGCTAACAAGCACATTCGTGGAATCTGATTGGCGCATGAAATTCTCCACTCAAGTTGACCATACCGCGCAAGCGTATGGAAGATTTGGGGGCC
GCTTCCCTCTAGTTAGTGGGTACGTTGAAGAGCGGGTGGTCAAGCTATTTGTCAAGATCGCGAGGAGTAGGGAAATTTATAGGCTGTGCTTAGCTGCTATACATTTGGAT
CCTCAGCCATATAAATTGTCCACCAGAGTTCAAGTGTCCTCAGGACTAAGGCCAGAAGACGTGGAGTGGAGAGCCCGATGGATGTCAACCAAACCTATGATGTATAGGTG
TGGTGTATTTAAGGAGGGCTGCAGTGGGGAGGAGAGGACGGACAGAACACCCATACAGAAGAATTCACTTCCTTCTTTCTCTCTACCATCATCTTCTCTTCTTCTTCACC
CAGGCCGGCGACCCCGACGATACACAGACCGCACACGTCCCCCGGTGATCACAACTTCAGACAACCTGCACCATAGTAGTCCGGCGCAAAACTTCTACAACCGACGCACG
GATCACCGACCACCTGACTTCGACCTCGAGTGCTCCTTCACAGTTTTCAGACCCGCGAGCACACCTAGAATTTCCTCACTGCAGTACGTGCGTGAAGTTGTTCCAGAACC
TTCCGTAGCTAACCCTAATTCACCGTTGCAGGGCCTGGGCAAAATCGTCGATCGAAAATGGGGGATCACTGGCTGGGCCGTGGGCCGAGCACGGCTCGGGACCGAGCCCC
GGGTCGGGGCCGAGCACCGGGTCGGGACTGAGCCCTTCGTGGCTCAGTGGGCTTCCCTCCGGTCGCTTTCCTCGTTCTCTGCCCCGGCGTTAGGTCCTTCTTTGGACCGG
CCCGTTATGTCACGGGCCGGTCTTCCCCAATCAGACTTCAACTTCTTGTCTCGTTTCGGGTCGTTTTTGTTCTCCATAACAATGGCCCCCACTCTCAAAACAGAAGACGA
ATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGCCTAGATGAAAAATTGTATAATGTCGATTGCAATGCCTTTGATACTTCAAGGCAATTGTGGGAGGCTTTAGACAAGAAGTATAAGCTGGAATATGCTAGTAC
TAGGAAATTCCTTGTTGGAAAGTTCTTAGATTACAAAATGGTTGATAACAAGTTGGTAGTTAATCAGTTGGAAAAATTGCAAATTATCGTTAGTGATTTACAAAGTGAAG
GATTGGTCATTAATGAACCATTATTCCAAGTTGCTGTTGTGATTGATAAATTGCCTCCTGCTTTGAGAGAATTCAAATGTTATCTCAAACACAAGCGAAAGGAGTTATCC
ATGGAGAATCTTACTATGGAACTCCACAAAAAATTGAAGCTAATGCACATATTGCTGAATCTTCAAGGTGTCATCCCAAGAAGCAACATTCCAAAACGAAGAATGTCAAT
CTTGGGCCAAGAAATGACGCTAACAAGCACATTCGTGGAATCTGATTGGCGCATGAAATTCTCCACTCAAGTTGACCATACCGCGCAAGCGTATGGAAGATTTGGGGGCC
GCTTCCCTCTAGTTAGTGGGTACGTTGAAGAGCGGGTGGTCAAGCTATTTGTCAAGATCGCGAGGAGTAGGGAAATTTATAGGCTGTGCTTAGCTGCTATACATTTGGAT
CCTCAGCCATATAAATTGTCCACCAGAGTTCAAGTGTCCTCAGGACTAAGGCCAGAAGACGTGGAGTGGAGAGCCCGATGGATGTCAACCAAACCTATGATGTATAGGTG
TGGTGTATTTAAGGAGGGCTGCAGTGGGGAGGAGAGGACGGACAGAACACCCATACAGAAGAATTCACTTCCTTCTTTCTCTCTACCATCATCTTCTCTTCTTCTTCACC
CAGGCCGGCGACCCCGACGATACACAGACCGCACACGTCCCCCGGTGATCACAACTTCAGACAACCTGCACCATAGTAGTCCGGCGCAAAACTTCTACAACCGACGCACG
GATCACCGACCACCTGACTTCGACCTCGAGTGCTCCTTCACAGTTTTCAGACCCGCGAGCACACCTAGAATTTCCTCACTGCAGTACGTGCGTGAAGTTGTTCCAGAACC
TTCCGTAGCTAACCCTAATTCACCGTTGCAGGGCCTGGGCAAAATCGTCGATCGAAAATGGGGGATCACTGGCTGGGCCGTGGGCCGAGCACGGCTCGGGACCGAGCCCC
GGGTCGGGGCCGAGCACCGGGTCGGGACTGAGCCCTTCGTGGCTCAGTGGGCTTCCCTCCGGTCGCTTTCCTCGTTCTCTGCCCCGGCGTTAGGTCCTTCTTTGGACCGG
CCCGTTATGTCACGGGCCGGTCTTCCCCAATCAGACTTCAACTTCTTGTCTCGTTTCGGGTCGTTTTTGTTCTCCATAACAATGGCCCCCACTCTCAAAACAGAAGACGA
ATAA
Protein sequenceShow/hide protein sequence
MGCLDEKLYNVDCNAFDTSRQLWEALDKKYKLEYASTRKFLVGKFLDYKMVDNKLVVNQLEKLQIIVSDLQSEGLVINEPLFQVAVVIDKLPPALREFKCYLKHKRKELS
MENLTMELHKKLKLMHILLNLQGVIPRSNIPKRRMSILGQEMTLTSTFVESDWRMKFSTQVDHTAQAYGRFGGRFPLVSGYVEERVVKLFVKIARSREIYRLCLAAIHLD
PQPYKLSTRVQVSSGLRPEDVEWRARWMSTKPMMYRCGVFKEGCSGEERTDRTPIQKNSLPSFSLPSSSLLLHPGRRPRRYTDRTRPPVITTSDNLHHSSPAQNFYNRRT
DHRPPDFDLECSFTVFRPASTPRISSLQYVREVVPEPSVANPNSPLQGLGKIVDRKWGITGWAVGRARLGTEPRVGAEHRVGTEPFVAQWASLRSLSSFSAPALGPSLDR
PVMSRAGLPQSDFNFLSRFGSFLFSITMAPTLKTEDE