; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g14850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g14850
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr4:11329951..11347774
RNA-Seq ExpressionMoc04g14850
SyntenyMoc04g14850
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]2.5e-5177.61Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRGEALNWWD VAT +DH NEPITWT  KDLLYDYYFPKT+KDE EIEFL+LTQ T+M+ QYEKKFTE SRFALDLIPTE RKIK FVRGL     GP+
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWL
        DL R TTY E I+GALVMDK+VIEKAQPQQ+V L
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWL

XP_022155872.1 uncharacterized protein LOC111022885 [Momordica charantia]8.6e-4467.42Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLR EALNWWD+VA A+DHAN P+TW RFKDLLYDYY+P+TVKD  E EFL+L QGT+ + QYE+KFTELSRFA +LIPTE  KIK FV+GLR    GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV
        DL R  TY E +RGAL+MDK+V  + QP  EV
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]3.2e-4649.57Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRGEALNWWD+VA A+D+AN PI W RFK+LLYDYY+P+TVKD  E EFL+L QGT+ + QYE+KFTELSRFAL+LIPTE  KIK FV+GLR    GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVW---------------LILRSK-----------------------------------KEGHLYRE--Y
        DL R TTY E +RGALVMDK+V  KA P  EV                L+LR+                                    +EGH  RE   
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVW---------------LILRSK-----------------------------------KEGHLYRE--Y

Query:  SMPDTQKLAPNAP----LQGTVQEARVFALTQEE
        S  +TQ+L    P     QG  Q ARVFALT++E
Subjt:  SMPDTQKLAPNAP----LQGTVQEARVFALTQEE

XP_022156985.1 uncharacterized protein LOC111023814 [Momordica charantia]2.0e-4063.97Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRG ALNWWD+V   +DHAN  ITW RFKDLLYDYYFPKT+KD  E+EFL LTQG++ +V+YEKKFTELSRFA ++I TE  KIK FV+GL     GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWLIL
        DL R  TY E ++G L+MDK+V  KAQ  QEV  +L
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWLIL

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia]1.8e-4165.15Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRGEALNWWD+VA A+DHAN PITW RFKDLLYDYY+PKT+KD  E EFL+ + GT+ + QYE+KFTELS FA +LIPTE  KIK FV+GLR    GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV
        DL R  TY E +RG L+MD +V    QP  EV
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196031.2e-5177.61Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRGEALNWWD VAT +DH NEPITWT  KDLLYDYYFPKT+KDE EIEFL+LTQ T+M+ QYEKKFTE SRFALDLIPTE RKIK FVRGL     GP+
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWL
        DL R TTY E I+GALVMDK+VIEKAQPQQ+V L
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWL

A0A6J1DQJ4 uncharacterized protein LOC1110228854.2e-4467.42Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLR EALNWWD+VA A+DHAN P+TW RFKDLLYDYY+P+TVKD  E EFL+L QGT+ + QYE+KFTELSRFA +LIPTE  KIK FV+GLR    GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV
        DL R  TY E +RGAL+MDK+V  + QP  EV
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV

A0A6J1DRW8 uncharacterized protein LOC1110238149.6e-4163.97Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRG ALNWWD+V   +DHAN  ITW RFKDLLYDYYFPKT+KD  E+EFL LTQG++ +V+YEKKFTELSRFA ++I TE  KIK FV+GL     GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWLIL
        DL R  TY E ++G L+MDK+V  KAQ  QEV  +L
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVWLIL

A0A6J1DUM2 uncharacterized protein LOC1110232471.5e-4649.57Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRGEALNWWD+VA A+D+AN PI W RFK+LLYDYY+P+TVKD  E EFL+L QGT+ + QYE+KFTELSRFAL+LIPTE  KIK FV+GLR    GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVW---------------LILRSK-----------------------------------KEGHLYRE--Y
        DL R TTY E +RGALVMDK+V  KA P  EV                L+LR+                                    +EGH  RE   
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEVW---------------LILRSK-----------------------------------KEGHLYRE--Y

Query:  SMPDTQKLAPNAP----LQGTVQEARVFALTQEE
        S  +TQ+L    P     QG  Q ARVFALT++E
Subjt:  SMPDTQKLAPNAP----LQGTVQEARVFALTQEE

A0A6J1DYU5 uncharacterized protein LOC1110255178.7e-4265.15Show/hide
Query:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV
        MLRGEALNWWD+VA A+DHAN PITW RFKDLLYDYY+PKT+KD  E EFL+ + GT+ + QYE+KFTELS FA +LIPTE  KIK FV+GLR    GPV
Subjt:  MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPV

Query:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV
        DL R  TY E +RG L+MD +V    QP  EV
Subjt:  DLHRSTTYEEPIRGALVMDKNVIEKAQPQQEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGAGGCGAAGCTCTAAATTGGTGGGATGCAGTAGCAACTGCAAAGGACCATGCGAATGAACCAATCACTTGGACAAGATTCAAAGATCTACTTTATGAC
TATTACTTTCCGAAGACGGTAAAGGATGAAAATGAGATAGAGTTTCTGTACCTCACTCAAGGAACTGTGATGATGGTTCAGTATGAAAAGAAGTTTACGGAACTC
TCTCGTTTTGCTCTTGATTTAATCCCCACCGAGCCGAGGAAAATTAAAATATTTGTTAGAGGTCTACGGAATGAGACTATAGGACCAGTTGATCTTCATCGGTCA
ACCACTTATGAGGAACCAATTAGGGGTGCCTTGGTTATGGATAAGAATGTCATCGAAAAAGCTCAACCACAACAAGAAGTTTGGCTCATCCTCAGGAGTAAAAAG
GAAGGTCATCTTTATAGGGAGTATTCAATGCCTGACACTCAAAAGTTAGCTCCGAACGCACCATTGCAAGGAACTGTTCAGGAGGCACGTGTCTTTGCACTAACT
CAGGAAGAAAAAGATGTTAATACTACAGGAGTAAGTCCTGGAGGAATAGAAAAGAAACGAGTGAGCTGTGATATGAGACTTGGGTTTGGCATGAGATTATATGTT
AGGAAAGGAGAAATCATCACGAAGGTCAATGATGAGCATGTCATGTTCAACATCCTGGATGTAATGCTTCTGCCGAATGAAGTTGAGGAGTGCTCTACAATAGGG
GCAACAATGGAAAAACTTCAAGAGTTGATAGCTGCAGACTTAGAAGTTAGTCTTCTTTGGATGCTACTAGTTCTTGAGATGATTGTCTTGTTGTCATGTGTGAAA
AGTGCCATGATGCTCATCTTGTCAATGAATTGCTTTCCCTTGTTCTTTGCTTTCCGCAGTGAAGACACAGTCAACCTAAGGCTCTCATCATTTTCACTTGTGTCG
AAGTTCTCGGCTGATGCAAATCCAACCACGGATCCATTGAATGTACCCGATGGACCAATTACAAGAAGCAAAGCAAAGAAGATTCAAGAGATTTTCATAATACAT
CTTCAAAGGCTAGCTAATGCACACGAGGAGACAAAGATTTCTGAGGCCAAAATTCTTTACAATGTTAACTTAATGAGTCAAGAAAAGAATGGAGCAAAGATGGCA
CGGGAAAAATTGTCTATTTTGAGAGATAACACGGAGGACAAAAAAACAAGGAGACAGATTTCTGAGGCCAAAATTCTTTACAATATTAAACATATTGGAATTCAA
GAACTTTTGTTCCTTAACACATTTGGTGTTGCTTCTTTCAAATCTTGCTTTAGGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAAGCTAATTCTGGA
CTTTGTAAACCTGAGCAACCTTGGCGCTTATCATTTGGTATCAGAGCATTGGTCGACATCCTTTGGCCGGCTCTAGAGGATGGAGTACCACAAGTACCGCAAGAT
CCTAACACGGTGATTCTTCAAGCAATTCAAAGAATGATGGAAATGATGATGGAAGATAGACAAGAAAGGAGAGCGCAACAACAAAAGGAAGAACGAGCCTTACAA
GAAGATGATGCGTTTGACCTTGCTGAACAAGAAAGACAAGTTGGAGGAAGAAGAAATGGGAGAGGAAGAGGACGAAATAACTTTGCCAACGTTATGCAACCTAGA
AAATTGGAAAGATTAAGCATAATGGAATTCAAGAACTTTTGTTCCTTAACACATTTGGTGTTGCTTCTTTCAAATCTTGCTTTAGGCAACTCATTCTCAATCCTG
AGTGAGTTATGGACTCTTGCCCGTGAGGCTTCGTCCTTTGATCTGTATAGAACACAGAACAAAGAAGAGGAACTCCCAGATCTCTCTCAACAAAAAGCTCTCTCT
CTCAAAATTCTCCCTTACGTTCCAAATCGACGCTCCCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGACATAGTGGTGGTGTTCGAGGGAAACTTG
TTGAAGAAACGTTCTTCAAAGGCGCTTTTCTGGCTTATGTTGGAAACTGAGTTCTACTCCTATCACAAACCTCTTTTGGTTGTTTTAGAACCTAGGGTAACCATT
GGTTATATGTCTATGCATGATTACAGCCAAGAAATTACTACCAATTCATCTACTTTTATGGATGCCACGGAACGGGATGTGATATCTCTACTAAAGTTGGAGCTT
CTAGCACCATGTGAGGGCCCTGCACATGCTCTTTTGTATAGCCAACTAAGCCAATGCAGCCTTTGTTCTTGGTTTGTTGGAAGTAGTGCCAACCACCTTTTTAGG
TCAGATGTAGAGAACCTCGAAATTGATGGACACAAACCTTTGGGTGATGAAGGAGTAGTGGCAGAAGATTCATGGGGTGCATCCATAAGACCGGATGGCTCCCTT
CCCAAAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGAGGCGAAGCTCTAAATTGGTGGGATGCAGTAGCAACTGCAAAGGACCATGCGAATGAACCAATCACTTGGACAAGATTCAAAGATCTACTTTATGAC
TATTACTTTCCGAAGACGGTAAAGGATGAAAATGAGATAGAGTTTCTGTACCTCACTCAAGGAACTGTGATGATGGTTCAGTATGAAAAGAAGTTTACGGAACTC
TCTCGTTTTGCTCTTGATTTAATCCCCACCGAGCCGAGGAAAATTAAAATATTTGTTAGAGGTCTACGGAATGAGACTATAGGACCAGTTGATCTTCATCGGTCA
ACCACTTATGAGGAACCAATTAGGGGTGCCTTGGTTATGGATAAGAATGTCATCGAAAAAGCTCAACCACAACAAGAAGTTTGGCTCATCCTCAGGAGTAAAAAG
GAAGGTCATCTTTATAGGGAGTATTCAATGCCTGACACTCAAAAGTTAGCTCCGAACGCACCATTGCAAGGAACTGTTCAGGAGGCACGTGTCTTTGCACTAACT
CAGGAAGAAAAAGATGTTAATACTACAGGAGTAAGTCCTGGAGGAATAGAAAAGAAACGAGTGAGCTGTGATATGAGACTTGGGTTTGGCATGAGATTATATGTT
AGGAAAGGAGAAATCATCACGAAGGTCAATGATGAGCATGTCATGTTCAACATCCTGGATGTAATGCTTCTGCCGAATGAAGTTGAGGAGTGCTCTACAATAGGG
GCAACAATGGAAAAACTTCAAGAGTTGATAGCTGCAGACTTAGAAGTTAGTCTTCTTTGGATGCTACTAGTTCTTGAGATGATTGTCTTGTTGTCATGTGTGAAA
AGTGCCATGATGCTCATCTTGTCAATGAATTGCTTTCCCTTGTTCTTTGCTTTCCGCAGTGAAGACACAGTCAACCTAAGGCTCTCATCATTTTCACTTGTGTCG
AAGTTCTCGGCTGATGCAAATCCAACCACGGATCCATTGAATGTACCCGATGGACCAATTACAAGAAGCAAAGCAAAGAAGATTCAAGAGATTTTCATAATACAT
CTTCAAAGGCTAGCTAATGCACACGAGGAGACAAAGATTTCTGAGGCCAAAATTCTTTACAATGTTAACTTAATGAGTCAAGAAAAGAATGGAGCAAAGATGGCA
CGGGAAAAATTGTCTATTTTGAGAGATAACACGGAGGACAAAAAAACAAGGAGACAGATTTCTGAGGCCAAAATTCTTTACAATATTAAACATATTGGAATTCAA
GAACTTTTGTTCCTTAACACATTTGGTGTTGCTTCTTTCAAATCTTGCTTTAGGTTTGATGTTAAACCGATTCATTGGATTAGTCTCGATCAAGCTAATTCTGGA
CTTTGTAAACCTGAGCAACCTTGGCGCTTATCATTTGGTATCAGAGCATTGGTCGACATCCTTTGGCCGGCTCTAGAGGATGGAGTACCACAAGTACCGCAAGAT
CCTAACACGGTGATTCTTCAAGCAATTCAAAGAATGATGGAAATGATGATGGAAGATAGACAAGAAAGGAGAGCGCAACAACAAAAGGAAGAACGAGCCTTACAA
GAAGATGATGCGTTTGACCTTGCTGAACAAGAAAGACAAGTTGGAGGAAGAAGAAATGGGAGAGGAAGAGGACGAAATAACTTTGCCAACGTTATGCAACCTAGA
AAATTGGAAAGATTAAGCATAATGGAATTCAAGAACTTTTGTTCCTTAACACATTTGGTGTTGCTTCTTTCAAATCTTGCTTTAGGCAACTCATTCTCAATCCTG
AGTGAGTTATGGACTCTTGCCCGTGAGGCTTCGTCCTTTGATCTGTATAGAACACAGAACAAAGAAGAGGAACTCCCAGATCTCTCTCAACAAAAAGCTCTCTCT
CTCAAAATTCTCCCTTACGTTCCAAATCGACGCTCCCACAAGCACGATCCCGAGACCCAAGAGGATAGCGAGGAAGACATAGTGGTGGTGTTCGAGGGAAACTTG
TTGAAGAAACGTTCTTCAAAGGCGCTTTTCTGGCTTATGTTGGAAACTGAGTTCTACTCCTATCACAAACCTCTTTTGGTTGTTTTAGAACCTAGGGTAACCATT
GGTTATATGTCTATGCATGATTACAGCCAAGAAATTACTACCAATTCATCTACTTTTATGGATGCCACGGAACGGGATGTGATATCTCTACTAAAGTTGGAGCTT
CTAGCACCATGTGAGGGCCCTGCACATGCTCTTTTGTATAGCCAACTAAGCCAATGCAGCCTTTGTTCTTGGTTTGTTGGAAGTAGTGCCAACCACCTTTTTAGG
TCAGATGTAGAGAACCTCGAAATTGATGGACACAAACCTTTGGGTGATGAAGGAGTAGTGGCAGAAGATTCATGGGGTGCATCCATAAGACCGGATGGCTCCCTT
CCCAAAGGTTAA
Protein sequenceShow/hide protein sequence
MLRGEALNWWDAVATAKDHANEPITWTRFKDLLYDYYFPKTVKDENEIEFLYLTQGTVMMVQYEKKFTELSRFALDLIPTEPRKIKIFVRGLRNETIGPVDLHRS
TTYEEPIRGALVMDKNVIEKAQPQQEVWLILRSKKEGHLYREYSMPDTQKLAPNAPLQGTVQEARVFALTQEEKDVNTTGVSPGGIEKKRVSCDMRLGFGMRLYV
RKGEIITKVNDEHVMFNILDVMLLPNEVEECSTIGATMEKLQELIAADLEVSLLWMLLVLEMIVLLSCVKSAMMLILSMNCFPLFFAFRSEDTVNLRLSSFSLVS
KFSADANPTTDPLNVPDGPITRSKAKKIQEIFIIHLQRLANAHEETKISEAKILYNVNLMSQEKNGAKMAREKLSILRDNTEDKKTRRQISEAKILYNIKHIGIQ
ELLFLNTFGVASFKSCFRFDVKPIHWISLDQANSGLCKPEQPWRLSFGIRALVDILWPALEDGVPQVPQDPNTVILQAIQRMMEMMMEDRQERRAQQQKEERALQ
EDDAFDLAEQERQVGGRRNGRGRGRNNFANVMQPRKLERLSIMEFKNFCSLTHLVLLLSNLALGNSFSILSELWTLAREASSFDLYRTQNKEEELPDLSQQKALS
LKILPYVPNRRSHKHDPETQEDSEEDIVVVFEGNLLKKRSSKALFWLMLETEFYSYHKPLLVVLEPRVTIGYMSMHDYSQEITTNSSTFMDATERDVISLLKLEL
LAPCEGPAHALLYSQLSQCSLCSWFVGSSANHLFRSDVENLEIDGHKPLGDEGVVAEDSWGASIRPDGSLPKG