CuGenDBv2

Moc05g16770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g16770
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr5:12571024..12577799
RNA-Seq Expression: Moc05g16770
Synteny: Moc05g16770
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
                  IPR005162 - Retrotransposon gag domain
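
The genome location field in this record has the form `seqid:start..end`. As a minimal sketch, the snippet below splits that string into its parts and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written on the record page (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


# Matches strings such as "chr5:12571024..12577799".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("chr5:12571024..12577799")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 6776 bp on chr5, which is longer than the CDS listed further down the record.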


Homology
GenBank top hits
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia] (e value: 6.2e-76; % identity: 67.66)
Query:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA
        W      EDHANV + WARFKDLLYDYYY ETVKDMKEAEFLHL QGTL+VAQYERKFTELSRFALELI    MKIKRFV GL KGIRGPVDLQRP +YA
Subjt:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA

Query:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG
        EAVRGALI+DKDVSNKA  L EVGSSSG+KRK  PTYAD   RAPQ                                CGRE HFARECP+SA NTQRLG
Subjt:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG

Query:  QRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG
        QR   TV T+G +QRARV ALTRKEA DAE +VTG
Subjt:  QRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG

XP_022155872.1 uncharacterized protein LOC111022885 [Momordica charantia] (e value: 1.9e-72; % identity: 71.29)
Query:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA
        W      EDHANVPV+WARFKDLLYDYYY ETVKDMKEAEFLHL QGTLTVAQYERKFTELSRFA ELI TE MKIKRFV GLRK IRGPVDLQRP TYA
Subjt:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA

Query:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG
        EAVRGALI+DKDVSN+ QPL+EVGSSSG+KRKV P YADQPFRAPQ                                C REGHFAREC ++AVNTQRLG
Subjt:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG

Query:  QRAPLTVLT
        QRAP TV T
Subjt:  QRAPLTVLT

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e value: 2.1e-84; % identity: 56.21)
Query:  MPPRSSMRLRADVDSALRGKNVVDPPPPP---VGVQAGLELAVYKPSHPDTFILPRAKPNSSSISSITDFVPLTEEMR-----ERQQQKNGS--------
        MPPR SMRLRAD D A  G   V  PPP            +  +K   P TF        S   +++ +++   E +      E Q +  G+        
Subjt:  MPPRSSMRLRADVDSALRGKNVVDPPPPP---VGVQAGLELAVYKPSHPDTFILPRAKPNSSSISSITDFVPLTEEMR-----ERQQQKNGS--------

Query:  -ESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPT
           W      ED+ANVP+ WARFK+LLYDYYY ETVKDMKEAEFLHL QGTL+VAQYERKFTELSRFALELI TE +KIKRFV GLRKGIRGPVDLQRPT
Subjt:  -ESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPT

Query:  TYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQ
        TYAEAVRGAL++DKDVSNKA PL EVGSSSG+KRK P TYAD   RAPQ                                CGREGHFARECP+SA NTQ
Subjt:  TYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQ

Query:  RLGQRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG
        RLGQR P  V T+G +QRARV ALTRKEA DAE +VTG
Subjt:  RLGQRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia] (e value: 1.1e-69; % identity: 56.23)
Query:  MPPRSSMRLRADVDSALRGKNVVDPPPPPVGVQAGL----------ELA------------------VYKPSHPDTFILPRAKPNSSSISSITDFVPLTE
        MPPR SMRLRADVD A  G+NV DPPPPP+G QAG+          ELA                  ++ P     FI    +    +    ++   L E
Subjt:  MPPRSSMRLRADVDSALRGKNVVDPPPPPVGVQAGL----------ELA------------------VYKPSHPDTFILPRAKPNSSSISSITDFVPLTE

Query:  E-MRERQQ--------------------QKNGSESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFA
        E +RE +                     +      W     TEDHANVPV WARFK+LLYD+YY ETV+DMKE EFLHL QGTLTVAQYERKFTELS FA
Subjt:  E-MRERQQ--------------------QKNGSESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFA

Query:  LELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ
        LELI TE MKIKRFV GL KGIRG VDLQRP TYAEAVRG LI+DKDVSN+ QPL+EVGSS G+KRKVPPTYADQPFRAPQ
Subjt:  LELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ

XP_022159077.1 uncharacterized protein LOC111025517 [Momordica charantia] (e value: 1.7e-70; % identity: 67.45)
Query:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA
        W      EDHANVP++WARFKDLLYDYYY +T+KDMKEAEFLH + GTLTVAQYERKFTELS FA ELI TE MKIKRFV GLRKGIRGPVDLQRP TYA
Subjt:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA

Query:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG
        EAVRG LI+D DVSN  QPL+EVGSSSG+KRKV P YADQPFRAPQ                                CGREGHFAREC ++A NTQRLG
Subjt:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG

Query:  QRAPLTVLTKGG
        QRA  TV T+GG
Subjt:  QRAPLTVLTKGG

TrEMBL top hits
A0A6J1DL73 uncharacterized protein LOC111022144 (e value: 3.0e-76; % identity: 67.66)
Query:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA
        W      EDHANV + WARFKDLLYDYYY ETVKDMKEAEFLHL QGTL+VAQYERKFTELSRFALELI    MKIKRFV GL KGIRGPVDLQRP +YA
Subjt:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA

Query:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG
        EAVRGALI+DKDVSNKA  L EVGSSSG+KRK  PTYAD   RAPQ                                CGRE HFARECP+SA NTQRLG
Subjt:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG

Query:  QRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG
        QR   TV T+G +QRARV ALTRKEA DAE +VTG
Subjt:  QRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG

A0A6J1DQJ4 uncharacterized protein LOC111022885 (e value: 9.0e-73; % identity: 71.29)
Query:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA
        W      EDHANVPV+WARFKDLLYDYYY ETVKDMKEAEFLHL QGTLTVAQYERKFTELSRFA ELI TE MKIKRFV GLRK IRGPVDLQRP TYA
Subjt:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA

Query:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG
        EAVRGALI+DKDVSN+ QPL+EVGSSSG+KRKV P YADQPFRAPQ                                C REGHFAREC ++AVNTQRLG
Subjt:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG

Query:  QRAPLTVLT
        QRAP TV T
Subjt:  QRAPLTVLT

A0A6J1DUM2 uncharacterized protein LOC111023247 (e value: 1.0e-84; % identity: 56.21)
Query:  MPPRSSMRLRADVDSALRGKNVVDPPPPP---VGVQAGLELAVYKPSHPDTFILPRAKPNSSSISSITDFVPLTEEMR-----ERQQQKNGS--------
        MPPR SMRLRAD D A  G   V  PPP            +  +K   P TF        S   +++ +++   E +      E Q +  G+        
Subjt:  MPPRSSMRLRADVDSALRGKNVVDPPPPP---VGVQAGLELAVYKPSHPDTFILPRAKPNSSSISSITDFVPLTEEMR-----ERQQQKNGS--------

Query:  -ESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPT
           W      ED+ANVP+ WARFK+LLYDYYY ETVKDMKEAEFLHL QGTL+VAQYERKFTELSRFALELI TE +KIKRFV GLRKGIRGPVDLQRPT
Subjt:  -ESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPT

Query:  TYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQ
        TYAEAVRGAL++DKDVSNKA PL EVGSSSG+KRK P TYAD   RAPQ                                CGREGHFARECP+SA NTQ
Subjt:  TYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQ

Query:  RLGQRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG
        RLGQR P  V T+G +QRARV ALTRKEA DAE +VTG
Subjt:  RLGQRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTG

A0A6J1DVA0 uncharacterized protein LOC111023424 (e value: 5.5e-70; % identity: 56.23)
Query:  MPPRSSMRLRADVDSALRGKNVVDPPPPPVGVQAGL----------ELA------------------VYKPSHPDTFILPRAKPNSSSISSITDFVPLTE
        MPPR SMRLRADVD A  G+NV DPPPPP+G QAG+          ELA                  ++ P     FI    +    +    ++   L E
Subjt:  MPPRSSMRLRADVDSALRGKNVVDPPPPPVGVQAGL----------ELA------------------VYKPSHPDTFILPRAKPNSSSISSITDFVPLTE

Query:  E-MRERQQ--------------------QKNGSESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFA
        E +RE +                     +      W     TEDHANVPV WARFK+LLYD+YY ETV+DMKE EFLHL QGTLTVAQYERKFTELS FA
Subjt:  E-MRERQQ--------------------QKNGSESWKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFA

Query:  LELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ
        LELI TE MKIKRFV GL KGIRG VDLQRP TYAEAVRG LI+DKDVSN+ QPL+EVGSS G+KRKVPPTYADQPFRAPQ
Subjt:  LELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYAEAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ

A0A6J1DYU5 uncharacterized protein LOC111025517 (e value: 8.4e-71; % identity: 67.45)
Query:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA
        W      EDHANVP++WARFKDLLYDYYY +T+KDMKEAEFLH + GTLTVAQYERKFTELS FA ELI TE MKIKRFV GLRKGIRGPVDLQRP TYA
Subjt:  WKPFTPTEDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYA

Query:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG
        EAVRG LI+D DVSN  QPL+EVGSSSG+KRKV P YADQPFRAPQ                                CGREGHFAREC ++A NTQRLG
Subjt:  EAVRGALIIDKDVSNKAQPLLEVGSSSGLKRKVPPTYADQPFRAPQ--------------------------------CGREGHFARECPISAVNTQRLG

Query:  QRAPLTVLTKGG
        QRA  TV T+GG
Subjt:  QRAPLTVLTKGG
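
The alignments above carry a middle line between each Query and Subjt row. Assuming it follows the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), the sketch below counts identities and positives for one alignment block. `midline_stats` is a hypothetical helper; it is not how the % identity column on this page was computed, and a single block will generally not reproduce the whole-hit figure.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    Assumes BLAST-style midline symbols: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces on the midline are often lost in copied text,
    # so pad it back out to the length of the query row.
    midline = midline.ljust(len(query))
    length = len(query)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / length, 2) if length else 0.0,
    }


# Final block of the last hit shown above.
print(midline_stats("QRAPLTVLTKGG", "QRA  TV T+GG"))
```

To approximate a per-hit value, the counts from every block of that hit would need to be summed before dividing by the total alignment length.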

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTTGTTGGAGTTGCTTATCTCGTGTTTAGAGTTAAGGCTAGCCAGACAATGCCACCCCGTAGTAGTATGAGGTTGCGTGCAGACGTCGACTCAGCTCTTAGAGGCAA
GAATGTGGTAGACCCACCGCCCCCTCCTGTTGGTGTTCAGGCAGGGCTGGAGTTGGCGGTGTACAAGCCCAGCCACCCCGACACTTTCATACTCCCTAGAGCGAAGCCCA
ATTCATCAAGCATTTCAAGCATTACGGACTTCGTACCTTTGACGGAGGAAATGAGAGAGCGACAGCAGCAGAAGAATGGGTCAGAGAGTTGGAAGCCCTTTACGCCGACA
GAAGATCATGCTAATGTACCAGTTTCGTGGGCAAGGTTCAAGGACCTGTTGTACGACTACTATTACTCGGAGACTGTGAAAGATATGAAGGAGGCAGAATTCCTCCATCT
CGCCCAAGGAACCTTAACGGTAGCACAATATGAGAGGAAGTTTACGGAACTCTCCCGTTTTGCTCTGGAGTTAATTCACACTGAGGAAATGAAGATCAAAAGGTTTGTTA
ATGGCTTGCGCAAGGGGATCAGAGGACCAGTGGACCTTCAGCGACCCACCACCTACGCGGAAGCAGTTAGGGGCGCATTAATTATTGATAAAGATGTCTCTAACAAGGCC
CAACCTTTACTAGAAGTAGGTTCGTCTTCAGGTTTGAAAAGGAAAGTCCCTCCGACTTATGCCGACCAGCCATTTAGAGCACCCCAGTGCGGCAGAGAGGGGCATTTTGC
AAGGGAGTGTCCCATATCGGCCGTGAATACCCAGAGGCTAGGCCAAAGGGCTCCCTTAACAGTTTTGACGAAGGGAGGTGACCAGAGGGCTCGTGTCTCCGCACTTACCC
GTAAGGAAGCGACGGATGCCGAAGCCATTGTCACAGGCGAGTTGGACAGTTCCAAGGTGGAGTTGGTGGGAGAAGATGTTTCTGCAGTGTTAGCTCGACTCTCGGTGGAA
CCCACCTTAAGACAGCGGGTCATCGCTGCACAAAGTGGAGATCCCAGCCTAAGCAAGAGTTTCGGTATGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGA
CGGAAAACCCCTCATTGGAGGCGCTAGGCGCCTGTTAGATGAAACCTTGTGCTATAGGGAGGTACCCATTGAGATCTTAGCAAAACAGACAAAGGTGCTGCGGAATAGGG
CGATTGACTTGGTGAAGGTCTTGTGGAGGAATCACCAAGTGGAGGAAGCTACCTCGGAAAGGGAATATGAGACCAGAGCCCGATATCCAGAGTTGTTCGATCAACGAACT
TTTGAGGACGAAAGTTTTTGA
mRNA sequence
ATGGTTGTTGGAGTTGCTTATCTCGTGTTTAGAGTTAAGGCTAGCCAGACAATGCCACCCCGTAGTAGTATGAGGTTGCGTGCAGACGTCGACTCAGCTCTTAGAGGCAA
GAATGTGGTAGACCCACCGCCCCCTCCTGTTGGTGTTCAGGCAGGGCTGGAGTTGGCGGTGTACAAGCCCAGCCACCCCGACACTTTCATACTCCCTAGAGCGAAGCCCA
ATTCATCAAGCATTTCAAGCATTACGGACTTCGTACCTTTGACGGAGGAAATGAGAGAGCGACAGCAGCAGAAGAATGGGTCAGAGAGTTGGAAGCCCTTTACGCCGACA
GAAGATCATGCTAATGTACCAGTTTCGTGGGCAAGGTTCAAGGACCTGTTGTACGACTACTATTACTCGGAGACTGTGAAAGATATGAAGGAGGCAGAATTCCTCCATCT
CGCCCAAGGAACCTTAACGGTAGCACAATATGAGAGGAAGTTTACGGAACTCTCCCGTTTTGCTCTGGAGTTAATTCACACTGAGGAAATGAAGATCAAAAGGTTTGTTA
ATGGCTTGCGCAAGGGGATCAGAGGACCAGTGGACCTTCAGCGACCCACCACCTACGCGGAAGCAGTTAGGGGCGCATTAATTATTGATAAAGATGTCTCTAACAAGGCC
CAACCTTTACTAGAAGTAGGTTCGTCTTCAGGTTTGAAAAGGAAAGTCCCTCCGACTTATGCCGACCAGCCATTTAGAGCACCCCAGTGCGGCAGAGAGGGGCATTTTGC
AAGGGAGTGTCCCATATCGGCCGTGAATACCCAGAGGCTAGGCCAAAGGGCTCCCTTAACAGTTTTGACGAAGGGAGGTGACCAGAGGGCTCGTGTCTCCGCACTTACCC
GTAAGGAAGCGACGGATGCCGAAGCCATTGTCACAGGCGAGTTGGACAGTTCCAAGGTGGAGTTGGTGGGAGAAGATGTTTCTGCAGTGTTAGCTCGACTCTCGGTGGAA
CCCACCTTAAGACAGCGGGTCATCGCTGCACAAAGTGGAGATCCCAGCCTAAGCAAGAGTTTCGGTATGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGA
CGGAAAACCCCTCATTGGAGGCGCTAGGCGCCTGTTAGATGAAACCTTGTGCTATAGGGAGGTACCCATTGAGATCTTAGCAAAACAGACAAAGGTGCTGCGGAATAGGG
CGATTGACTTGGTGAAGGTCTTGTGGAGGAATCACCAAGTGGAGGAAGCTACCTCGGAAAGGGAATATGAGACCAGAGCCCGATATCCAGAGTTGTTCGATCAACGAACT
TTTGAGGACGAAAGTTTTTGA
Protein sequence
MVVGVAYLVFRVKASQTMPPRSSMRLRADVDSALRGKNVVDPPPPPVGVQAGLELAVYKPSHPDTFILPRAKPNSSSISSITDFVPLTEEMRERQQQKNGSESWKPFTPT
EDHANVPVSWARFKDLLYDYYYSETVKDMKEAEFLHLAQGTLTVAQYERKFTELSRFALELIHTEEMKIKRFVNGLRKGIRGPVDLQRPTTYAEAVRGALIIDKDVSNKA
QPLLEVGSSSGLKRKVPPTYADQPFRAPQCGREGHFARECPISAVNTQRLGQRAPLTVLTKGGDQRARVSALTRKEATDAEAIVTGELDSSKVELVGEDVSAVLARLSVE
PTLRQRVIAAQSGDPSLSKSFGMDHLGVQINQKKQKDGKPLIGGARRLLDETLCYREVPIEILAKQTKVLRNRAIDLVKVLWRNHQVEEATSEREYETRARYPELFDQRT
FEDESF
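
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is one way to do that with the standard genetic code, built from the conventional TCAG codon ordering. `translate` is a hypothetical helper, and the use of the standard nuclear code for this gene is an assumption.

```python
from itertools import product

# Standard genetic code in TCAG order (first base varies slowest).
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from pasted text
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS shown in this record.
print(translate("ATGGTTGTTGGAGTTGCT"))     # start of the CDS
print(translate("TTTGAGGACGAAAGTTTTTGA"))  # end of the CDS, including the stop
```

Passing the full CDS text to `translate` and comparing the result with the listed protein sequence gives a quick consistency check on the record.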