; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g20130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g20130
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr10:14865270..14871262
RNA-Seq ExpressionMoc10g20130
SyntenyMoc10g20130
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149799.1 uncharacterized protein LOC111018145 [Momordica charantia]2.9e-4450.94Show/hide
Query:  PTLEGENVADPLVPLA----GAQVALLAKVLQALINNIVGGGDAQAQPPRY-------------FK----------------------------------
        P       A PLVP A      Q+ LL + LQA+INN  G G  QAQPP++             FK                                  
Subjt:  PTLEGENVADPLVPLA----GAQVALLAKVLQALINNIVGGGDAQAQPPRY-------------FK----------------------------------

Query:  -LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVR
          F+VKGAV MLRGEALN WDS+  AEDHANVPI W +FKD LYDYY+PE VKD+ EAEFLH  QG ++VA YERKF +LS FAL+LIPTEA+KIKRFV+
Subjt:  -LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVR

Query:  GLCKGIRGPVDL
        GL KGIRGPVDL
Subjt:  GLCKGIRGPVDL

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]4.2e-4378.18Show/hide
Query:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL
        F+VKGAVFMLRGEALNWWDSV AAED+ANVPI W +FK+ LYDYY+PE VKD+ EAEFLH  QG ++VA YERKF +LSRFAL+LIPTEA+KIKRFV+GL
Subjt:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL

Query:  CKGIRGPVDL
         KGIRGPVDL
Subjt:  CKGIRGPVDL

XP_022156330.1 uncharacterized protein LOC111023250 [Momordica charantia]4.5e-4579.09Show/hide
Query:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL
        F+VKGAVFMLRG+ALNWWDSV AAEDHAN+P+TW +FKD LYDYY+PE VKD+ EAEFLHF+QG +TVA YERKF +LSRFA +LIPTEA+KIKRFV+GL
Subjt:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL

Query:  CKGIRGPVDL
         KGIRGPVDL
Subjt:  CKGIRGPVDL

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]7.4e-4850.44Show/hide
Query:  MPHRRSIRLHANVNPTLEGENVADPLVPLAGAQVALL------AKVLQALINNIVGGGDAQAQPPRY-------------FK------------------
        MP RRS+RL A+V+P   GENVADP  P  G Q  ++      A    ALINN  G G AQ QPPR+             FK                  
Subjt:  MPHRRSIRLHANVNPTLEGENVADPLVPLAGAQVALL------AKVLQALINNIVGGGDAQAQPPRY-------------FK------------------

Query:  -----------------LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFA
                          F+VKGAVFMLR EALNWWDSV A EDHANVP+ W +FK+ LYD+Y+ E V+D+ E EFLH  QG +TVA YERKF +LS FA
Subjt:  -----------------LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFA

Query:  LDLIPTEAVKIKRFVRGLCKGIRGPVDL
        L+LIPTEA+KIKRFV+GL KGIRG VDL
Subjt:  LDLIPTEAVKIKRFVRGLCKGIRGPVDL

XP_022158637.1 uncharacterized protein LOC111025088 [Momordica charantia]1.8e-4955.1Show/hide
Query:  VPLAGAQVALLAKVLQALINNIVGGGDAQAQPPRYFKL------------------------------------------------FRVKGAVFMLRGEA
        VP    QVALLA+ LQALINN  G G AQA PPR+F                                                  F+VKG VFMLRGEA
Subjt:  VPLAGAQVALLAKVLQALINNIVGGGDAQAQPPRYFKL------------------------------------------------FRVKGAVFMLRGEA

Query:  LNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGLCKGIRGPVDL
        LNWWDS+  AEDHANVP+ W +FKD LYDYY+PE VKD  EAEFLH  QG +TVA YERKF +LSRFAL+ IPTEA+KIKRFV+GL KGIRGPVDL
Subjt:  LNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGLCKGIRGPVDL

TrEMBL top hitse value%identityAlignment
A0A6J1D841 uncharacterized protein LOC1110181451.4e-4450.94Show/hide
Query:  PTLEGENVADPLVPLA----GAQVALLAKVLQALINNIVGGGDAQAQPPRY-------------FK----------------------------------
        P       A PLVP A      Q+ LL + LQA+INN  G G  QAQPP++             FK                                  
Subjt:  PTLEGENVADPLVPLA----GAQVALLAKVLQALINNIVGGGDAQAQPPRY-------------FK----------------------------------

Query:  -LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVR
          F+VKGAV MLRGEALN WDS+  AEDHANVPI W +FKD LYDYY+PE VKD+ EAEFLH  QG ++VA YERKF +LS FAL+LIPTEA+KIKRFV+
Subjt:  -LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVR

Query:  GLCKGIRGPVDL
        GL KGIRGPVDL
Subjt:  GLCKGIRGPVDL

A0A6J1DQ01 uncharacterized protein LOC1110232502.2e-4579.09Show/hide
Query:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL
        F+VKGAVFMLRG+ALNWWDSV AAEDHAN+P+TW +FKD LYDYY+PE VKD+ EAEFLHF+QG +TVA YERKF +LSRFA +LIPTEA+KIKRFV+GL
Subjt:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL

Query:  CKGIRGPVDL
         KGIRGPVDL
Subjt:  CKGIRGPVDL

A0A6J1DUM2 uncharacterized protein LOC1110232472.0e-4378.18Show/hide
Query:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL
        F+VKGAVFMLRGEALNWWDSV AAED+ANVPI W +FK+ LYDYY+PE VKD+ EAEFLH  QG ++VA YERKF +LSRFAL+LIPTEA+KIKRFV+GL
Subjt:  FRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGL

Query:  CKGIRGPVDL
         KGIRGPVDL
Subjt:  CKGIRGPVDL

A0A6J1DVA0 uncharacterized protein LOC1110234243.6e-4850.44Show/hide
Query:  MPHRRSIRLHANVNPTLEGENVADPLVPLAGAQVALL------AKVLQALINNIVGGGDAQAQPPRY-------------FK------------------
        MP RRS+RL A+V+P   GENVADP  P  G Q  ++      A    ALINN  G G AQ QPPR+             FK                  
Subjt:  MPHRRSIRLHANVNPTLEGENVADPLVPLAGAQVALL------AKVLQALINNIVGGGDAQAQPPRY-------------FK------------------

Query:  -----------------LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFA
                          F+VKGAVFMLR EALNWWDSV A EDHANVP+ W +FK+ LYD+Y+ E V+D+ E EFLH  QG +TVA YERKF +LS FA
Subjt:  -----------------LFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFA

Query:  LDLIPTEAVKIKRFVRGLCKGIRGPVDL
        L+LIPTEA+KIKRFV+GL KGIRG VDL
Subjt:  LDLIPTEAVKIKRFVRGLCKGIRGPVDL

A0A6J1DXQ7 uncharacterized protein LOC1110250888.6e-5055.1Show/hide
Query:  VPLAGAQVALLAKVLQALINNIVGGGDAQAQPPRYFKL------------------------------------------------FRVKGAVFMLRGEA
        VP    QVALLA+ LQALINN  G G AQA PPR+F                                                  F+VKG VFMLRGEA
Subjt:  VPLAGAQVALLAKVLQALINNIVGGGDAQAQPPRYFKL------------------------------------------------FRVKGAVFMLRGEA

Query:  LNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGLCKGIRGPVDL
        LNWWDS+  AEDHANVP+ W +FKD LYDYY+PE VKD  EAEFLH  QG +TVA YERKF +LSRFAL+ IPTEA+KIKRFV+GL KGIRGPVDL
Subjt:  LNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALDLIPTEAVKIKRFVRGLCKGIRGPVDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACGAATGCTGAGAAGAGACGATAAATTTGTGCTGGAGAAGCCTAGAGACTATCTGCAGGCCCTCATCGACCAACTCGTCGAGCTCCACTTCACTGCTTGCCACGC
TAGGGCACTGCTAGGAAGGTCAGCTTCTCGCGAGAGCGAAGCCGAGAAGTGCGCCAACAAGGCTGAGGACATGCTGGAGTCTAAACATGTGGCTCAGGTGAAGAGGTGGT
TGTCCTTCGACGAGCTGGAGAAGGCGATGAGGGTCCTACAAGAGTTTAGTGATATGGTTAGGAACATCGCAACCGAGGCTTCGACTTTGCCACTGATGGGAATGGTTCGG
CATAAAACAGAACAGAAGAAGAAAAAGCCTCCCCTGAATTTTTCCTACTCAAGATTTCTCTCTAACCCAATTTCCAAGTTCGTTGCTGTAATGCCCCACGTTACTCAGGC
GCGATGGTCTTCGTTCGTGACAGCAGCAGGTACGGCGGACTGTGGCGTTTTAGGAGCGATTTTCGGCGGCTCGACGTCTGAACAGTCGGGCCCGACGTCCCTTGGTGGCC
CTGAAACCTTCGAAAATGCTAGGAATTGGTTGTTGAGGCTTGTGACCATGAAATGCATGTTGACTTTTGGCTTGTATATTCTTTCTGGGTATACTATGCACGTCACTACT
GGGTGTCGAGGCTCCAAGTATAAAGGGTCGGGGGTCGATAAACCAGTCTTGGTAGAGATGAGTGTCGAGGCTTCGGGTAGAAGCAGTGTGAAGGAGTTAGAGTTCAGCGG
GCCATGTGGTATGAGTGGCATGAGTCTTGATATGTCAGATACTGACCTAGTAACTAGAGTGTTTAGGAGTAGTGGTCCTGGTTGGCCTCCTCGTCATCGCCAGATGATGC
CTCATCGTCGTAGTATAAGACTGCATGCGAACGTTAATCCAACCCTCGAAGGTGAGAATGTGGCAGACCCATTGGTCCCTCTAGCAGGTGCCCAAGTGGCATTGCTCGCG
AAAGTGTTGCAGGCACTGATTAATAACATAGTTGGGGGTGGCGATGCACAAGCTCAGCCACCCCGATATTTCAAGCTCTTCAGAGTCAAAGGTGCAGTCTTCATGTTGAG
GGGCGAAGCCCTAAATTGGTGGGACTCAGTAGTAGCGGCAGAAGACCATGCTAATGTACCGATCACGTGGGTGAAGTTCAAGGACTCGTTGTATGACTACTATTTTCCAG
AGATTGTGAAAGATGTAAATGAGGCGGAGTTTCTCCATTTCACCCAAGGCAATATGACAGTAGCACATTATGAAAGAAAGTTTATGAAACTCTCCCGTTTTGCTCTGGAC
CTAATTCCCACCGAGGCAGTGAAGATCAAAAGGTTTGTTAGAGGTTTATGTAAAGGGATTAGAGGACCAGTTGATCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACACGAATGCTGAGAAGAGACGATAAATTTGTGCTGGAGAAGCCTAGAGACTATCTGCAGGCCCTCATCGACCAACTCGTCGAGCTCCACTTCACTGCTTGCCACGC
TAGGGCACTGCTAGGAAGGTCAGCTTCTCGCGAGAGCGAAGCCGAGAAGTGCGCCAACAAGGCTGAGGACATGCTGGAGTCTAAACATGTGGCTCAGGTGAAGAGGTGGT
TGTCCTTCGACGAGCTGGAGAAGGCGATGAGGGTCCTACAAGAGTTTAGTGATATGGTTAGGAACATCGCAACCGAGGCTTCGACTTTGCCACTGATGGGAATGGTTCGG
CATAAAACAGAACAGAAGAAGAAAAAGCCTCCCCTGAATTTTTCCTACTCAAGATTTCTCTCTAACCCAATTTCCAAGTTCGTTGCTGTAATGCCCCACGTTACTCAGGC
GCGATGGTCTTCGTTCGTGACAGCAGCAGGTACGGCGGACTGTGGCGTTTTAGGAGCGATTTTCGGCGGCTCGACGTCTGAACAGTCGGGCCCGACGTCCCTTGGTGGCC
CTGAAACCTTCGAAAATGCTAGGAATTGGTTGTTGAGGCTTGTGACCATGAAATGCATGTTGACTTTTGGCTTGTATATTCTTTCTGGGTATACTATGCACGTCACTACT
GGGTGTCGAGGCTCCAAGTATAAAGGGTCGGGGGTCGATAAACCAGTCTTGGTAGAGATGAGTGTCGAGGCTTCGGGTAGAAGCAGTGTGAAGGAGTTAGAGTTCAGCGG
GCCATGTGGTATGAGTGGCATGAGTCTTGATATGTCAGATACTGACCTAGTAACTAGAGTGTTTAGGAGTAGTGGTCCTGGTTGGCCTCCTCGTCATCGCCAGATGATGC
CTCATCGTCGTAGTATAAGACTGCATGCGAACGTTAATCCAACCCTCGAAGGTGAGAATGTGGCAGACCCATTGGTCCCTCTAGCAGGTGCCCAAGTGGCATTGCTCGCG
AAAGTGTTGCAGGCACTGATTAATAACATAGTTGGGGGTGGCGATGCACAAGCTCAGCCACCCCGATATTTCAAGCTCTTCAGAGTCAAAGGTGCAGTCTTCATGTTGAG
GGGCGAAGCCCTAAATTGGTGGGACTCAGTAGTAGCGGCAGAAGACCATGCTAATGTACCGATCACGTGGGTGAAGTTCAAGGACTCGTTGTATGACTACTATTTTCCAG
AGATTGTGAAAGATGTAAATGAGGCGGAGTTTCTCCATTTCACCCAAGGCAATATGACAGTAGCACATTATGAAAGAAAGTTTATGAAACTCTCCCGTTTTGCTCTGGAC
CTAATTCCCACCGAGGCAGTGAAGATCAAAAGGTTTGTTAGAGGTTTATGTAAAGGGATTAGAGGACCAGTTGATCTTTAG
Protein sequenceShow/hide protein sequence
MTRMLRRDDKFVLEKPRDYLQALIDQLVELHFTACHARALLGRSASRESEAEKCANKAEDMLESKHVAQVKRWLSFDELEKAMRVLQEFSDMVRNIATEASTLPLMGMVR
HKTEQKKKKPPLNFSYSRFLSNPISKFVAVMPHVTQARWSSFVTAAGTADCGVLGAIFGGSTSEQSGPTSLGGPETFENARNWLLRLVTMKCMLTFGLYILSGYTMHVTT
GCRGSKYKGSGVDKPVLVEMSVEASGRSSVKELEFSGPCGMSGMSLDMSDTDLVTRVFRSSGPGWPPRHRQMMPHRRSIRLHANVNPTLEGENVADPLVPLAGAQVALLA
KVLQALINNIVGGGDAQAQPPRYFKLFRVKGAVFMLRGEALNWWDSVVAAEDHANVPITWVKFKDSLYDYYFPEIVKDVNEAEFLHFTQGNMTVAHYERKFMKLSRFALD
LIPTEAVKIKRFVRGLCKGIRGPVDL