CuGenDBv2

Moc06g31240 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g31240
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: zf-RVT domain-containing protein
Genome location: chr6:23509892..23512483
RNA-Seq Expression: Moc06g31240
Synteny: Moc06g31240
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
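
The genome location string can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr6:23509892..23512483")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the annotated locus spans 2592 bp.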


Homology
GenBank top hits | e value | % identity | Alignment
XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] | 7.1e-20 | 70
Query:  EEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE
        EEG  +AE+ N+++DDALD+E EPDVE VH EI RDE AV+ +GC+GLTG  N E LQLIVQSSGTNDV EG+VFDTKKE
Subjt:  EEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE

XP_022154947.1 uncharacterized protein LOC111022090 [Momordica charantia] | 8.1e-24 | 69.51
Query:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTE
        G+D+ GLTPL SDVV CNL DDRVCDW+VPG+WNDN+DESDESYDPL E EEG  +AE+ N+++DDALD+E EPDVE V  +
Subjt:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTE

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia] | 1.1e-41 | 68.46
Query:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTG
        G+D+ GLTPL SDVV CNL DDRVC W++PG+WNDN+DESDESYD L + EEG  +AE+ N+++DDA D++ EPDVE V  EIRRDE  V  +GC+GL G
Subjt:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTG

Query:  HLNDEKLQLIVQSSGTNDVNEGDVFDTKKE
          NDEKLQLIVQSSGTNDV EG VFDTKKE
Subjt:  HLNDEKLQLIVQSSGTNDVNEGDVFDTKKE

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia] | 4.1e-52 | 90.68
Query:  DVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQ
        +VV CNL DDRVCDWDVPGVWNDNEDES ESYDPLAE +EGHSQAEYGNEEHDDALDDELEPDVE VHTEIRRDEEAVRP GCNGLTG  NDEKLQLIVQ
Subjt:  DVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQ

Query:  SSGTNDVNEGDVFDTKKE
        SSGTNDVNEGDVFD KKE
Subjt:  SSGTNDVNEGDVFDTKKE

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia] | 1.5e-57 | 86.67
Query:  VVVFGGNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGC
        +++  G+DVEGLTPLGSDVV CNL DDRVCDWDVPGVWNDNEDES ESYDPLA  EEGHSQAEYGNEEHDDALDDELE DVE VHTEIRRDEEAVR  GC
Subjt:  VVVFGGNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGC

Query:  NGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE
        NGLTG  NDEKLQLIVQSSGTNDVNEGDVFD KKE
Subjt:  NGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DJT1 uncharacterized protein LOC111020715 | 3.5e-20 | 70
Query:  EEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE
        EEG  +AE+ N+++DDALD+E EPDVE VH EI RDE AV+ +GC+GLTG  N E LQLIVQSSGTNDV EG+VFDTKKE
Subjt:  EEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE

A0A6J1DN26 uncharacterized protein LOC111022090 | 3.9e-24 | 69.51
Query:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTE
        G+D+ GLTPL SDVV CNL DDRVCDW+VPG+WNDN+DESDESYDPL E EEG  +AE+ N+++DDALD+E EPDVE V  +
Subjt:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTE

A0A6J1DP00 uncharacterized protein LOC111022954 | 5.5e-42 | 68.46
Query:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTG
        G+D+ GLTPL SDVV CNL DDRVC W++PG+WNDN+DESDESYD L + EEG  +AE+ N+++DDA D++ EPDVE V  EIRRDE  V  +GC+GL G
Subjt:  GNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTG

Query:  HLNDEKLQLIVQSSGTNDVNEGDVFDTKKE
          NDEKLQLIVQSSGTNDV EG VFDTKKE
Subjt:  HLNDEKLQLIVQSSGTNDVNEGDVFDTKKE

A0A6J1DQB9 Reverse transcriptase | 2.0e-52 | 90.68
Query:  DVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQ
        +VV CNL DDRVCDWDVPGVWNDNEDES ESYDPLAE +EGHSQAEYGNEEHDDALDDELEPDVE VHTEIRRDEEAVRP GCNGLTG  NDEKLQLIVQ
Subjt:  DVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQ

Query:  SSGTNDVNEGDVFDTKKE
        SSGTNDVNEGDVFD KKE
Subjt:  SSGTNDVNEGDVFDTKKE

A0A6J1DTG5 uncharacterized protein LOC111023843 | 7.1e-58 | 86.67
Query:  VVVFGGNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGC
        +++  G+DVEGLTPLGSDVV CNL DDRVCDWDVPGVWNDNEDES ESYDPLA  EEGHSQAEYGNEEHDDALDDELE DVE VHTEIRRDEEAVR  GC
Subjt:  VVVFGGNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEGHSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGC

Query:  NGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE
        NGLTG  NDEKLQLIVQSSGTNDVNEGDVFD KKE
Subjt:  NGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein | 1.2e-09 | 26.67
Query:  DKIIWGPDKKGCFSVKSAYHVGMKSQFQAEASSSNLDNHRRVRNCLWNADALPKAKISCSRTLNDILPTNSNLFKKGIRANHLCVLCRKEEETSEHLFWE
        DKIIW  +  G ++V+S Y +          + +       ++  +WN   +PK K    R L+  L T   L  +G+R +  C  C +E E+  H  + 
Subjt:  DKIIWGPDKKGCFSVKSAYHVGMKSQFQAEASSSNLDNHRRVRNCLWNADALPKAKISCSRTLNDILPTNSNLFKKGIRANHLCVLCRKEEETSEHLFWE

Query:  SPRGAFCC-LTSFSLNKPTLSCRDYWDEILAISGSPCITK--------AIILVWKIWGFRNVVVF
         P       L+  SL +  L   D+ + I  I      T          + L+W+IW  RN VVF
Subjt:  SPRGAFCC-LTSFSLNKPTLSCRDYWDEILAISGSPCITK--------AIILVWKIWGFRNVVVF

AT3G25270.1 Ribonuclease H-like superfamily protein | 4.8e-06 | 26.05
Query:  LWNADALPKAKISCSRTLNDILPTNSNLFKKGIRANHLCVLCRKEEETSEHLFWESPRGAFCCLTSFSLNKPTLSCRDYWDEILAISGSPCITK------
        +W     PK K    + L+  L T  NL ++ IR +  C  C +E+ETS+HLF++          S   ++   +     +  + +  S C+        
Subjt:  LWNADALPKAKISCSRTLNDILPTNSNLFKKGIRANHLCVLCRKEEETSEHLFWESPRGAFCCLTSFSLNKPTLSCRDYWDEILAISGSPCITK------

Query:  --AIILVWKIWGFRNVVVF
          AI ++W++W  RN +VF
Subjt:  --AIILVWKIWGFRNVVVF


Sequences
CDS sequence
ATGCTGACCAAACAGAGTTGGGATGCTAATGACAAAATTATTTGGGGTCCAGATAAGAAAGGTTGTTTCTCGGTGAAAAGTGCTTATCACGTGGGGATGAAGTCACAATT
TCAGGCGGAAGCTTCCTCTTCAAATCTAGATAATCATAGGCGTGTGAGGAATTGTCTTTGGAATGCTGATGCCCTTCCAAAAGCAAAGATTTCATGCTCGAGGACCTTAA
ATGATATCCTTCCAACAAACTCTAATTTATTTAAGAAAGGGATTCGAGCTAACCACTTATGTGTCTTATGTAGGAAGGAAGAGGAGACATCCGAACACCTATTTTGGGAG
TCACCTCGAGGTGCATTTTGTTGTCTTACTTCGTTTTCGCTGAACAAGCCTACGCTATCTTGTCGGGACTACTGGGATGAGATCCTAGCTATCAGTGGCAGCCCATGCAT
CACAAAAGCTATAATTTTAGTCTGGAAAATATGGGGATTTAGAAATGTGGTGGTGTTTGGAGGTAATGATGTAGAGGGTTTAACACCATTAGGGTCCGATGTTGTTTCAT
GTAATCTGAGAGATGACAGGGTGTGTGATTGGGATGTGCCGGGAGTGTGGAATGATAACGAAGATGAAAGTGATGAATCATATGACCCGTTGGCAGAGTTTGAAGAAGGA
CACTCTCAAGCAGAATATGGGAACGAAGAGCATGACGATGCTCTTGATGATGAGCTTGAGCCTGATGTGGAACCGGTGCATACTGAGATTCGCAGGGATGAAGAAGCGGT
CCGGCCACTAGGATGTAATGGTCTCACCGGACACCTTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTATTTGATA
CTAAGAAGGAGTAG
mRNA sequence
ATGCTGACCAAACAGAGTTGGGATGCTAATGACAAAATTATTTGGGGTCCAGATAAGAAAGGTTGTTTCTCGGTGAAAAGTGCTTATCACGTGGGGATGAAGTCACAATT
TCAGGCGGAAGCTTCCTCTTCAAATCTAGATAATCATAGGCGTGTGAGGAATTGTCTTTGGAATGCTGATGCCCTTCCAAAAGCAAAGATTTCATGCTCGAGGACCTTAA
ATGATATCCTTCCAACAAACTCTAATTTATTTAAGAAAGGGATTCGAGCTAACCACTTATGTGTCTTATGTAGGAAGGAAGAGGAGACATCCGAACACCTATTTTGGGAG
TCACCTCGAGGTGCATTTTGTTGTCTTACTTCGTTTTCGCTGAACAAGCCTACGCTATCTTGTCGGGACTACTGGGATGAGATCCTAGCTATCAGTGGCAGCCCATGCAT
CACAAAAGCTATAATTTTAGTCTGGAAAATATGGGGATTTAGAAATGTGGTGGTGTTTGGAGGTAATGATGTAGAGGGTTTAACACCATTAGGGTCCGATGTTGTTTCAT
GTAATCTGAGAGATGACAGGGTGTGTGATTGGGATGTGCCGGGAGTGTGGAATGATAACGAAGATGAAAGTGATGAATCATATGACCCGTTGGCAGAGTTTGAAGAAGGA
CACTCTCAAGCAGAATATGGGAACGAAGAGCATGACGATGCTCTTGATGATGAGCTTGAGCCTGATGTGGAACCGGTGCATACTGAGATTCGCAGGGATGAAGAAGCGGT
CCGGCCACTAGGATGTAATGGTCTCACCGGACACCTTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTATTTGATA
CTAAGAAGGAGTAG
Protein sequence
MLTKQSWDANDKIIWGPDKKGCFSVKSAYHVGMKSQFQAEASSSNLDNHRRVRNCLWNADALPKAKISCSRTLNDILPTNSNLFKKGIRANHLCVLCRKEEETSEHLFWE
SPRGAFCCLTSFSLNKPTLSCRDYWDEILAISGSPCITKAIILVWKIWGFRNVVVFGGNDVEGLTPLGSDVVSCNLRDDRVCDWDVPGVWNDNEDESDESYDPLAEFEEG
HSQAEYGNEEHDDALDDELEPDVEPVHTEIRRDEEAVRPLGCNGLTGHLNDEKLQLIVQSSGTNDVNEGDVFDTKKE
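
As a consistency check, the CDS can be translated and compared with the protein sequence. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which is a reasonable guess for a plant nuclear gene but is not stated on this page. The `translate` helper is a hypothetical name, and the two inputs are the first ten codons and the last nine codons (including the stop) copied from the CDS above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the CDS: should match the start of the protein sequence.
print(translate("ATGCTGACCAAACAGAGTTGGGATGCTAAT"))
# End of the CDS (in frame, ending in the TAG stop codon).
print(translate("GATGTATTTGATACTAAGAAGGAGTAG"))
```

To check the whole record, the full CDS lines can be joined and passed to `translate`, and the result compared with the joined protein lines.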