; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g12460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g12460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:9571244..9575655
RNA-Seq ExpressionMoc06g12460
SyntenyMoc06g12460
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135942.1 uncharacterized protein LOC111007775 [Momordica charantia]2.5e-1432.12Show/hide
Query:  KWPDFEEVIVYLWAIWDKRNAKALNKGGDGF------------------------------STSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIR
        KW   EE+ V+LWAIW+ RN +++   G GF                                      W  P++  +K+N DA+F   + +AGL I+IR
Subjt:  KWPDFEEVIVYLWAIWDKRNAKALNKGGDGF------------------------------STSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIR

Query:  DSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE
        DS   VL +   ++ H   V + E LAA+EG+ LA++ G+ P Q+E +S ++FNL   +  D SE
Subjt:  DSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE

XP_022139684.1 uncharacterized protein LOC111010533 [Momordica charantia]3.4e-2742.77Show/hide
Query:  WPDFEEVIVYLWAIWDKRNAKALNKG--------------------------------GDGFSTS------PNHVVWKSPISGVYKINTDASFNPLDLNA
        W DFEE++V+LW++W++RNA   NK                                    F  S       NH +W     GV+K+ TDASF+ +D NA
Subjt:  WPDFEEVIVYLWAIWDKRNAKALNKG--------------------------------GDGFSTS------PNHVVWKSPISGVYKINTDASFNPLDLNA

Query:  GLR-IIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE
        GL  IIIRD +GQVLA+ TKYL+H  SVD  EALAA EGL++A++ GISP+ +E +SLRI+NLF  +K  LS+
Subjt:  GLR-IIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia]7.5e-1952.83Show/hide
Query:  GFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHN
        G  T+   V+W  P   +YKINTDASF   D +AGL IIIR+ +GQV+A+ TKYL++  SVD+ EA+ A EGL+LA  IG++P+ +E +S RIFNLF   
Subjt:  GFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHN

Query:  KGDLSE
          DLSE
Subjt:  KGDLSE

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]6.3e-2638.04Show/hide
Query:  CGKKGENSFHLFLDCKFSRSIWNSAKW------------------PDFEEVIVYLWAIWDKRNAKALNKGG-----------------------------
        CG+ GE+S HLF  CKF+ ++W ++K+                   DFEE+ V +W +W++RNA+A N                                
Subjt:  CGKKGENSFHLFLDCKFSRSIWNSAKW------------------PDFEEVIVYLWAIWDKRNAKALNKGG-----------------------------

Query:  DGFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISP
         G  T+   ++W+ P  G+YKINTDASF   D +AGL III + +GQV+AA TKYL++  SVD+ EA+AA EGL+LA +IG+ P
Subjt:  DGFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISP

XP_030497588.1 uncharacterized protein LOC115713245 [Cannabis sativa]5.0e-1528.57Show/hide
Query:  KLEGISAASDAPTLTCSGTTTLRHFSSNPGILLTKSCE--ARAIKECLKLYERASCQMINHDKSRLACSPNAGASLKERMKGILSVSLVDCHHQYLGLPS
        +L+G + +  APT++         F ++  +L  ++ +    +IK  L +Y RAS Q +N DKS ++ SPN    ++   + IL + + DCH +YLGLP+
Subjt:  KLEGISAASDAPTLTCSGTTTLRHFSSNPGILLTKSCE--ARAIKECLKLYERASCQMINHDKSRLACSPNAGASLKERMKGILSVSLVDCHHQYLGLPS

Query:  FMPRNRCGKKGENSFHLFLDCKFSRSIWN-SAKWPD------FEEVIVYLWAIWDKRNAK-----ALNKGGDGFSTSPN----------HVVWKSPISGV
        +  R++          LF D K    IW     W D       +E  +Y W      N       A        STSPN          H  W  P    
Subjt:  FMPRNRCGKKGENSFHLFLDCKFSRSIWN-SAKWPD------FEEVIVYLWAIWDKRNAK-----ALNKGGDGFSTSPN----------HVVWKSPISGV

Query:  YKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLS
         K+N DA+ +      G+ III++S GQV+AA +K L   +    +EA A   G+  A    +S   +E +SL + N    N   +S
Subjt:  YKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLS

TrEMBL top hitse value%identityAlignment
A0A6J1C467 uncharacterized protein LOC1110077751.2e-1432.12Show/hide
Query:  KWPDFEEVIVYLWAIWDKRNAKALNKGGDGF------------------------------STSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIR
        KW   EE+ V+LWAIW+ RN +++   G GF                                      W  P++  +K+N DA+F   + +AGL I+IR
Subjt:  KWPDFEEVIVYLWAIWDKRNAKALNKGGDGF------------------------------STSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIR

Query:  DSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE
        DS   VL +   ++ H   V + E LAA+EG+ LA++ G+ P Q+E +S ++FNL   +  D SE
Subjt:  DSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE

A0A6J1CDQ4 uncharacterized protein LOC1110105331.6e-2742.77Show/hide
Query:  WPDFEEVIVYLWAIWDKRNAKALNKG--------------------------------GDGFSTS------PNHVVWKSPISGVYKINTDASFNPLDLNA
        W DFEE++V+LW++W++RNA   NK                                    F  S       NH +W     GV+K+ TDASF+ +D NA
Subjt:  WPDFEEVIVYLWAIWDKRNAKALNKG--------------------------------GDGFSTS------PNHVVWKSPISGVYKINTDASFNPLDLNA

Query:  GLR-IIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE
        GL  IIIRD +GQVLA+ TKYL+H  SVD  EALAA EGL++A++ GISP+ +E +SLRI+NLF  +K  LS+
Subjt:  GLR-IIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSE

A0A6J1CIF1 uncharacterized protein LOC1110112373.6e-1952.83Show/hide
Query:  GFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHN
        G  T+   V+W  P   +YKINTDASF   D +AGL IIIR+ +GQV+A+ TKYL++  SVD+ EA+ A EGL+LA  IG++P+ +E +S RIFNLF   
Subjt:  GFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHN

Query:  KGDLSE
          DLSE
Subjt:  KGDLSE

A0A6J1DAR4 uncharacterized protein LOC1110189543.1e-2638.04Show/hide
Query:  CGKKGENSFHLFLDCKFSRSIWNSAKW------------------PDFEEVIVYLWAIWDKRNAKALNKGG-----------------------------
        CG+ GE+S HLF  CKF+ ++W ++K+                   DFEE+ V +W +W++RNA+A N                                
Subjt:  CGKKGENSFHLFLDCKFSRSIWNSAKW------------------PDFEEVIVYLWAIWDKRNAKALNKGG-----------------------------

Query:  DGFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISP
         G  T+   ++W+ P  G+YKINTDASF   D +AGL III + +GQV+AA TKYL++  SVD+ EA+AA EGL+LA +IG+ P
Subjt:  DGFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISP

A0A803Q5N4 Uncharacterized protein1.4e-1531.11Show/hide
Query:  ARAIKECLKLYERASCQMINHDKSRLACSPNAGASLKERMKGILSVSLVDCHHQYLGLPSFMPRNRCGK---------KGENSFHLFL------DCKFSR
        ARAI  CL LY RAS QM+N +KS L+ SPN  +S +   + +L++ +  CH QYLGLPSF  R++            K  N++   L      +   SR
Subjt:  ARAIKECLKLYERASCQMINHDKSRLACSPNAGASLKERMKGILSVSLVDCHHQYLGLPSFMPRNRCGK---------KGENSFHLFL------DCKFSR

Query:  SIW------NSAKWPDFE---------------EVIVYLWAIWDKRNAKALNKGGDGFSTS-------------------PNHVVWKSPISGVYKINTDA
          W      N+  W +++                + +   +  D  NA A   G      S                   P    W +P SG  K+NTDA
Subjt:  SIW------NSAKWPDFE---------------EVIVYLWAIWDKRNAKALNKGGDGFSTS-------------------PNHVVWKSPISGVYKINTDA

Query:  SFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFN
        + N      GL  I+RDS G VLAA  K +      + +EALA    LKL L + +S   VE +SL + N
Subjt:  SFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEALAAEEGLKLALDIGISPLQVEMNSLRIFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGTGCTCCTTCAAAGGCGAAAAATCAGGATCCTTCATTAATATTAATGGCGTTTCCTAATGGTCAGAACATGGCCAAGGCAATTGGATCCAACGAGCCT
ATAAAATCGACAATAAATGGGGAATCACATCCCATTGTTAGTGGGTCGAAAGATCATATTGGGCCAACTGATGGTAATAGGCCGAAACAAGGTTGGAAAAGGAAG
GAAAGGGGAATCCAGATTAAGGAGATGGACTTGAATTTATCCAACCCGCTAATAGGGAAAAGGAAATTAGAAGGAATTTCTGCAGCAAGTGATGCTCCAACATTG
ACATGCTCTGGCACAACAACGCTCCGACATTTCTCCTCTAATCCTGGCATTCTCCTCACAAAGAGTTGTGAAGCTAGAGCTATCAAGGAGTGCCTCAAGCTCTAT
GAACGTGCTTCTTGTCAGATGATAAATCATGACAAATCAAGGCTAGCTTGCAGCCCAAATGCTGGTGCTTCTTTGAAAGAAAGGATGAAGGGAATCCTCTCAGTT
TCTTTGGTAGATTGCCATCATCAATATCTTGGCCTTCCTTCCTTCATGCCGAGGAATAGATGTGGGAAGAAGGGCGAGAATAGTTTCCATCTATTCCTAGACTGT
AAATTTTCGAGGAGTATATGGAATTCAGCTAAGTGGCCTGATTTTGAAGAAGTGATCGTTTATCTGTGGGCTATATGGGACAAGCGGAATGCTAAAGCTTTGAAT
AAGGGTGGTGATGGGTTTAGCACAAGCCCTAACCATGTGGTTTGGAAATCGCCTATATCAGGAGTCTACAAAATAAATACGGATGCATCTTTTAATCCACTTGAT
TTAAATGCAGGGCTAAGGATTATTATCAGAGATTCGAAAGGGCAAGTTCTGGCTGCTGAAACTAAATACCTGGATCATGCATTCTCGGTGGATGTTGTCGAAGCC
TTAGCAGCGGAGGAGGGTCTGAAGCTAGCACTAGATATTGGAATTTCGCCACTTCAAGTCGAAATGAATTCTTTGCGAATTTTCAATCTATTTCTGCACAATAAA
GGTGATTTATCAGAGGGATGGAAGAAGGTCATTTTGGGGTTGTCTCCCGGAGAGAGATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGTGCTCCTTCAAAGGCGAAAAATCAGGATCCTTCATTAATATTAATGGCGTTTCCTAATGGTCAGAACATGGCCAAGGCAATTGGATCCAACGAGCCT
ATAAAATCGACAATAAATGGGGAATCACATCCCATTGTTAGTGGGTCGAAAGATCATATTGGGCCAACTGATGGTAATAGGCCGAAACAAGGTTGGAAAAGGAAG
GAAAGGGGAATCCAGATTAAGGAGATGGACTTGAATTTATCCAACCCGCTAATAGGGAAAAGGAAATTAGAAGGAATTTCTGCAGCAAGTGATGCTCCAACATTG
ACATGCTCTGGCACAACAACGCTCCGACATTTCTCCTCTAATCCTGGCATTCTCCTCACAAAGAGTTGTGAAGCTAGAGCTATCAAGGAGTGCCTCAAGCTCTAT
GAACGTGCTTCTTGTCAGATGATAAATCATGACAAATCAAGGCTAGCTTGCAGCCCAAATGCTGGTGCTTCTTTGAAAGAAAGGATGAAGGGAATCCTCTCAGTT
TCTTTGGTAGATTGCCATCATCAATATCTTGGCCTTCCTTCCTTCATGCCGAGGAATAGATGTGGGAAGAAGGGCGAGAATAGTTTCCATCTATTCCTAGACTGT
AAATTTTCGAGGAGTATATGGAATTCAGCTAAGTGGCCTGATTTTGAAGAAGTGATCGTTTATCTGTGGGCTATATGGGACAAGCGGAATGCTAAAGCTTTGAAT
AAGGGTGGTGATGGGTTTAGCACAAGCCCTAACCATGTGGTTTGGAAATCGCCTATATCAGGAGTCTACAAAATAAATACGGATGCATCTTTTAATCCACTTGAT
TTAAATGCAGGGCTAAGGATTATTATCAGAGATTCGAAAGGGCAAGTTCTGGCTGCTGAAACTAAATACCTGGATCATGCATTCTCGGTGGATGTTGTCGAAGCC
TTAGCAGCGGAGGAGGGTCTGAAGCTAGCACTAGATATTGGAATTTCGCCACTTCAAGTCGAAATGAATTCTTTGCGAATTTTCAATCTATTTCTGCACAATAAA
GGTGATTTATCAGAGGGATGGAAGAAGGTCATTTTGGGGTTGTCTCCCGGAGAGAGATTGTAA
Protein sequenceShow/hide protein sequence
MGSAPSKAKNQDPSLILMAFPNGQNMAKAIGSNEPIKSTINGESHPIVSGSKDHIGPTDGNRPKQGWKRKERGIQIKEMDLNLSNPLIGKRKLEGISAASDAPTL
TCSGTTTLRHFSSNPGILLTKSCEARAIKECLKLYERASCQMINHDKSRLACSPNAGASLKERMKGILSVSLVDCHHQYLGLPSFMPRNRCGKKGENSFHLFLDC
KFSRSIWNSAKWPDFEEVIVYLWAIWDKRNAKALNKGGDGFSTSPNHVVWKSPISGVYKINTDASFNPLDLNAGLRIIIRDSKGQVLAAETKYLDHAFSVDVVEA
LAAEEGLKLALDIGISPLQVEMNSLRIFNLFLHNKGDLSEGWKKVILGLSPGERL