; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g17380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g17380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr4:12812280..12813125
RNA-Seq ExpressionMoc04g17380
SyntenyMoc04g17380
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145060.1 uncharacterized protein LOC111014578 [Momordica charantia]8.0e-2133Show/hide
Query:  MTNLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNT
        M NL K+GI+  P    C +  ET DH L  CK + ++W  L P        N   +      ++ LS     + G+  WA+WNDRNA+   + + +   
Subjt:  MTNLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNT

Query:  KADWILEYVNEFS-------GACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRY--GIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDG
        ++DWIL YV +F        G  +    A       +N W PPP G  KIN D AC  +++  GIG++ RN+K  I+AA     A  + L A+  A+ DG
Subjt:  KADWILEYVNEFS-------GACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRY--GIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDG

XP_022155289.1 uncharacterized protein LOC111022426 [Momordica charantia]1.4e-3042.54Show/hide
Query:  QLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPP
        Q+W  + P  E+G   N C+   W+ WMK LSA+ + +A ITCWALWND NA+IN K + E   K                        RIPR N W PP
Subjt:  QLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPP

Query:  PTG-LTKINRDRACANNRYGIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLI
        P G + K+N D A +    G+GVL R+   +IVAA++D       L  +I AIR+GL+LATRLG+ RV+V  +SLEA++LI
Subjt:  PTG-LTKINRDRACANNRYGIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLI

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.7e-2931.1Show/hide
Query:  NLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLE-MGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTK
        NL  +GI E+P    C    E++ H    CK + Q+W  LFP L  +    N  F   W+   + L    + +A IT W +WNDRN++I+ K V  V  K
Subjt:  NLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLE-MGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTK

Query:  ADWILEYVNEFSGA--CKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRNDKLDIVAAL-VDVQAGSNSLSAKIRAIRDGLKLATRL
         +W+  +++  S A       + +    P    W P  +   K+N D AC       G + R+    +VAA  + V    + L A+IR I +GLK A   
Subjt:  ADWILEYVNEFSGA--CKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRNDKLDIVAAL-VDVQAGSNSLSAKIRAIRDGLKLATRL

Query:  GLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAR-EASRLGQPCLWLSNFPCFV
            + V  +SL AI LI  +    G+  NW+ EI+ L+  F+ ISF H  R+    AH  A+   +       WL NFP ++
Subjt:  GLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAR-EASRLGQPCLWLSNFPCFV

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia]4.8e-4244.24Show/hide
Query:  MKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRND
        M+ LS +++ + GITCWALWNDR+A+IN K + E   K +WIL+Y  E       G K    RIPR NEW PP  G+ K+N D A +    G+GVL R  
Subjt:  MKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRND

Query:  KLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAREA
          +IV A+VD       L AKI AIR+GL LATRLG+ RV+V  +SLEA++LI +   W GE  +W+ +IR  +  F  I F H+FRE    A++  RE 
Subjt:  KLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAREA

Query:  SRLGQPCLWLSNFPCFV
          L    LW  +FP ++
Subjt:  SRLGQPCLWLSNFPCFV

XP_042980185.1 uncharacterized protein LOC122310356 [Carya illinoinensis]3.0e-2026.88Show/hide
Query:  NLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKA
        NL K+ +   PL P C   EETV H L  CK +  +W      L+   S    F+      ++ L    ++   +T   LWN RN+++   L       +
Subjt:  NLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKA

Query:  DWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRAC--ANNRYGIGVLSRNDKLDIVAAL--VDVQAGSNSLSAKIRAIRDGLKLATRL
          I   ++      + G   +  ++     W PPP G+ K N D A    N+R GIGV+ R+ K  ++A L           L   + A+R    L   L
Subjt:  DWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRAC--ANNRYGIGVLSRNDKLDIVAAL--VDVQAGSNSLSAKIRAIRDGLKLATRL

Query:  GLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAREASRLGQPCLWLSNFP
        GL  +++  +S+  +  + + EE W  VG  + +IR + S F S S  H+ R +   A+  A+ A    + C+ L ++P
Subjt:  GLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAREASRLGQPCLWLSNFP

TrEMBL top hitse value%identityAlignment
A0A6J1CK80 uncharacterized protein LOC1110120472.0e-1733.16Show/hide
Query:  MGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDR
        +G   N  F   W  W + L+  ++ +AGI CWA WNDRN   N   V +V+T++DWI  Y  E     +   +A+ + +P    W PP  G  K+N D 
Subjt:  MGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDR

Query:  A--CANNRYGIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAE
        A    N R G+GVL R+D+  I+AAL+     ++ L A+I AIR+ ++LA R     + VG    +  H +  +      +  WL++
Subjt:  A--CANNRYGIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAE

A0A6J1CTE3 uncharacterized protein LOC1110145783.9e-2133Show/hide
Query:  MTNLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNT
        M NL K+GI+  P    C +  ET DH L  CK + ++W  L P        N   +      ++ LS     + G+  WA+WNDRNA+   + + +   
Subjt:  MTNLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNT

Query:  KADWILEYVNEFS-------GACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRY--GIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDG
        ++DWIL YV +F        G  +    A       +N W PPP G  KIN D AC  +++  GIG++ RN+K  I+AA     A  + L A+  A+ DG
Subjt:  KADWILEYVNEFS-------GACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRY--GIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDG

A0A6J1DPU1 uncharacterized protein LOC1110224267.0e-3142.54Show/hide
Query:  QLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPP
        Q+W  + P  E+G   N C+   W+ WMK LSA+ + +A ITCWALWND NA+IN K + E   K                        RIPR N W PP
Subjt:  QLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPP

Query:  PTG-LTKINRDRACANNRYGIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLI
        P G + K+N D A +    G+GVL R+   +IVAA++D       L  +I AIR+GL+LATRLG+ RV+V  +SLEA++LI
Subjt:  PTG-LTKINRDRACANNRYGIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLI

A0A6J1DX30 uncharacterized protein LOC1110248741.3e-2931.1Show/hide
Query:  NLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLE-MGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTK
        NL  +GI E+P    C    E++ H    CK + Q+W  LFP L  +    N  F   W+   + L    + +A IT W +WNDRN++I+ K V  V  K
Subjt:  NLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLE-MGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTK

Query:  ADWILEYVNEFSGA--CKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRNDKLDIVAAL-VDVQAGSNSLSAKIRAIRDGLKLATRL
         +W+  +++  S A       + +    P    W P  +   K+N D AC       G + R+    +VAA  + V    + L A+IR I +GLK A   
Subjt:  ADWILEYVNEFSGA--CKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRNDKLDIVAAL-VDVQAGSNSLSAKIRAIRDGLKLATRL

Query:  GLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAR-EASRLGQPCLWLSNFPCFV
            + V  +SL AI LI  +    G+  NW+ EI+ L+  F+ ISF H  R+    AH  A+   +       WL NFP ++
Subjt:  GLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAR-EASRLGQPCLWLSNFPCFV

A0A6J1DZK3 uncharacterized protein LOC1110249682.3e-4244.24Show/hide
Query:  MKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRND
        M+ LS +++ + GITCWALWNDR+A+IN K + E   K +WIL+Y  E       G K    RIPR NEW PP  G+ K+N D A +    G+GVL R  
Subjt:  MKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWILEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRND

Query:  KLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAREA
          +IV A+VD       L AKI AIR+GL LATRLG+ RV+V  +SLEA++LI +   W GE  +W+ +IR  +  F  I F H+FRE    A++  RE 
Subjt:  KLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSLEAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAREA

Query:  SRLGQPCLWLSNFPCFV
          L    LW  +FP ++
Subjt:  SRLGQPCLWLSNFPCFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAATCTAAAGAAGAAAGGGATAGAAGAAATACCTCTTTACCCATTTTGTAAAAGAAGTGAGGAAACGGTGGATCACTTTCTTGCTGGATGCAAGCACTCA
AGTCAGTTATGGATGAGACTATTTCCAAATTTGGAGATGGGGAAAAGCTTAAATGGATGTTTTCGTGCTTGGTGGACTGAGTGGATGAAAATTCTTTCAGCGACA
AAAATAACCATTGCGGGCATCACCTGCTGGGCCTTATGGAATGACAGAAACGCTGTGATAAATAACAAGCTAGTCTTAGAAGTGAACACAAAAGCCGATTGGATT
CTTGAGTATGTGAATGAGTTTTCAGGTGCTTGTAAAGCAGGTGCGAAGGCTGAAGGAGAAAGAATACCTCGGAAGAATGAGTGGGTCCCTCCTCCGACAGGCTTG
ACGAAAATTAATAGGGATAGAGCCTGCGCAAATAACAGATATGGGATTGGGGTTCTAAGCAGGAACGACAAATTGGACATCGTTGCAGCTTTGGTTGATGTTCAG
GCTGGAAGCAACTCCCTCTCGGCTAAAATCCGGGCAATTCGTGATGGTCTAAAACTTGCAACAAGGTTAGGGCTAAAAAGGGTGTTAGTTGGATTTAACTCTTTG
GAAGCTATACATCTTATTTGTGAAGATGAAGAGTGGTGGGGTGAAGTAGGAAATTGGCTTGCTGAAATTAGAACCCTCTCAAGTGCTTTCTCGTCAATTTCCTTC
TACCATATTTTTAGGGAGTTAACTAATAGGGCTCATTATTTTGCTAGAGAAGCTTCAAGGCTTGGACAACCATGTCTCTGGCTTTCAAATTTCCCTTGTTTTGTC
TGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAAATCTAAAGAAGAAAGGGATAGAAGAAATACCTCTTTACCCATTTTGTAAAAGAAGTGAGGAAACGGTGGATCACTTTCTTGCTGGATGCAAGCACTCA
AGTCAGTTATGGATGAGACTATTTCCAAATTTGGAGATGGGGAAAAGCTTAAATGGATGTTTTCGTGCTTGGTGGACTGAGTGGATGAAAATTCTTTCAGCGACA
AAAATAACCATTGCGGGCATCACCTGCTGGGCCTTATGGAATGACAGAAACGCTGTGATAAATAACAAGCTAGTCTTAGAAGTGAACACAAAAGCCGATTGGATT
CTTGAGTATGTGAATGAGTTTTCAGGTGCTTGTAAAGCAGGTGCGAAGGCTGAAGGAGAAAGAATACCTCGGAAGAATGAGTGGGTCCCTCCTCCGACAGGCTTG
ACGAAAATTAATAGGGATAGAGCCTGCGCAAATAACAGATATGGGATTGGGGTTCTAAGCAGGAACGACAAATTGGACATCGTTGCAGCTTTGGTTGATGTTCAG
GCTGGAAGCAACTCCCTCTCGGCTAAAATCCGGGCAATTCGTGATGGTCTAAAACTTGCAACAAGGTTAGGGCTAAAAAGGGTGTTAGTTGGATTTAACTCTTTG
GAAGCTATACATCTTATTTGTGAAGATGAAGAGTGGTGGGGTGAAGTAGGAAATTGGCTTGCTGAAATTAGAACCCTCTCAAGTGCTTTCTCGTCAATTTCCTTC
TACCATATTTTTAGGGAGTTAACTAATAGGGCTCATTATTTTGCTAGAGAAGCTTCAAGGCTTGGACAACCATGTCTCTGGCTTTCAAATTTCCCTTGTTTTGTC
TGTTAG
Protein sequenceShow/hide protein sequence
MTNLKKKGIEEIPLYPFCKRSEETVDHFLAGCKHSSQLWMRLFPNLEMGKSLNGCFRAWWTEWMKILSATKITIAGITCWALWNDRNAVINNKLVLEVNTKADWI
LEYVNEFSGACKAGAKAEGERIPRKNEWVPPPTGLTKINRDRACANNRYGIGVLSRNDKLDIVAALVDVQAGSNSLSAKIRAIRDGLKLATRLGLKRVLVGFNSL
EAIHLICEDEEWWGEVGNWLAEIRTLSSAFSSISFYHIFRELTNRAHYFAREASRLGQPCLWLSNFPCFVC