CuGenDBv2

Moc07g01080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g01080
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr7:859019..863722
RNA-Seq Expression: Moc07g01080
Synteny: Moc07g01080
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
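
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the locus can be parsed and its span computed as shown here. `parse_location` is a hypothetical helper written for illustration and is not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr7:859019..863722")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 4704 bases on chr7; with a 0-based, half-open convention the span would be one base shorter.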


Homology
GenBank top hits
CAN65867.1 hypothetical protein VITISV_034935 [Vitis vinifera]; e value 3.0e-13; identity 33.17%
Query:  VKDLLTCKKIH-KTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQ------------------HSWETMKTAVLNSLEENS
        ++D L  +K+H   LG +PE M  ++W  +D Q +  IR++LS +V   V KE T   L+KAL                   +SWE M+ AV NS  +  
Subjt:  VKDLLTCKKIH-KTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQ------------------HSWETMKTAVLNSLEENS

Query:  LKFTAICDAALSEEAQRKLGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQR---YNKGSGSSSGEVECYYCHKKGHIKRFCRKFKEDLEKGNITA
        LK+  I D  L+EE +R   + +  TSG+ + +   L  + +G  + S  G+   R    N+    S  +V+C+ C K GH KR C+  K+  E  + +A
Subjt:  LKFTAICDAALSEEAQRKLGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQR---YNKGSGSSSGEVECYYCHKKGHIKRFCRKFKEDLEKGNITA

Query:  NVVTEEEQ
        N VTEE Q
Subjt:  NVVTEEEQ

GFY92962.1 hypothetical protein Acr_08g0013580 [Actinidia rufa]; e value 6.1e-14; identity 29.26%
Query:  VKDLLTCKKIHKTL---GERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTA-----------KYLLKALQHSWETMKTAVLNSLEENSLKFTA
        ++D+L CK +H  L   GE+PE   D++W +M+ + +  IR  +   V   VA+ET+A             LL +L  SWET+  ++ NS     L  + 
Subjt:  VKDLLTCKKIHKTL---GERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTA-----------KYLLKALQHSWETMKTAVLNSLEENSLKFTA

Query:  ICDAALSEEAQRK-LGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYYCHKKGHIKRFCRKFKEDLEKGNITAN-VVTEE
        + DA  +EEA+R+ +G  S   S ++  V      + +G+ +  + G ++ R+   S +    V C+YC ++ HIKR C K+K  ++  +  A  V+ +E
Subjt:  ICDAALSEEAQRK-LGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYYCHKKGHIKRFCRKFKEDLEKGNITAN-VVTEE

Query:  EQIEEVVATGHKRSSVYVSEFGVAKGLLR
        ++I+ ++A      S +V + G A  L R
Subjt:  EQIEEVVATGHKRSSVYVSEFGVAKGLLR

KAE8683520.1 ORF65c [Hibiscus syriacus]; e value 1.8e-13; identity 36.2%
Query:  KIHKTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQHSWETMKTAVLNSLEENSLKFTAICDAALSEEAQRKLGKMSASTS
        KI K  G++PEGM ++DW  +D QA+  IR++LS NV   +AKE T   L+ AL   W T  T   +S   N LKF  + D  LSEE Q++         
Subjt:  KIHKTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQHSWETMKTAVLNSLEENSLKFTAICDAALSEEAQRKLGKMSASTS

Query:  GAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYY-CHKKGHIKRFCRKFKED
          E    SAL  +++G+T    + + + +  +G   +  +   YY C KKGH KR+ R  K+D
Subjt:  GAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYY-CHKKGHIKRFCRKFKED

XP_022152155.1 cinnamoyl-CoA reductase-like SNL6 [Momordica charantia]; e value 3.3e-44; identity 81.82%
Query:  VVATGHKRSSVYVSEFGVAKGLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKG
        VVATGHKRS VYVSEF VAKG LRQTMH+V A GS R LR PAALM +TDQKNLPS QVKQLRST+K N NLIGHRVHTSAVR  GELVKSHRRISALKG
Subjt:  VVATGHKRSSVYVSEFGVAKGLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKG

Query:  TGSVSSVATDLGGSAKSSSGESSFRGRWVRSR
        TGS+SSVAT L GSAK SSGESSFR  W+RSR
Subjt:  TGSVSSVATDLGGSAKSSSGESSFRGRWVRSR

XP_022157059.1 uncharacterized protein LOC111023870 [Momordica charantia]; e value 3.9e-37; identity 83.33%
Query:  GLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKGTGSVSSVATDLGGSAKSSSG
        G LRQTMHRVA D SGRDL+GP  LM RTDQKNLPS  VKQLRSTEK N NLIGH+VHTSAVR SGELVKSHRRISA KGTG VSSV TDLGGSAK SSG
Subjt:  GLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKGTGSVSSVATDLGGSAKSSSG

Query:  ESSFRGRW
        ESSF+GRW
Subjt:  ESSFRGRW

TrEMBL top hits
A0A2N9F2J7 Uncharacterized protein; e value 2.5e-13; identity 31.65%
Query:  VKDLLTCKKIH-KTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKAL-------------------------------QH--SW
        ++D L  KK+H   LGE+P+ M D +W  +D Q +  IR++LS +V   V KE T   L+ AL                               QH   W
Subjt:  VKDLLTCKKIH-KTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKAL-------------------------------QH--SW

Query:  ETMKTAVLNSLEENSLKFTAICDAALSEEAQRKLGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYYCHKKGHIKRFCRK
        E M+ A  NS  +  LK+  I D  L EE +R+      S+SG+   +E+    + +GK +    G+ + R  +       ++EC+ C K GHI++ CR+
Subjt:  ETMKTAVLNSLEENSLKFTAICDAALSEEAQRKLGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYYCHKKGHIKRFCRK

Query:  FKEDLEKGNITANVVTEE
         K+  E  N +ANVVTEE
Subjt:  FKEDLEKGNITANVVTEE

A0A6A2YW35 ORF65c; e value 8.6e-14; identity 36.2%
Query:  KIHKTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQHSWETMKTAVLNSLEENSLKFTAICDAALSEEAQRKLGKMSASTS
        KI K  G++PEGM ++DW  +D QA+  IR++LS NV   +AKE T   L+ AL   W T  T   +S   N LKF  + D  LSEE Q++         
Subjt:  KIHKTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQHSWETMKTAVLNSLEENSLKFTAICDAALSEEAQRKLGKMSASTS

Query:  GAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYY-CHKKGHIKRFCRKFKED
          E    SAL  +++G+T    + + + +  +G   +  +   YY C KKGH KR+ R  K+D
Subjt:  GAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSGSSSGEVECYY-CHKKGHIKRFCRKFKED

A0A6J1DD48 cinnamoyl-CoA reductase-like SNL6; e value 1.6e-44; identity 81.82%
Query:  VVATGHKRSSVYVSEFGVAKGLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKG
        VVATGHKRS VYVSEF VAKG LRQTMH+V A GS R LR PAALM +TDQKNLPS QVKQLRST+K N NLIGHRVHTSAVR  GELVKSHRRISALKG
Subjt:  VVATGHKRSSVYVSEFGVAKGLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKG

Query:  TGSVSSVATDLGGSAKSSSGESSFRGRWVRSR
        TGS+SSVAT L GSAK SSGESSFR  W+RSR
Subjt:  TGSVSSVATDLGGSAKSSSGESSFRGRWVRSR

A0A6J1DVE9 uncharacterized protein LOC111023870; e value 1.9e-37; identity 83.33%
Query:  GLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKGTGSVSSVATDLGGSAKSSSG
        G LRQTMHRVA D SGRDL+GP  LM RTDQKNLPS  VKQLRSTEK N NLIGH+VHTSAVR SGELVKSHRRISA KGTG VSSV TDLGGSAK SSG
Subjt:  GLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRSTEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKGTGSVSSVATDLGGSAKSSSG

Query:  ESSFRGRW
        ESSF+GRW
Subjt:  ESSFRGRW

A5BPB3 Uncharacterized protein; e value 1.5e-13; identity 33.17%
Query:  VKDLLTCKKIH-KTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQ------------------HSWETMKTAVLNSLEENS
        ++D L  +K+H   LG +PE M  ++W  +D Q +  IR++LS +V   V KE T   L+KAL                   +SWE M+ AV NS  +  
Subjt:  VKDLLTCKKIH-KTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQ------------------HSWETMKTAVLNSLEENS

Query:  LKFTAICDAALSEEAQRKLGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQR---YNKGSGSSSGEVECYYCHKKGHIKRFCRKFKEDLEKGNITA
        LK+  I D  L+EE +R   + +  TSG+ + +   L  + +G  + S  G+   R    N+    S  +V+C+ C K GH KR C+  K+  E  + +A
Subjt:  LKFTAICDAALSEEAQRKLGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQR---YNKGSGSSSGEVECYYCHKKGHIKRFCRKFKEDLEKGNITA

Query:  NVVTEEEQ
        N VTEE Q
Subjt:  NVVTEEEQ

SwissProt top hits
No hits found
Arabidopsis top hits
AT3G29785.1 unknown protein; e value 2.5e-05; identity 35.21%
Query:  VKDLLTCKKIHKTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQHSWETMKT
        ++D L  KK+H+ LG++ E M   DWN +  Q +  IR+++S N+   VAKE +   L+K L   ++   T
Subjt:  VKDLLTCKKIHKTLGERPEGMPDKDWNEMDEQAVANIRMSLSINVCSLVAKETTAKYLLKALQHSWETMKT


Sequences
CDS sequence
ATGCAACGCAACAAGAGGATGTCGAGATTAGACGAGCCAAGCCGACTTAGAGCTAATCGGCTTGGCCCATGGCCAACCCAAAGAATGGTCGACTGGGTCGCAAGCCGATC
GGAGACTGGTCGGCTTGTGGTTGCTCCTTGGTCGGATTCACACTTAGGTATCGACCTCCGAGTTCTCAAGCGGCCTTTTGAGATTTGGGTCTTCAATTTCCCAACAAGTG
GTATCAGAGCTGTAAAGGATCTTCTTACATGCAAGAAGATACACAAGACTTTAGGGGAGAGACCAGAAGGGATGCCGGACAAGGATTGGAATGAGATGGATGAGCAGGCC
GTTGCGAACATCAGAATGTCGTTGTCGATCAATGTTTGTAGTCTGGTGGCAAAAGAGACTACAGCGAAATATTTGTTGAAGGCCTTGCAACACAGTTGGGAGACGATGAA
GACCGCGGTGTTGAATTCGCTCGAGGAAAATAGCTTGAAATTTACAGCTATTTGTGATGCCGCCTTATCTGAGGAAGCCCAGAGAAAATTAGGGAAAATGTCTGCATCTA
CTTCAGGGGCAGAAACTGGGGTTGAATCAGCTTTGGTAGCTCAGAACAAAGGGAAGACAAAGATGAGTTACAATGGGAAACAGCAGCAGAGATATAACAAGGGTAGTGGG
AGTTCCAGTGGAGAAGTAGAATGTTATTACTGCCACAAGAAGGGACACATTAAACGCTTTTGCAGGAAGTTTAAAGAAGATCTTGAGAAGGGGAACATTACTGCAAATGT
TGTAACAGAAGAAGAACAAATTGAAGAGGTGGTGGCAACAGGCCACAAGAGATCTTCTGTTTATGTGTCAGAATTTGGGGTTGCCAAGGGTTTACTGAGACAGACGATGC
ACAGAGTAGCTGCAGATGGTTCAGGGCGAGACCTTAGAGGACCAGCAGCATTGATGGTCAGAACAGATCAGAAGAATCTGCCATCAACTCAAGTAAAACAGTTGAGAAGT
ACAGAAAAGAGAAACAGGAATTTGATTGGCCATCGAGTTCATACCTCAGCTGTCAGACATTCAGGCGAGCTGGTGAAGTCGCATAGGCGAATTAGTGCATTGAAGGGTAC
GGGTTCTGTTTCTAGTGTGGCGACAGACTTGGGTGGGAGCGCCAAGTCATCATCAGGGGAATCTTCCTTTAGAGGTCGTTGGGTTCGATCGAGAAAGGAAGCGACGGGGA
CCACTTAA
mRNA sequence
ATGCAACGCAACAAGAGGATGTCGAGATTAGACGAGCCAAGCCGACTTAGAGCTAATCGGCTTGGCCCATGGCCAACCCAAAGAATGGTCGACTGGGTCGCAAGCCGATC
GGAGACTGGTCGGCTTGTGGTTGCTCCTTGGTCGGATTCACACTTAGGTATCGACCTCCGAGTTCTCAAGCGGCCTTTTGAGATTTGGGTCTTCAATTTCCCAACAAGTG
GTATCAGAGCTGTAAAGGATCTTCTTACATGCAAGAAGATACACAAGACTTTAGGGGAGAGACCAGAAGGGATGCCGGACAAGGATTGGAATGAGATGGATGAGCAGGCC
GTTGCGAACATCAGAATGTCGTTGTCGATCAATGTTTGTAGTCTGGTGGCAAAAGAGACTACAGCGAAATATTTGTTGAAGGCCTTGCAACACAGTTGGGAGACGATGAA
GACCGCGGTGTTGAATTCGCTCGAGGAAAATAGCTTGAAATTTACAGCTATTTGTGATGCCGCCTTATCTGAGGAAGCCCAGAGAAAATTAGGGAAAATGTCTGCATCTA
CTTCAGGGGCAGAAACTGGGGTTGAATCAGCTTTGGTAGCTCAGAACAAAGGGAAGACAAAGATGAGTTACAATGGGAAACAGCAGCAGAGATATAACAAGGGTAGTGGG
AGTTCCAGTGGAGAAGTAGAATGTTATTACTGCCACAAGAAGGGACACATTAAACGCTTTTGCAGGAAGTTTAAAGAAGATCTTGAGAAGGGGAACATTACTGCAAATGT
TGTAACAGAAGAAGAACAAATTGAAGAGGTGGTGGCAACAGGCCACAAGAGATCTTCTGTTTATGTGTCAGAATTTGGGGTTGCCAAGGGTTTACTGAGACAGACGATGC
ACAGAGTAGCTGCAGATGGTTCAGGGCGAGACCTTAGAGGACCAGCAGCATTGATGGTCAGAACAGATCAGAAGAATCTGCCATCAACTCAAGTAAAACAGTTGAGAAGT
ACAGAAAAGAGAAACAGGAATTTGATTGGCCATCGAGTTCATACCTCAGCTGTCAGACATTCAGGCGAGCTGGTGAAGTCGCATAGGCGAATTAGTGCATTGAAGGGTAC
GGGTTCTGTTTCTAGTGTGGCGACAGACTTGGGTGGGAGCGCCAAGTCATCATCAGGGGAATCTTCCTTTAGAGGTCGTTGGGTTCGATCGAGAAAGGAAGCGACGGGGA
CCACTTAA
Protein sequence
MQRNKRMSRLDEPSRLRANRLGPWPTQRMVDWVASRSETGRLVVAPWSDSHLGIDLRVLKRPFEIWVFNFPTSGIRAVKDLLTCKKIHKTLGERPEGMPDKDWNEMDEQA
VANIRMSLSINVCSLVAKETTAKYLLKALQHSWETMKTAVLNSLEENSLKFTAICDAALSEEAQRKLGKMSASTSGAETGVESALVAQNKGKTKMSYNGKQQQRYNKGSG
SSSGEVECYYCHKKGHIKRFCRKFKEDLEKGNITANVVTEEEQIEEVVATGHKRSSVYVSEFGVAKGLLRQTMHRVAADGSGRDLRGPAALMVRTDQKNLPSTQVKQLRS
TEKRNRNLIGHRVHTSAVRHSGELVKSHRRISALKGTGSVSSVATDLGGSAKSSSGESSFRGRWVRSRKEATGTT
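
The record annotates this protein with a CCHC-type zinc finger (IPR001878). As a rough illustration, the sketch below scans a fragment of the protein sequence above for the commonly cited CCHC consensus C-X2-C-X4-H-X4-C. The regular expression is a simplification chosen for this example, not the InterPro model itself, and `find_cchc` is a hypothetical helper name.

```python
import re

# Simplified CCHC zinc-knuckle consensus: C-X2-C-X4-H-X4-C.
# Assumption: this plain regex stands in for the profile-based InterPro signature.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

def find_cchc(protein):
    """Return every non-overlapping substring matching the simplified consensus."""
    return [m.group() for m in CCHC_PATTERN.finditer(protein)]

# Fragment copied from the protein sequence listed above.
fragment = "SSSGEVECYYCHKKGHIKRFCRKFKEDLEK"
hits = find_cchc(fragment)
print(hits)
```

A regex match of this kind only suggests where the annotated domain may lie; the InterPro assignment itself comes from a curated signature, so the two need not agree on every sequence.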