; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g14530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g14530
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:11088485..11090005
RNA-Seq ExpressionMoc08g14530
SyntenyMoc08g14530
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0016740 - transferase activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.4e-2641.67Show/hide
Query:  GVLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNI
        G+ KF+G +F+FW+MQ++D L  KK+H+ L  +P +M  ++W+ +D Q +  IR+TLS NV   VAKE T + L+K L D YEK SAN K+ L  K F++
Subjt:  GVLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNI

Query:  HMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA
         M+EG  V +H+NE   I+N+L  + ++ D+EV+A+ L+   P     M A+ S +
Subjt:  HMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]8.2e-2742.31Show/hide
Query:  GVLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNI
        G+ KF+G +F+FW+MQ++D L  KK+H+ L  +P +M  ++W+ +D Q +  IR+TLS NV   VAKE T + L+K L D YEK SAN K+ L  K F++
Subjt:  GVLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNI

Query:  HMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA
         M+EG  V +H+NE   I+N+L  + ++ D+EV+A+ LL   P     M A+ S +
Subjt:  HMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]8.2e-2742.31Show/hide
Query:  GVLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNI
        G+ KF+G +F+FW+MQ++D L  KK+H+ L  +P +M  ++W+ +D Q +  IR+TLS NV   VAKE T + L+K L D YEK SAN K+ L  K F++
Subjt:  GVLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNI

Query:  HMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA
         M+EG  V +H+NE   I+N+L  + ++ D+EV+A+ LL   P     M A+ S +
Subjt:  HMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA

VFQ60264.1 unnamed protein product [Cuscuta campestris]2.4e-2646.72Show/hide
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD
        KFNG +FS+WKMQ++DLL  K +   LG++P +M D DW  +D +A++ IR++L+ NV   + KE TAK++++AL + YEK SA  K+ L  +  N  M 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD

Query:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT
        EGTSV  HIN+L  IL +L  +G+K D+EV+A+ LL+
Subjt:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT

VFQ71379.1 unnamed protein product [Cuscuta campestris]5.3e-2646.72Show/hide
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD
        KF+G +FS+WKMQ++DLL  K ++  LG++P +M D DW  +D +A++ IR++L+ NV   + KE TAK +L AL + YEK S   K+ L  +  N  M 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD

Query:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT
        EGTSV  HIN+L  IL +L  +G+K D+EVKA+ LL+
Subjt:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT

TrEMBL top hitse value%identityAlignment
A0A0D3DQC2 Abhydrolase_3 domain-containing protein1.2e-2641.94Show/hide
Query:  VLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIH
        + KF+G +++FW+MQ++D L  KK+H+ L ++P +M   +W  +D Q +  IR+TLS NV   VAKE TA+ L+K L D YEK SAN K+ L  K F++ 
Subjt:  VLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIH

Query:  MDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA
        M+EG  V +H+NE   I+N+L  + ++ D+EV+A+ LL   P     M A+ S +
Subjt:  MDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGA

A0A484K8X3 Uncharacterized protein1.2e-2646.72Show/hide
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD
        KFNG +FS+WKMQ++DLL  K +   LG++P +M D DW  +D +A++ IR++L+ NV   + KE TAK++++AL + YEK SA  K+ L  +  N  M 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD

Query:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT
        EGTSV  HIN+L  IL +L  +G+K D+EV+A+ LL+
Subjt:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT

A0A484L4X7 CCHC-type domain-containing protein2.6e-2646.72Show/hide
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD
        KF+G +FS+WKMQ++DLL  K ++  LG++P +M D DW  +D +A++ IR++L+ NV   + KE TAK +L AL + YEK S   K+ L  +  N  M 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD

Query:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT
        EGTSV  HIN+L  IL +L  +G+K D+EVKA+ LL+
Subjt:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT

A0A484MTE7 CCHC-type domain-containing protein3.4e-2645.99Show/hide
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD
        KF+G +FS+WKMQ++DLL  K +   LG++P +M D+DW  +D +A++ IR++L+ NV   + KE TAK +++AL + YEK SA  K+ L  +  N  M 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD

Query:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT
        EGTSV  HIN+L  IL +L  +G+K D+EV+A+ LL+
Subjt:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT

A0A484N247 CCHC-type domain-containing protein2.6e-2644.16Show/hide
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD
        KF+G +FS+W++Q++DLL  K +   LG++P +M D DW  +D +A++ IR++L+ NV   + KETTAK +++AL + YEK SA  K+ L  ++ N  M 
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYFNIHMD

Query:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEK-LGKMSASTSGA
        EGTSV  HIN+L  IL +L  +G+K D+EV+A+ LL+  P+   G ++A TS A
Subjt:  EGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEK-LGKMSASTSGA

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.5e-1932.29Show/hide
Query:  VLKFNGEN-FSFWKMQVKDLLTCKKIHKTL---GERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKY
        V KFNG+N FS W+ +++DLL  + +HK L    ++P  M  +DW ++DE+A + IR+ LS +V + +  E TA+ +   L+  Y   +   K+ L  + 
Subjt:  VLKFNGEN-FSFWKMQVKDLLTCKKIHKTL---GERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKY

Query:  FNIHMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGAENRVE-----SALVAQNKGKAKISYNGKQKFKE
        + +HM EGT+  SH+N    ++ +L  +GVKI+EE KA+ LL   P     ++ +    +  +E     SAL+   K + K    G+    E
Subjt:  FNIHMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLT--PEKLGKMSASTSGAENRVE-----SALVAQNKGKAKISYNGKQKFKE

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein3.6e-1238.2Show/hide
Query:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKIL
        K +G ++SF +M+++D L  KK+H+ LG++   M   DWN +  Q +  IR+T+S N+   VAKE +   L+K L D Y+K S N  ++
Subjt:  KFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGACAGATATCAAGCACACCATTATTACGATGGGGTCCTGAAGTTCAATGGGGAGAATTTCAGTTTTTGGAAGATGCAGGTAAAGGATCTTCTTACATGCAAGAA
GATACACAAGACTTTAGGGGAAAGACCAGCACGAATGCCGGACAAGGATTGGAATGAGATGGATGAGCAGGCTGTTGCCAACATCAGAATGACATTATCAATGAATGTTT
GTAGTCTGGTGGCGAAAGAGACTACAGCGAAAGATTTGTTGAAGGCCTTGCAAGATAGGTATGAAAAACTTTCTGCCAATACAAAGATACTTCTATGGATGAAGTATTTT
AATATCCACATGGATGAGGGAACCTCGGTGAATTCCCACATTAATGAACTCACCGATATCTTGAACAAGTTAGAAGGCATGGGTGTCAAGATTGACGAGGAGGTGAAAGC
TATGAGGCTGTTGACCCCGGAGAAATTAGGGAAAATGTCTGCATCTACTTCAGGGGCAGAAAACAGGGTTGAATCGGCTTTGGTAGCTCAGAACAAAGGGAAGGCAAAGA
TTAGTTACAATGGGAAGCAGAAGTTTAAAGAAGATCTTGAGAAGGGGAACACTACTGCAAATGTTGTAACAGAAGAAGAACGGATGGAAGAGTTTCTGGCTTACGATACA
GATCAGAAGAATCTGCTATCAGCTCAAGTAAAACAGTTGAGAAGTACAGAAAAGGGAAACGGGAATTTGATAGGCCATCGAGTTCATACCTCAGCTGTCAGACGTTCAGG
CAAGCTGGTGAAGTCGCATAGGCGAATTAGTGCATTGAAGGGTATGTGTTCTGTTTCTAGTGTGGTGACAGACTTGGGTGGGAGCGCCAAGTCATCAAGGGAATCTTCCT
TCAGCGGTCGTTGGGTTCAATCAAGAAAGGAAGCGACGGGGACTACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGACAGATATCAAGCACACCATTATTACGATGGGGTCCTGAAGTTCAATGGGGAGAATTTCAGTTTTTGGAAGATGCAGGTAAAGGATCTTCTTACATGCAAGAA
GATACACAAGACTTTAGGGGAAAGACCAGCACGAATGCCGGACAAGGATTGGAATGAGATGGATGAGCAGGCTGTTGCCAACATCAGAATGACATTATCAATGAATGTTT
GTAGTCTGGTGGCGAAAGAGACTACAGCGAAAGATTTGTTGAAGGCCTTGCAAGATAGGTATGAAAAACTTTCTGCCAATACAAAGATACTTCTATGGATGAAGTATTTT
AATATCCACATGGATGAGGGAACCTCGGTGAATTCCCACATTAATGAACTCACCGATATCTTGAACAAGTTAGAAGGCATGGGTGTCAAGATTGACGAGGAGGTGAAAGC
TATGAGGCTGTTGACCCCGGAGAAATTAGGGAAAATGTCTGCATCTACTTCAGGGGCAGAAAACAGGGTTGAATCGGCTTTGGTAGCTCAGAACAAAGGGAAGGCAAAGA
TTAGTTACAATGGGAAGCAGAAGTTTAAAGAAGATCTTGAGAAGGGGAACACTACTGCAAATGTTGTAACAGAAGAAGAACGGATGGAAGAGTTTCTGGCTTACGATACA
GATCAGAAGAATCTGCTATCAGCTCAAGTAAAACAGTTGAGAAGTACAGAAAAGGGAAACGGGAATTTGATAGGCCATCGAGTTCATACCTCAGCTGTCAGACGTTCAGG
CAAGCTGGTGAAGTCGCATAGGCGAATTAGTGCATTGAAGGGTATGTGTTCTGTTTCTAGTGTGGTGACAGACTTGGGTGGGAGCGCCAAGTCATCAAGGGAATCTTCCT
TCAGCGGTCGTTGGGTTCAATCAAGAAAGGAAGCGACGGGGACTACTTAA
Protein sequenceShow/hide protein sequence
MGDRYQAHHYYDGVLKFNGENFSFWKMQVKDLLTCKKIHKTLGERPARMPDKDWNEMDEQAVANIRMTLSMNVCSLVAKETTAKDLLKALQDRYEKLSANTKILLWMKYF
NIHMDEGTSVNSHINELTDILNKLEGMGVKIDEEVKAMRLLTPEKLGKMSASTSGAENRVESALVAQNKGKAKISYNGKQKFKEDLEKGNTTANVVTEEERMEEFLAYDT
DQKNLLSAQVKQLRSTEKGNGNLIGHRVHTSAVRRSGKLVKSHRRISALKGMCSVSSVVTDLGGSAKSSRESSFSGRWVQSRKEATGTT