; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g11170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g11170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:8416153..8416647
RNA-Seq ExpressionMoc04g11170
SyntenyMoc04g11170
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]5.1e-2843.87Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L   P  M  + W+ +D Q +  IR+TLS  V   VAKE T + L++ L D YEKPSAN K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ D+EV+A+ L+ SLP+SWE M+  +SNS+ +  LKF  + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]3.0e-2844.52Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L   P  M  + W+ +D Q +  IR+TLS  V   VAKE T + L++ L D YEKPSAN K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ D+EV+A+ LL SLP+SWE M+  +SNS+ +  LKF  + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]3.0e-2844.52Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L   P  M  + W+ +D Q +  IR+TLS  V   VAKE T + L++ L D YEKPSAN K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ D+EV+A+ LL SLP+SWE M+  +SNS+ +  LKF  + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

KHN02029.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja]2.2e-2644.87Show/hide
Query:  MQVKDLLPCKKIHKTL-GESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHIN
        MQ++D L  KK+++ L G  P +M  + WN +D QA+  IR+TL+  V   +  E T   L++ L D YEKPSA  K+ L  ++FN+ M +GISV  HIN
Subjt:  MQVKDLLPCKKIHKTL-GESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHIN

Query:  ELTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
        E   IL +LE + IK ++EVKA+ LL+SLPDSW      +S+S  +N+LK S I D
Subjt:  ELTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

VDD56318.1 unnamed protein product [Brassica oleracea]3.3e-2742.58Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L + P  +    W  +D Q +  IR+TLS  V   VAKE T + L++ L D YEKPSAN K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ ++EV+A+ LL SLP+SWE M+  +SN +    LKF+ + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

TrEMBL top hitse value%identityAlignment
A0A0D3A8G8 Uncharacterized protein2.1e-2743.79Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L + P  M    W  +D Q +  IR+TLS  V   VAKE T + L++ L D YEKPSAN K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTI
           I+N+L  + I+ ++EV+A+ LL SLP+SWE M+  +SNS+    LKF+ +
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTI

A0A0D3AEM1 CCHC-type domain-containing protein1.1e-2843.87Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L + P  M    W  +D Q +  IR+TLS  V   V KE T + L++ L D YEKPSAN+K+ L  K F++ ME+G  V +HINE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ ++EV+A+ LL SLP+SWE+M++ +SNS+    LKF+ + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

A0A0D3BM55 Uncharacterized protein6.5e-2945.16Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+TL + P  M    W  +D Q +  IR+TLS  V   VAKE T + L++ L D YEKPSAN K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ D+EV+A+ LL SLP+SWE M+  +SNS+    LKF+ + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

A0A0D3DMW7 CCHC-type domain-containing protein2.1e-2742.58Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L + P  M    W  +D Q +  IR+TLS  V   +AKE T + L++ L D YEKPS N K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ D+EV+A+ LL SLP+SWE M+  ++NS+    LKF+ + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

A0A3P6FTI2 Abhydrolase_3 domain-containing protein1.6e-2742.58Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE
        MQ++D L  KK+H+ L + P  +    W  +D Q +  IR+TLS  V   VAKE T + L++ L D YEKPSAN K+ L  K F++ ME+G  V +H+NE
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINE

Query:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD
           I+N+L  + I+ ++EV+A+ LL SLP+SWE M+  +SN +    LKF+ + D
Subjt:  LTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICD

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-1430.46Show/hide
Query:  QVKDLLPCKKIHKTL---GESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHI
        +++DLL  + +HK L    + P  M  + W ++DE+A + IR+ LS  V + +  E TA+ +   L+  Y   +   K+ L  + + +HM +G +  SH+
Subjt:  QVKDLLPCKKIHKTL---GESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHI

Query:  NELTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLK
        N    ++ +L  +G+KI+EE KA+ LL SLP S++ +   + +      LK
Subjt:  NELTDILNKLEGMGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLK

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein2.2e-0835.9Show/hide
Query:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKIL
        M+++D L  KK+H+ LG+    M+   WN +  Q +  IR+T+S  +   VAKE +   L++ L D Y+KPS N  ++
Subjt:  MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTAAAAGATCTTCTTCCGTGCAAGAAGATACACAAGACTTTGGGGGAGAGTCCAACGAATATGACAGATAAGACTTGGAATGAGATGGATGAGCAGGCCGTTGC
AAATATCAGAATGACATTATCGATGAGGGTATGCAGTCTGGTTGCGAAAGAGACTACAGCGAAGGAACTATTGCAGACCTTGCAAGACAGGTATGAGAAACCTTCTGCCA
ATACAAAAATACTTCTATGGACGAAGTATTTTAATATCCACATGGAGAAGGGAATCTCGGTGAATTCCCACATTAATGAGCTCACTGACATCTTGAACAAATTAGAAGGG
ATGGGTATTAAGATCGATGAGGAGGTAAAGGCTATGAGGTTGCTGACATCTTTGCCTGATAGTTGGGAGACGATGAAGATTGTGTTGTCAAATTCGTTAGAGGACAATAG
CTTGAAATTTTCAACTATTTGTGATGCTGCCTGTTGGGATTTTCTCTACCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTAAAAGATCTTCTTCCGTGCAAGAAGATACACAAGACTTTGGGGGAGAGTCCAACGAATATGACAGATAAGACTTGGAATGAGATGGATGAGCAGGCCGTTGC
AAATATCAGAATGACATTATCGATGAGGGTATGCAGTCTGGTTGCGAAAGAGACTACAGCGAAGGAACTATTGCAGACCTTGCAAGACAGGTATGAGAAACCTTCTGCCA
ATACAAAAATACTTCTATGGACGAAGTATTTTAATATCCACATGGAGAAGGGAATCTCGGTGAATTCCCACATTAATGAGCTCACTGACATCTTGAACAAATTAGAAGGG
ATGGGTATTAAGATCGATGAGGAGGTAAAGGCTATGAGGTTGCTGACATCTTTGCCTGATAGTTGGGAGACGATGAAGATTGTGTTGTCAAATTCGTTAGAGGACAATAG
CTTGAAATTTTCAACTATTTGTGATGCTGCCTGTTGGGATTTTCTCTACCTTTAA
Protein sequenceShow/hide protein sequence
MQVKDLLPCKKIHKTLGESPTNMTDKTWNEMDEQAVANIRMTLSMRVCSLVAKETTAKELLQTLQDRYEKPSANTKILLWTKYFNIHMEKGISVNSHINELTDILNKLEG
MGIKIDEEVKAMRLLTSLPDSWETMKIVLSNSLEDNSLKFSTICDAACWDFLYL