CuGenDBv2

Moc09g10380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g10380
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:8755343..8756512
RNA-Seq Expression: Moc09g10380
Synteny: Moc09g10380
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0032555 - purine ribonucleotide binding (molecular function)
GO:0043168 - anion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG7561662.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | e value 2.2e-30 | identity 43.26%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D Q + +IRL LS NVA  VA +KT  GLMK L+D YEKPSAN+KV+L+ + F++ MEEG  V +H+N+   ++NQL S+EI F  +V+A+ L++ LP+
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKN
         WE M+ AVSNS+G++ LKF D+ D  + EE+RR            GE    SA   +++G+ + R  + +G S+ +N
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKN

KAG7584790.1 Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | e value 5.7e-31 | identity 43.82%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D Q + +IRL LS NVA  VA +KT  GLMK L+D YEKPSAN+KV+L+ + F++ MEEG  V +H+N+   ++NQL S+EI F  +V+A+ LL+ LP+
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKN
         WE M+ AVSNS+G++ LKF D+ D  + EE+RR            GE  + SA   +++G+ + R  + +G S+ +N
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKN

KAG7593230.1 Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa] | e value 2.8e-30 | identity 41.79%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D Q + +IRL LS NVA  VA +KT  GLMK L+D YEKPSAN+KV+L+ + F++ MEEG  V +H+N+   ++NQL S+EI F  +V+A+ LL+ LP+
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRR--KGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK-NRWKLMRGSEVVVVGHKKAS
         WE M+ AVSNS+G++ LKF D+ D  + EE+RR   G    S++ +      +     Q++G+ K R GK Q  S++    W   +      V   KAS
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRR--KGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK-NRWKLMRGSEVVVVGHKKAS

Query:  M
        +
Subjt:  M

RZC29599.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja] | e value 7.4e-31 | identity 45.76%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D QA+ +IRL L+ NVA  + N+KT  GLMKAL+D YEKPSA +KVYL+ R FN+ M EG SV  HIN+   ++ QLES++I F  +VKA+ LLS LPD
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK
         W    TAVS+S  + +LK SDI D+ ++E++R++ + + S+  S    + E    T  KG+      K +G  QRK
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK

VDD56318.1 unnamed protein product [Brassica oleracea] | e value 3.7e-30 | identity 45.61%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D Q + +IRL LS NVA  VA +KT  GLMK L+D YEKPSAN+KV+L+ + F++ MEEG  V +H+N+   ++NQL S+EI F  +V+A+ LL+ LP+
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQ
         WE M+ AVSN +G + LKF+D+ D  +AEE+RR  + + S S++    +       +S GK K R G+ Q
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQ

TrEMBL top hits (e value, % identity, alignment)
A0A0D3AEM1 CCHC-type domain-containing protein | e value 1.2e-31 | identity 44.94%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D Q + +IRL LS NVA  V  +KT  GLMK L+D YEKPSANSKV+L+ + F++ MEEG  V +HIN+   ++NQL S+EI F  +V+A+ LL+ LP+
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKN
         WE+M+ AVSNS+G + LKF+D+ D  +AEE+RR            GE    SA   +++G+   R  +  G S+ +N
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKN

A0A0D3BM55 Uncharacterized protein | e value 4.3e-32 | identity 44.44%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D Q + +IRL LS NVA  VA +KT  GLMK L+D YEKPSAN+KV+L+ + F++ MEEG  V +H+N+   ++NQL S+EI F  +V+A+ LL+ LP+
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKNRW
         WE M+ AVSNS+G + LKF+D+ D  +AEE+RR            GE    SA   +++G+   R  +  G S+ +N W
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKNRW

A0A3P6FTI2 Abhydrolase_3 domain-containing protein | e value 1.8e-30 | identity 45.61%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D Q + +IRL LS NVA  VA +KT  GLMK L+D YEKPSAN+KV+L+ + F++ MEEG  V +H+N+   ++NQL S+EI F  +V+A+ LL+ LP+
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQ
         WE M+ AVSN +G + LKF+D+ D  +AEE+RR  + + S S++    +       +S GK K R G+ Q
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQ

A0A445FR60 Nucleolar pre-ribosomal-associated protein 1 isoform B | e value 1.8e-30 | identity 45.2%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D QA+ +IRL L+ NVA  + N+KT  GLMKAL+D YEKPSA +KVYL+ R FN+ M E  SV  HIN+   ++ QLES++I F  +VKA+ LLS LPD
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK
         W    TAVS+S  + +LK SDI D+ ++E++R++ + + S+  S    ++E    T  KG+      K +G  QRK
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK

A0A445M280 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value 3.6e-31 | identity 45.76%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +D QA+ +IRL L+ NVA  + N+KT  GLMKAL+D YEKPSA +KVYL+ R FN+ M EG SV  HIN+   ++ QLES++I F  +VKA+ LLS LPD
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK
         W    TAVS+S  + +LK SDI D+ ++E++R++ + + S+  S    + E    T  KG+      K +G  QRK
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRK

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value 5.6e-13 | identity 28.65%
Query:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD
        +DE+A + IRL LS +V + + ++ TA G+   L   Y   +  +K+YL  + + +HM EGT+  SH+N    L+ QL ++ +   ++ KAI LL+ LP 
Subjt:  MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPD

Query:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMR----YGKQQGHSQRKNRWK
         ++ + T + +  G  +++  D+    +  E  RK    +             ALIT+ +G+   R    YG+     + KNR K
Subjt:  GWETMKTAVSNSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMR----YGKQQGHSQRKNRWK

Arabidopsis top hits (e value, % identity, alignment)
AT3G29785.1 unknown protein | e value 1.3e-04 | identity 44.83%
Query:  QAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEG
        Q + +IRL +S N+A  VA +K+  GLMK L+D Y+KPS N+ V  I+    I +E+G
Subjt:  QAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEG


Sequences
CDS sequence
ATGGATGAACAAGCCGTAGCGATCATCAGGTTGTGCTTGTCAATGAATGTGGCAAGTCTCGTGGCGAATCAGAAAACTGCAATGGGATTGATGAAGGCGCTGACGGACAG
ATATGAAAAACCTTCTGCCAATAGCAAGGTGTATCTCATCACGAGATATTTTAACATTCACATGGAGGAAGGCACGTCGGTGAACTCCCACATCAATAAGGTCACTCAAC
TGATGAACCAGTTGGAGTCGATGGAGATCACTTTCTCAAAGAAGGTAAAGGCTATAAAGCTGTTGTCTTTTTTGCCTGACGGTTGGGAAACGATGAAGACGGCGGTGTCG
AATTCATTGGGGGACAAGAGTCTGAAATTTTCAGATATATGTGACGTTGCAATTGCTGAGGAGATTCGCAGGAAAGGAAATAGAAAGAAGTCTGCATCCACTTCTGGTGG
TGAGAACCATTTAGAGTCAGCATTGATAACGCAGAGTAAAGGCAAAGGGAAGATGAGATATGGGAAACAGCAGGGACATAGCCAGAGGAAGAATAGATGGAAACTCATGA
GGGGATCTGAGGTAGTGGTTGTTGGCCACAAAAAAGCTTCAATGTACATGTTGAGGTTTGGTGTTGCCAGAGAATCAGAGAGACGGTTTATGCACAGGGTTGCAGATAGT
TTTGGGGGAGACTTGAAAGAACTAGCAGCATTGACAGTCATGACAGATCAGGAGAATCTGCCATCAGTTCAAGTAAAATAG
mRNA sequence
ATGGATGAACAAGCCGTAGCGATCATCAGGTTGTGCTTGTCAATGAATGTGGCAAGTCTCGTGGCGAATCAGAAAACTGCAATGGGATTGATGAAGGCGCTGACGGACAG
ATATGAAAAACCTTCTGCCAATAGCAAGGTGTATCTCATCACGAGATATTTTAACATTCACATGGAGGAAGGCACGTCGGTGAACTCCCACATCAATAAGGTCACTCAAC
TGATGAACCAGTTGGAGTCGATGGAGATCACTTTCTCAAAGAAGGTAAAGGCTATAAAGCTGTTGTCTTTTTTGCCTGACGGTTGGGAAACGATGAAGACGGCGGTGTCG
AATTCATTGGGGGACAAGAGTCTGAAATTTTCAGATATATGTGACGTTGCAATTGCTGAGGAGATTCGCAGGAAAGGAAATAGAAAGAAGTCTGCATCCACTTCTGGTGG
TGAGAACCATTTAGAGTCAGCATTGATAACGCAGAGTAAAGGCAAAGGGAAGATGAGATATGGGAAACAGCAGGGACATAGCCAGAGGAAGAATAGATGGAAACTCATGA
GGGGATCTGAGGTAGTGGTTGTTGGCCACAAAAAAGCTTCAATGTACATGTTGAGGTTTGGTGTTGCCAGAGAATCAGAGAGACGGTTTATGCACAGGGTTGCAGATAGT
TTTGGGGGAGACTTGAAAGAACTAGCAGCATTGACAGTCATGACAGATCAGGAGAATCTGCCATCAGTTCAAGTAAAATAG
Protein sequence
MDEQAVAIIRLCLSMNVASLVANQKTAMGLMKALTDRYEKPSANSKVYLITRYFNIHMEEGTSVNSHINKVTQLMNQLESMEITFSKKVKAIKLLSFLPDGWETMKTAVS
NSLGDKSLKFSDICDVAIAEEIRRKGNRKKSASTSGGENHLESALITQSKGKGKMRYGKQQGHSQRKNRWKLMRGSEVVVVGHKKASMYMLRFGVARESERRFMHRVADS
FGGDLKELAALTVMTDQENLPSVQVK