; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS028784 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS028784
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationscaffold570:376212..376933
RNA-Seq ExpressionMS028784
SyntenyMS028784
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0016829 - lyase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOY22705.1 Transducin/WD40 repeat-like superfamily protein [Theobroma cacao]1.5e-1347.47Show/hide
Query:  RTWIRIHHNLTGFLMDAAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW
        R+W+ ++         +   EI+K NGRNDF LWR ++RAL   QGLL+ L+GKE L   LSD EK  LM  A SV  L+L+DE+L EV DE   A +W
Subjt:  RTWIRIHHNLTGFLMDAAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW

KAF7811757.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora]4.4e-1354.76Show/hide
Query:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI
        AA  EI+K N  NDF LWR ++RAL   QGLL+ L GKE LS  LS++EK  L+  ALS   LSLAD++L EV DE   A LW+
Subjt:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI

KAF7817786.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora]2.6e-1354.76Show/hide
Query:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI
        AA  EI+K NG NDF LWR ++RAL   QGLL+ L GKE L   LS++EK  L+  ALS   LSLAD++L EV DE   A LW+
Subjt:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI

KAF7829260.1 putative rhamnogalacturonate lyase B isoform X3 [Senna tora]9.8e-1353.57Show/hide
Query:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI
        AA  EI+K NG NDF LW  ++RAL   QGLL+ L GKE L   LS++EK  L+  ALS   LSLAD++L EV DE   A LW+
Subjt:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI

KAF7841267.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora]8.9e-1455.95Show/hide
Query:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI
        AA  EI+K NG NDF LWR ++RAL   QGLL+ L GKE L   LS++EK  L+  ALS   LSLAD++L EVVDE   A LW+
Subjt:  AAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI

TrEMBL top hitse value%identityAlignment
A0A061EWE5 CCHC-type domain-containing protein5.3e-1254.43Show/hide
Query:  EIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW
        EI+K NGRNDF LWR ++ AL   QGLL+ L+GKE L   LSD EK  LM  A S   L+L+DE+L EV DE   A +W
Subjt:  EIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW

A0A061FYY2 Transducin/WD40 repeat-like superfamily protein7.3e-1447.47Show/hide
Query:  RTWIRIHHNLTGFLMDAAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW
        R+W+ ++         +   EI+K NGRNDF LWR ++RAL   QGLL+ L+GKE L   LSD EK  LM  A SV  L+L+DE+L EV DE   A +W
Subjt:  RTWIRIHHNLTGFLMDAAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW

A0A2N9EX70 Uncharacterized protein4.4e-1154.67Show/hide
Query:  AMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDE
        A  EI+K NG+NDF LWR ++R L   QGLL  L+GK+ L   LS+ EK+ L+  A S  QLSLADE+L EVV+E
Subjt:  AMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDE

A0A2N9JAK8 gag_pre-integrs domain-containing protein4.8e-1351.81Show/hide
Query:  AMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI
        A +EI+K NG+NDF LW  ++R L   QGLL  L+GK+ L   LS+ EK+ L+  A S  QLSLADE+L EVV+E   A+LW+
Subjt:  AMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI

A0A5N6MMT4 Integrase catalytic domain-containing protein3.8e-1050.63Show/hide
Query:  EIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW
        +I+K NG+NDF LWR ++RAL   QGL++ L+G+  L  GLSD EK+ LM  A S   LSL D +L EV  ET  A +W
Subjt:  EIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKEVLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCGGCGATGAATGAGATCAAGACATCGAAAGGTAAAGAACCTCTATCTGACGGATTAAGCGATGAGAAGTTACTTACATTTGAAGCTGCATTATTTGATTATGT
TTTTCGTGCAATGACCAACTCCTCTGGGACTGATAATCCGAGAACTGGAAGATTGCGGACGTGGATCAGGATTCACCACAATCTTACTGGGTTTTTAATGGATGCGGCGA
TGAGTGAGATCAAGAAATTGAACGGAAGAAATGATTTCCGTCTGTGGCGTGAAGAAGTGAGAGCATTATCAGATTCTCAAGGATTACTCGAGACATTGAGAGGTAAAGAA
GTTCTGTCTGACGGATTAAGCGATCGCGAAAAGCAGCCTCTTATGAGTAACGCACTTAGCGTTACACAATTGTCATTAGCGGATGAAATTCTGATTGAAGTTGTTGATGA
GACGTGTCCGGCAAAGCTATGGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCGGCGATGAATGAGATCAAGACATCGAAAGGTAAAGAACCTCTATCTGACGGATTAAGCGATGAGAAGTTACTTACATTTGAAGCTGCATTATTTGATTATGT
TTTTCGTGCAATGACCAACTCCTCTGGGACTGATAATCCGAGAACTGGAAGATTGCGGACGTGGATCAGGATTCACCACAATCTTACTGGGTTTTTAATGGATGCGGCGA
TGAGTGAGATCAAGAAATTGAACGGAAGAAATGATTTCCGTCTGTGGCGTGAAGAAGTGAGAGCATTATCAGATTCTCAAGGATTACTCGAGACATTGAGAGGTAAAGAA
GTTCTGTCTGACGGATTAAGCGATCGCGAAAAGCAGCCTCTTATGAGTAACGCACTTAGCGTTACACAATTGTCATTAGCGGATGAAATTCTGATTGAAGTTGTTGATGA
GACGTGTCCGGCAAAGCTATGGATTTGA
Protein sequenceShow/hide protein sequence
MDAAMNEIKTSKGKEPLSDGLSDEKLLTFEAALFDYVFRAMTNSSGTDNPRTGRLRTWIRIHHNLTGFLMDAAMSEIKKLNGRNDFRLWREEVRALSDSQGLLETLRGKE
VLSDGLSDREKQPLMSNALSVTQLSLADEILIEVVDETCPAKLWI