; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:860825..863649
RNA-Seq ExpressionMoc01g01170
SyntenyMoc01g01170
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]1.1e-3260.14Show/hide
Query:  LDKTIFRYMRCLEEQKVQCAVFMLKDDALL-WGSLQR------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELV
        L +TIFRYMRCLEEQKVQC VFMLKDDA L W S +R                        AIT ++KQ +FLNLKQ NRS+EE++R+FTKLSRFA ELV
Subjt:  LDKTIFRYMRCLEEQKVQCAVFMLKDDALL-WGSLQR------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELV

Query:  DTEAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMS
        DTEA K ER I+ L DE + FV  LS P+YATALR AALIDNRSA  S
Subjt:  DTEAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMS

XP_038880159.1 uncharacterized protein LOC120071839 [Benincasa hispida]8.8e-1938.79Show/hide
Query:  MRCLEEQKVQCAVFMLKDDALLWGSLQR-------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTE
        MRC EEQK+QCA F+L D+A  W  L                           A  R+ KQA+F+NLKQG  ++EE+E KFT+LS FA +LV TEAK+ E
Subjt:  MRCLEEQKVQCAVFMLKDDALLWGSLQR-------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTE

Query:  RLIMGLNDEIQAFVVALSLPNYATALRVAALID-----NRSAKMSDQGVHGESLQEEEAQISSNE
        R + GL DE++  V AL  PNYATA R AA +        S  ++++   G+  + E+  +  ++
Subjt:  RLIMGLNDEIQAFVVALSLPNYATALRVAALID-----NRSAKMSDQGVHGESLQEEEAQISSNE

XP_038883046.1 uncharacterized protein LOC120074107 [Benincasa hispida]7.5e-1844.6Show/hide
Query:  KTIFRYMRCLEEQKVQCAVFMLKDDA-LLWGSLQRAI------------------------TRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDT
        +TIFR+MRC EE K+QC VFML  +  + W S+++ I                        TR+ KQA+FLNLKQG  SME++E++F KLS F  ELV T
Subjt:  KTIFRYMRCLEEQKVQCAVFMLKDDA-LLWGSLQRAI------------------------TRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDT

Query:  EAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALID
        EA +TER I GL   ++  V AL L  Y   LR A  ID
Subjt:  EAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALID

XP_038885815.1 uncharacterized protein LOC120076109 [Benincasa hispida]1.7e-1743.88Show/hide
Query:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWG-SLQRAI------------------------TRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDT
        +TIF +MRC EE K+QCA+FML  +A +W  S+++ I                        TR+ KQA+FLNLKQG  S+E++E++F KLSRF  ELV T
Subjt:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWG-SLQRAI------------------------TRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDT

Query:  EAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALID
        +A +TER I  L   ++  V AL L  Y  ALR A  ID
Subjt:  EAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALID

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]4.0e-1943.66Show/hide
Query:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWGSLQR-------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDT
        +TIFRYM+C E+QKVQCAVFML D A +W  L                           A  R+ KQ +FL L+QG+RS+EE++++F  LSRFA ELV T
Subjt:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWGSLQR-------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDT

Query:  EAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRS
        EA + ER I GL + I+  V A     +  ALR+AA +D +S
Subjt:  EAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRS

TrEMBL top hitse value%identityAlignment
A0A5A7TGL9 Reverse transcriptase8.9e-1745.45Show/hide
Query:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWGSLQRAITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTERLIMGLNDEIQAFVVALSL
        +TIFRYM+C E+QKVQCAVFML D    W      +    K+ +FLNL+QG+ ++E+++ KF  LSRFA E++ TEA + ++ + GL  +IQ FV A   
Subjt:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWGSLQRAITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTERLIMGLNDEIQAFVVALSL

Query:  PNYATALRVA
          +  ALR+A
Subjt:  PNYATALRVA

A0A5A7V1K4 Pol protein3.7e-1539.57Show/hide
Query:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWGSLQRAITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTERLIMGLNDEIQAFVVALSL
        +TIFRYM+C  +QKVQCAVF+LKD ++ W         ++KQ KF NL+ G+ ++E+++ KF  LSRFA ++V  EA + E+ + GL  ++Q FV A   
Subjt:  KTIFRYMRCLEEQKVQCAVFMLKDDALLWGSLQRAITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTERLIMGLNDEIQAFVVALSL

Query:  PNYATALRVAALID-NRSAKMSDQGVHGESL-QEEEAQI
          +A ALR+A  +  +  A +S     G +L Q+ +A++
Subjt:  PNYATALRVAALID-NRSAKMSDQGVHGESL-QEEEAQI

A0A6J1DSJ6 uncharacterized protein LOC1110235125.2e-3360.14Show/hide
Query:  LDKTIFRYMRCLEEQKVQCAVFMLKDDALL-WGSLQR------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELV
        L +TIFRYMRCLEEQKVQC VFMLKDDA L W S +R                        AIT ++KQ +FLNLKQ NRS+EE++R+FTKLSRFA ELV
Subjt:  LDKTIFRYMRCLEEQKVQCAVFMLKDDALL-WGSLQR------------------------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELV

Query:  DTEAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMS
        DTEA K ER I+ L DE + FV  LS P+YATALR AALIDNRSA  S
Subjt:  DTEAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMS

A0A6J1EKD9 uncharacterized protein LOC1114354601.2e-1637.09Show/hide
Query:  KTIFRYMRCLEEQKVQCAVFMLKDDA--------------------LLWGSLQR--------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHEL
        +TIF +M C E+QKV+CA FMLK +A                    + W  ++R        A+  +  +  F++LKQGN ++EE+E +FT+LSRFA E 
Subjt:  KTIFRYMRCLEEQKVQCAVFMLKDDA--------------------LLWGSLQR--------AITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHEL

Query:  VDTEAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMSDQ
        +DTE K+T + I+GL  EIQ  V A++  +Y  AL  A+++D    K S++
Subjt:  VDTEAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMSDQ

A0A6J1FB78 uncharacterized protein LOC1114438457.6e-1637.93Show/hide
Query:  MRCLEEQKVQCAVFMLKDDA-LLWGSLQRAI------------------------TRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTE
        M+C EE ++ C  ++L++DA L W S  RAI                         R+KKQ +FL+++QGNRS+EE+ER+FT+LSRFA  +V  E  K E
Subjt:  MRCLEEQKVQCAVFMLKDDA-LLWGSLQRAI------------------------TRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTE

Query:  RLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMSDQGVH
          +MGL  +I+  V+    P+YATAL++A  +D +     D   H
Subjt:  RLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMSDQGVH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCATGTAGAAGAAGGTCTGTGAGACGGAGTGGGCTAGATAGGGATGTTGACCCTGAGACAATAGAGCCGACGGTGAATAACCAAACTTCAGGTCAAACGGAGAA
TCCACCAGTGGTTCAACCTACTGGTCGGACAGGAATTCCACCACTGTTTCAACCTACTAGTCGGACGGAAATTCCACCACTGGTTCAACCTCCTGGTCAAATGGGGAATC
CACCAGTGGGTCAAACTTCTGGACAGGCACAGCCTACTATAGCATCCAACGTTGGTAGAGGCTTGGTTATCCTCGATAAGACTATCTTTCGTTATATGAGGTGTCTAGAG
GAACAAAAAGTGCAGTGTGCTGTTTTCATGCTGAAAGATGATGCCCTTTTATGGGGGAGTCTGCAGAGAGCGATCACCCGATTCAAGAAACAAGCGAAGTTTCTAAACTT
GAAGCAAGGTAACAGATCAATGGAGGAGTTTGAGAGGAAATTCACAAAATTGTCTCGTTTTGCTCATGAGCTAGTAGACACAGAGGCCAAGAAGACTGAACGATTAATCA
TGGGCCTGAATGATGAGATTCAAGCTTTCGTGGTAGCTCTCTCCCTGCCCAATTATGCTACAGCACTTCGAGTAGCTGCATTGATTGATAACCGTTCAGCAAAAATGAGT
GATCAAGGAGTGCATGGGGAAAGCCTACAAGAGGAGGAAGCTCAAATCAGCTCAAATGAAGCTTATGCTCAAAGTCAACCCAATCATAAGCCAATCATTATGCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCATGTAGAAGAAGGTCTGTGAGACGGAGTGGGCTAGATAGGGATGTTGACCCTGAGACAATAGAGCCGACGGTGAATAACCAAACTTCAGGTCAAACGGAGAA
TCCACCAGTGGTTCAACCTACTGGTCGGACAGGAATTCCACCACTGTTTCAACCTACTAGTCGGACGGAAATTCCACCACTGGTTCAACCTCCTGGTCAAATGGGGAATC
CACCAGTGGGTCAAACTTCTGGACAGGCACAGCCTACTATAGCATCCAACGTTGGTAGAGGCTTGGTTATCCTCGATAAGACTATCTTTCGTTATATGAGGTGTCTAGAG
GAACAAAAAGTGCAGTGTGCTGTTTTCATGCTGAAAGATGATGCCCTTTTATGGGGGAGTCTGCAGAGAGCGATCACCCGATTCAAGAAACAAGCGAAGTTTCTAAACTT
GAAGCAAGGTAACAGATCAATGGAGGAGTTTGAGAGGAAATTCACAAAATTGTCTCGTTTTGCTCATGAGCTAGTAGACACAGAGGCCAAGAAGACTGAACGATTAATCA
TGGGCCTGAATGATGAGATTCAAGCTTTCGTGGTAGCTCTCTCCCTGCCCAATTATGCTACAGCACTTCGAGTAGCTGCATTGATTGATAACCGTTCAGCAAAAATGAGT
GATCAAGGAGTGCATGGGGAAAGCCTACAAGAGGAGGAAGCTCAAATCAGCTCAAATGAAGCTTATGCTCAAAGTCAACCCAATCATAAGCCAATCATTATGCCCTAA
Protein sequenceShow/hide protein sequence
MPPCRRRSVRRSGLDRDVDPETIEPTVNNQTSGQTENPPVVQPTGRTGIPPLFQPTSRTEIPPLVQPPGQMGNPPVGQTSGQAQPTIASNVGRGLVILDKTIFRYMRCLE
EQKVQCAVFMLKDDALLWGSLQRAITRFKKQAKFLNLKQGNRSMEEFERKFTKLSRFAHELVDTEAKKTERLIMGLNDEIQAFVVALSLPNYATALRVAALIDNRSAKMS
DQGVHGESLQEEEAQISSNEAYAQSQPNHKPIIMP