CuGenDBv2

Moc04g21970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g21970
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr4:15979891..15989960
RNA-Seq Expression: Moc04g21970
Synteny: Moc04g21970
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022141216.1 uncharacterized protein LOC111011669 [Momordica charantia] | e value: 2.0e-04 | % identity: 71.79
Query:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ
        AFLMGLN+SF+Q+RAQLLLMEP  TINRAF+L A   +Q
Subjt:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ

XP_022152756.1 uncharacterized protein LOC111020399 [Momordica charantia] | e value: 1.1e-07 | % identity: 37.58
Query:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQL-NKVKAGSGPDSGINHVAITCSHIFS----FHTAVDQWVIDSGASTHICY
        FLMGLN+SFSQ+R QLLLMEP PTINR F+L +  A Q   L +     L   + A S   SG +  +++ S  +S     HT     +  S  +     
Subjt:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQL-NKVKAGSGPDSGINHVAITCSHIFS----FHTAVDQWVIDSGASTHICY

Query:  SRDFLSTFERFLVSLYFCLISLASQWDKSSLKTIGSARYWQGLYLLSTK
        S  F             C++      DKSS K IG A  W GLYLLS +
Subjt:  SRDFLSTFERFLVSLYFCLISLASQWDKSSLKTIGSARYWQGLYLLSTK

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia] | e value: 8.0e-06 | % identity: 84.62
Query:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ
        AFLMGLN SFSQIRAQLLLMEPAPTINRAFAL A    Q
Subjt:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ

XP_022158736.1 uncharacterized protein LOC111025199 [Momordica charantia] | e value: 2.8e-06 | % identity: 82.05
Query:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQC
        FLMGLNDSFSQIRAQLLLMEPAP+IN AFAL A    QC
Subjt:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQC

XP_038875043.1 uncharacterized protein LOC120067569 [Benincasa hispida] | e value: 3.5e-09 | % identity: 45.54
Query:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQLNKVKAGSGPDSGIN---HVAITCSHIFSFHTAVDQWVIDSGASTHICYS
        AFLMGLNDS + IR+QLLLMEP P+INRAF+L     DQ +   +     ++  K+ +     IN   HV   CS       + +QW++DSGASTHICY+
Subjt:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQLNKVKAGSGPDSGIN---HVAITCSHIFSFHTAVDQWVIDSGASTHICYS

Query:  R
        +
Subjt:  R

TrEMBL top hits (e value, % identity, alignment)
A0A2N9G500 Uncharacterized protein | e value: 2.1e-07 | % identity: 38.6
Query:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQLNKVKAGSGPDSGINHVAITCSHIFSFHTAV---DQWVIDSGASTHICYSR
        FLMGLNDSF  +RAQ+L+MEP P IN+AF+L +L++   Q  L+  Q  L          +GI   A   S   + H AV    Q++ D+GA+ H+ YS 
Subjt:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQLNKVKAGSGPDSGINHVAITCSHIFSFHTAV---DQWVIDSGASTHICYSR

Query:  DFLSTFERFLVSLY
          LS+F     +++
Subjt:  DFLSTFERFLVSLY

A0A6J1CIG1 uncharacterized protein LOC111011669 | e value: 9.6e-05 | % identity: 71.79
Query:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ
        AFLMGLN+SF+Q+RAQLLLMEP  TINRAF+L A   +Q
Subjt:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ

A0A6J1DIP8 uncharacterized protein LOC111020399 | e value: 5.4e-08 | % identity: 37.58
Query:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQL-NKVKAGSGPDSGINHVAITCSHIFS----FHTAVDQWVIDSGASTHICY
        FLMGLN+SFSQ+R QLLLMEP PTINR F+L +  A Q   L +     L   + A S   SG +  +++ S  +S     HT     +  S  +     
Subjt:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQL-NKVKAGSGPDSGINHVAITCSHIFS----FHTAVDQWVIDSGASTHICY

Query:  SRDFLSTFERFLVSLYFCLISLASQWDKSSLKTIGSARYWQGLYLLSTK
        S  F             C++      DKSS K IG A  W GLYLLS +
Subjt:  SRDFLSTFERFLVSLYFCLISLASQWDKSSLKTIGSARYWQGLYLLSTK

A0A6J1DNP7 uncharacterized protein LOC111022065 | e value: 3.9e-06 | % identity: 84.62
Query:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ
        AFLMGLN SFSQIRAQLLLMEPAPTINRAFAL A    Q
Subjt:  AFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQ

A0A6J1E1U3 uncharacterized protein LOC111025199 | e value: 1.3e-06 | % identity: 82.05
Query:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQC
        FLMGLNDSFSQIRAQLLLMEPAP+IN AFAL A    QC
Subjt:  FLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTGAGAGTGCAGTTGGAATGTGAACTACCTGTCAGCGAGATTGATGGGTGTGTTTTGGAATCTAGAGCTTCACATCCACAATGCCAGCATCCTGGATATACC
AAGCCACCGAATCCAATCTCTTCCTTGGAGAACAGAGTCTGCCACTGCCACAAAAGTAAGGAGTATCTAGCAAATTCTTTCTCCACTTTGGCGGTTACCTTCATT
CGTTCCGCTGTTGCCATGGCTATGGATGATCGTCTCAATCCGACTGCTACGGATGAACATCTCAATCTGATTGCTACTTCGTCTTCATCTCTCAATCAACCAACT
CTTGAACAAGTATGTGATGCTTTTCTGATGGGCTTGAATGATTCGTTTAGTCAAATTAGGGCTCAATTACTCCTTATGGAGCCAGCACCCACTATTAATCGCGCG
TTTGCTCTTTTTGCTCTCAATGCAGATCAGTGTCAAGGGTTATTAAATTTGCTTCAATCTCAATTGAATAAAGTGAAGGCTGGATCTGGTCCCGATTCTGGCATT
AATCATGTAGCAATTACTTGTTCTCATATTTTTTCCTTTCACACTGCTGTTGATCAGTGGGTGATTGATTCTGGTGCATCTACTCATATTTGTTATTCTCGAGAT
TTTTTATCAACCTTCGAGCGGTTTCTAGTGTCACTGTATTTTTGCCTGATCAGTCTTGCATCTCAGTGGGACAAGTCCTCTTTGAAGACGATTGGCAGTGCTAGA
TATTGGCAAGGACTCTATTTGCTGTCCACCAAGCCCACGGTTTCTGCTGCTGCTACTAGTCCTATTTCTGCTGCTATAAACTCTGATCCATCCAATGTCATTCAT
GCTACTACCCATACTGATTTGCCTAATGCTTTGTGTACTAATTTGAGTTCTATGCCTCCTGATTTGAATTCTGCTCCACTTTCTTATGCTATGAATACTGCTATT
AATGCACCTACTGAACCTACTGATATGAATACTATTCCTACTGATATGGTTTCTCATATGGCTGTTGATATAACCAATGCTTCTATTGATGTGCCTACTAGTACT
TTTTCTGCTGTACCTATCCCTGATATACCTGATATGTCTCATCCCCAACCTAGTGTTGCTTCACTAAAATACTCACGGCTGGAAACACTCCACGAGTTTGGTGAG
CGATCAAACAAGCATGCTCTGACGGGGCAAGTTGGCGCCAAGGGTGGTCGCCCAAGGGATGGAGCATCAATGGATGGGTCTCCTATAGAGGCTGGCAGAGTGGCC
GATGGAGGGGCGTACTTAGGGCACTGGCGACAGGTAGGGCACGCTGACGGTGCTGGCATGGTCGGCATGCGCGTGGGTAAAGGCATGCGGCTGAGGCAGCATGAG
AGCAAGCGATTGAAGCGAGCGGGTGCGCGGGCGTGTGCGGATGTGCAGCAAGAAGCATGTGAATGTGTGCGCGGATGGTGTGCGCGGTTGAGGCAGCATGCGCGC
GAACGATTTGAGTGCATGCACGAGCTAGATGCATGCATGCGTGTACGGGGCATTGGCGCGCGTGGAGATAAGCAATTGAGCAAAAGGCGTGCACGCAGCGGGCGA
GCTCTTGTGACGTTAACGCGGTGCTGA
mRNA sequence
ATGGTGAGAGTGCAGTTGGAATGTGAACTACCTGTCAGCGAGATTGATGGGTGTGTTTTGGAATCTAGAGCTTCACATCCACAATGCCAGCATCCTGGATATACC
AAGCCACCGAATCCAATCTCTTCCTTGGAGAACAGAGTCTGCCACTGCCACAAAAGTAAGGAGTATCTAGCAAATTCTTTCTCCACTTTGGCGGTTACCTTCATT
CGTTCCGCTGTTGCCATGGCTATGGATGATCGTCTCAATCCGACTGCTACGGATGAACATCTCAATCTGATTGCTACTTCGTCTTCATCTCTCAATCAACCAACT
CTTGAACAAGTATGTGATGCTTTTCTGATGGGCTTGAATGATTCGTTTAGTCAAATTAGGGCTCAATTACTCCTTATGGAGCCAGCACCCACTATTAATCGCGCG
TTTGCTCTTTTTGCTCTCAATGCAGATCAGTGTCAAGGGTTATTAAATTTGCTTCAATCTCAATTGAATAAAGTGAAGGCTGGATCTGGTCCCGATTCTGGCATT
AATCATGTAGCAATTACTTGTTCTCATATTTTTTCCTTTCACACTGCTGTTGATCAGTGGGTGATTGATTCTGGTGCATCTACTCATATTTGTTATTCTCGAGAT
TTTTTATCAACCTTCGAGCGGTTTCTAGTGTCACTGTATTTTTGCCTGATCAGTCTTGCATCTCAGTGGGACAAGTCCTCTTTGAAGACGATTGGCAGTGCTAGA
TATTGGCAAGGACTCTATTTGCTGTCCACCAAGCCCACGGTTTCTGCTGCTGCTACTAGTCCTATTTCTGCTGCTATAAACTCTGATCCATCCAATGTCATTCAT
GCTACTACCCATACTGATTTGCCTAATGCTTTGTGTACTAATTTGAGTTCTATGCCTCCTGATTTGAATTCTGCTCCACTTTCTTATGCTATGAATACTGCTATT
AATGCACCTACTGAACCTACTGATATGAATACTATTCCTACTGATATGGTTTCTCATATGGCTGTTGATATAACCAATGCTTCTATTGATGTGCCTACTAGTACT
TTTTCTGCTGTACCTATCCCTGATATACCTGATATGTCTCATCCCCAACCTAGTGTTGCTTCACTAAAATACTCACGGCTGGAAACACTCCACGAGTTTGGTGAG
CGATCAAACAAGCATGCTCTGACGGGGCAAGTTGGCGCCAAGGGTGGTCGCCCAAGGGATGGAGCATCAATGGATGGGTCTCCTATAGAGGCTGGCAGAGTGGCC
GATGGAGGGGCGTACTTAGGGCACTGGCGACAGGTAGGGCACGCTGACGGTGCTGGCATGGTCGGCATGCGCGTGGGTAAAGGCATGCGGCTGAGGCAGCATGAG
AGCAAGCGATTGAAGCGAGCGGGTGCGCGGGCGTGTGCGGATGTGCAGCAAGAAGCATGTGAATGTGTGCGCGGATGGTGTGCGCGGTTGAGGCAGCATGCGCGC
GAACGATTTGAGTGCATGCACGAGCTAGATGCATGCATGCGTGTACGGGGCATTGGCGCGCGTGGAGATAAGCAATTGAGCAAAAGGCGTGCACGCAGCGGGCGA
GCTCTTGTGACGTTAACGCGGTGCTGA
Protein sequence
MVRVQLECELPVSEIDGCVLESRASHPQCQHPGYTKPPNPISSLENRVCHCHKSKEYLANSFSTLAVTFIRSAVAMAMDDRLNPTATDEHLNLIATSSSSLNQPT
LEQVCDAFLMGLNDSFSQIRAQLLLMEPAPTINRAFALFALNADQCQGLLNLLQSQLNKVKAGSGPDSGINHVAITCSHIFSFHTAVDQWVIDSGASTHICYSRD
FLSTFERFLVSLYFCLISLASQWDKSSLKTIGSARYWQGLYLLSTKPTVSAAATSPISAAINSDPSNVIHATTHTDLPNALCTNLSSMPPDLNSAPLSYAMNTAI
NAPTEPTDMNTIPTDMVSHMAVDITNASIDVPTSTFSAVPIPDIPDMSHPQPSVASLKYSRLETLHEFGERSNKHALTGQVGAKGGRPRDGASMDGSPIEAGRVA
DGGAYLGHWRQVGHADGAGMVGMRVGKGMRLRQHESKRLKRAGARACADVQQEACECVRGWCARLRQHARERFECMHELDACMRVRGIGARGDKQLSKRRARSGR
ALVTLTRC