; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g00210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g00210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:109558..112318
RNA-Seq ExpressionMoc07g00210
SyntenyMoc07g00210
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060208.1 Integrase, catalytic core [Cucumis melo var. makuwa]4.3e-2449.3Show/hide
Query:  VDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLV
        VDSGA+NHVT DY N+ +PSEY G E   VGN N+  IS  G S L+     + LENVL VPD  KNL+SVSKL +DNNV LEF+ D C VKD  +G+ +
Subjt:  VDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLV

Query:  LKGSLKDGLYQLDTGGAITSSASSHTTSCL---ESDISSSGS
        ++G L+DGLY L   G +      H  S +   + D+  SGS
Subjt:  LKGSLKDGLYQLDTGGAITSSASSHTTSCL---ESDISSSGS

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.7e-2855.65Show/hide
Query:  YVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKL
        Y+DSGA+NH+T +Y N+ +PSEY G EK  VGN +   IS+IG + LT     L+L+NVLCVPD  KNL+SVSKLA+DNNV++EFH   C +KD  +G+ 
Subjt:  YVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKL

Query:  VLKGSLKDGLYQLDT
        +L  ++KDGLY LDT
Subjt:  VLKGSLKDGLYQLDT

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]1.9e-2455.56Show/hide
Query:  YVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGK
        Y+DSGA+NHVT +  N+ +P+EY G EK TVGN N+  IS++G +CLT     L L+N+LCVPD AKNL+SVSKLA+DN++++EFH   C +KD  +GK
Subjt:  YVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGK

XP_016902204.1 PREDICTED: uncharacterized protein LOC107991581 isoform X4 [Cucumis melo]1.5e-2446.94Show/hide
Query:  QAYLNNTQVNNSRQTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGKYVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG
        Q +  N Q N S +  +  N Q   G R       N NN   CQ+C    DSGA+NHVT +  N+ +P+EY G EK TVGN N+  IS++G +CLT    
Subjt:  QAYLNNTQVNNSRQTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGKYVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG

Query:  LLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGK
         L L+N+LCVPD AKNL+SVSKLA+DN++++EFH   C +KD  +GK
Subjt:  LLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGK

XP_022153251.1 uncharacterized protein LOC111020787 [Momordica charantia]2.3e-2547.43Show/hide
Query:  TSRISRESMNAVLVHARSLPATALLEHDCALLQRWFYERRTYTSSRETILIDYGEKKMRTAKNLS---CITP----------------------------
        TS I+ ESMNAVLVHARSLP TALLEH  ALLQRWFYERRTY SSRETIL DYGE KMRTA+NLS    ITP                            
Subjt:  TSRISRESMNAVLVHARSLPATALLEHDCALLQRWFYERRTYTSSRETILIDYGEKKMRTAKNLS---CITP----------------------------

Query:  --------SHPLTTISW-----------------------------SDEEDWILLDDFVDRKVESPRYVPHVGRR
                SH +   ++                              DEEDWIL DDFVDR VE+PRYVP +GRR
Subjt:  --------SHPLTTISW-----------------------------SDEEDWILLDDFVDRKVESPRYVPHVGRR

TrEMBL top hitse value%identityAlignment
A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-948.2e-2955.65Show/hide
Query:  YVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKL
        Y+DSGA+NH+T +Y N+ +PSEY G EK  VGN +   IS+IG + LT     L+L+NVLCVPD  KNL+SVSKLA+DNNV++EFH   C +KD  +G+ 
Subjt:  YVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKL

Query:  VLKGSLKDGLYQLDT
        +L  ++KDGLY LDT
Subjt:  VLKGSLKDGLYQLDT

A0A803P4G6 Uncharacterized protein2.0e-2740.44Show/hide
Query:  QTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGK----------------------------------------------YVDSGASNHVTADYRNIIH
        Q Q  NN  R   GRSRGRG +NNN+R  CQVCGK                                              +VDSGASNH+T+   ++  
Subjt:  QTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGK----------------------------------------------YVDSGASNHVTADYRNIIH

Query:  PSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG-LLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDGLYQL-----
         SEYGG E  TVG+ +K  ISHIG   L ++ G LL L+ +L VP  AKNL+SV KL  DNNV +EF++D+CLVKD  + K++L+G LKDGLYQ+     
Subjt:  PSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG-LLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDGLYQL-----

Query:  DTGGAITSSASSHTTSCLESDISSS
            A+ S  S  T +   S +S S
Subjt:  DTGGAITSSASSHTTSCLESDISSS

A0A803PM38 Uncharacterized protein6.3e-2947.25Show/hide
Query:  NNTQVNNSRQTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGKYVDS-------GASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCL-T
        N    NN+ +  H NN  R    RSRGRG   +  R  CQVCGKY  S       GASNH+T++   +    EY G EK TV N N+  I HIG   L T
Subjt:  NNTQVNNSRQTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGKYVDS-------GASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCL-T

Query:  SDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDGLYQLDTGGAITSSASSHTTSC
             L L+ +L VP   KNLLS+SKL  DNNV +EF +DLC VKD  +G++VLKG LKDGLYQ D   + TS +S+ + SC
Subjt:  SDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDGLYQLDTGGAITSSASSHTTSC

A0A803PR45 Uncharacterized protein4.8e-2951.35Show/hide
Query:  NNNQRDGGGRSRGRGHWNNNNRLICQVCGKYVDSGA----SNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG-LLDLENVLCVPDN
        N+N   GGG +RGRG    N +  CQVCG+Y  S A    SNH+TA+  N+ + S Y G ++ TVG+ N+  ISH+G S L SD G  L L +VL VP  
Subjt:  NNNQRDGGGRSRGRGHWNNNNRLICQVCGKYVDSGA----SNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG-LLDLENVLCVPDN

Query:  AKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDGLYQL
        AKNL+S+SKL  DN VF+EF +D+C VKD+ +  +VL+G LKDGLYQL
Subjt:  AKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDGLYQL

A0A803QCY3 Uncharacterized protein2.2e-2641.75Show/hide
Query:  NNSRQTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGKY---------------------------------------------------VDSGASNHV
        NN+ + +HFN   R  GGRSRGRG   NN++  CQVCGKY                                                    DSGASN++
Subjt:  NNSRQTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGKY---------------------------------------------------VDSGASNHV

Query:  TADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG-LLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDG
        TAD   I    EYGG EK TVGN +K +ISH G   L +  G  L L  +L VP  AKN LSVSKL  DN+V +EFH++ C VKDI + +++L+G LKDG
Subjt:  TADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPG-LLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDG

Query:  LYQLDT
        LYQL T
Subjt:  LYQLDT

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-1437.84Show/hide
Query:  VDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLV
        +DSGA++H+T+D+ N+     Y G +   V + +   ISH G + L++    L+L N+L VP+  KNL+SV +L   N V +EF      VKD+++G  +
Subjt:  VDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLV

Query:  LKGSLKDGLYQ
        L+G  KD LY+
Subjt:  LKGSLKDGLYQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-1333.11Show/hide
Query:  WNNNNRLICQVCGKYVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFH
        +N NN L+        DSGA++H+T+D+ N+     Y G +   + + +   I+H G + L +    LDL  VL VP+  KNL+SV +L   N V +EF 
Subjt:  WNNNNRLICQVCGKYVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTSDPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFH

Query:  ADLCLVKDIHSGKLVLKGSLKDGLYQLDTGGAITSSASSHTTSCLESDISS
             VKD+++G  +L+G  KD LY+     A + + S   + C ++  SS
Subjt:  ADLCLVKDIHSGKLVLKGSLKDGLYQLDTGGAITSSASSHTTSCLESDISS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCAGCAAAAATAATGTTCATGGTGGACAACGACAGGCCTACCTTAACAACACCCAAGTTAACAATAGTCGGCAAACCCAACATTTTAATAATAATCAAAGGGA
TGGTGGAGGCAGAAGTCGTGGTAGAGGTCATTGGAACAACAACAATCGTCTCATTTGTCAGGTTTGTGGAAAGTATGTTGACAGTGGTGCCTCAAATCATGTGACAGCGG
ACTATCGCAATATTATACATCCTTCGGAGTATGGAGGTACTGAGAAAGCGACGGTCGGTAATAGAAATAAATTTATGATTTCTCATATTGGTAAATCTTGTTTAACCTCT
GATCCTGGTTTGCTTGATCTTGAGAATGTACTTTGTGTGCCTGACAATGCTAAAAATCTTCTAAGTGTGTCAAAACTTGCTAGAGATAATAATGTGTTTTTGGAATTTCA
TGCTGATTTATGTCTTGTAAAGGACATTCATTCGGGCAAGTTGGTGCTGAAAGGGTCTCTTAAAGATGGACTTTACCAACTTGACACGGGAGGTGCAATTACTAGTAGTG
CTTCAAGTCACACTACGAGTTGCTTGGAGTCGGATATTTCCTCAAGCGGCTCATGGAATATGCTTGTAACACTTGTCCACGAACCTGAAGGACAAGTTCAAGGACGATGC
CATGCAAGAAACGTCATATTAGCAGCAAAGGTCTGCCGGGAAATTAGAGTTCAGGTACTATTTTTCCCAACTAGCAGGATTTCTAGAGAGTCGATGAATGCAGTTCTCGT
CCATGCACGTTCTTTGCCAGCCACTGCACTTCTTGAACATGATTGTGCGCTCTTACAACGCTGGTTTTACGAAAGACGAACCTACACATCCAGTCGCGAAACCATTCTTA
TTGACTACGGTGAGAAGAAGATGCGTACTGCAAAGAACCTCTCTTGTATCACTCCATCACACCCATTGACCACCATAAGTTGGAGCGATGAGGAGGATTGGATTTTACTT
GATGACTTCGTCGACCGTAAAGTGGAGTCGCCCAGATATGTTCCGCATGTTGGCCGACGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGCAGCAAAAATAATGTTCATGGTGGACAACGACAGGCCTACCTTAACAACACCCAAGTTAACAATAGTCGGCAAACCCAACATTTTAATAATAATCAAAGGGA
TGGTGGAGGCAGAAGTCGTGGTAGAGGTCATTGGAACAACAACAATCGTCTCATTTGTCAGGTTTGTGGAAAGTATGTTGACAGTGGTGCCTCAAATCATGTGACAGCGG
ACTATCGCAATATTATACATCCTTCGGAGTATGGAGGTACTGAGAAAGCGACGGTCGGTAATAGAAATAAATTTATGATTTCTCATATTGGTAAATCTTGTTTAACCTCT
GATCCTGGTTTGCTTGATCTTGAGAATGTACTTTGTGTGCCTGACAATGCTAAAAATCTTCTAAGTGTGTCAAAACTTGCTAGAGATAATAATGTGTTTTTGGAATTTCA
TGCTGATTTATGTCTTGTAAAGGACATTCATTCGGGCAAGTTGGTGCTGAAAGGGTCTCTTAAAGATGGACTTTACCAACTTGACACGGGAGGTGCAATTACTAGTAGTG
CTTCAAGTCACACTACGAGTTGCTTGGAGTCGGATATTTCCTCAAGCGGCTCATGGAATATGCTTGTAACACTTGTCCACGAACCTGAAGGACAAGTTCAAGGACGATGC
CATGCAAGAAACGTCATATTAGCAGCAAAGGTCTGCCGGGAAATTAGAGTTCAGGTACTATTTTTCCCAACTAGCAGGATTTCTAGAGAGTCGATGAATGCAGTTCTCGT
CCATGCACGTTCTTTGCCAGCCACTGCACTTCTTGAACATGATTGTGCGCTCTTACAACGCTGGTTTTACGAAAGACGAACCTACACATCCAGTCGCGAAACCATTCTTA
TTGACTACGGTGAGAAGAAGATGCGTACTGCAAAGAACCTCTCTTGTATCACTCCATCACACCCATTGACCACCATAAGTTGGAGCGATGAGGAGGATTGGATTTTACTT
GATGACTTCGTCGACCGTAAAGTGGAGTCGCCCAGATATGTTCCGCATGTTGGCCGACGTTAA
Protein sequenceShow/hide protein sequence
MASSKNNVHGGQRQAYLNNTQVNNSRQTQHFNNNQRDGGGRSRGRGHWNNNNRLICQVCGKYVDSGASNHVTADYRNIIHPSEYGGTEKATVGNRNKFMISHIGKSCLTS
DPGLLDLENVLCVPDNAKNLLSVSKLARDNNVFLEFHADLCLVKDIHSGKLVLKGSLKDGLYQLDTGGAITSSASSHTTSCLESDISSSGSWNMLVTLVHEPEGQVQGRC
HARNVILAAKVCREIRVQVLFFPTSRISRESMNAVLVHARSLPATALLEHDCALLQRWFYERRTYTSSRETILIDYGEKKMRTAKNLSCITPSHPLTTISWSDEEDWILL
DDFVDRKVESPRYVPHVGRR