; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g17780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g17780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:11799739..11800584
RNA-Seq ExpressionMoc03g17780
SyntenyMoc03g17780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]9.2e-2234.62Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLST-HNPVKQPSIVNNDPNRR
        ++K  +D LA  G  I   D +LHIL G+G EY+SVV  +T +  S SL +V +LLL  E RIE ++    + + PSVN++T  +  K  +   + P  R
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLST-HNPVKQPSIVNNDPNRR

Query:  GKNSGQ--------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTND
        G+  G+                                RF++ F      S   +     Q +R S S PP    + + E   E  W+PDSGAS+HVTND
Subjt:  GKNSGQ--------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTND

Query:  IGNLTIGFEYLRDNKVLVGNGASLDVLNVGSSFL
        +GNL++  EY   +KV VGNGA L + N+G S L
Subjt:  IGNLTIGFEYLRDNKVLVGNGASLDVLNVGSSFL

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]1.8e-2538.52Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--
        ++K LVD+LA AG+K++  DH++HIL GL  E++S VSVI+ +  + +LQ+VYSLLL  E R E +S IN +G+LPSVNL+  T N     SI    P  
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--

Query:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL
         N R KNSG                               RFE+ F GP   S  Q  +KFS  S  S +                 QP  N    AF  
Subjt:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL

Query:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG
        Q + N+++ W+PDSGA+NHVT++  NL    EY  DN+V +GNG
Subjt:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]8.0e-2638.62Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--
        ++K LVD+LA AG+K++  DH++HIL GL  E++S VSVI+ +  + +LQ+VYSLLL  E R E +S IN +G+LPSVNL+  T N     SI    P  
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--

Query:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL
         N R KNSG                               RFE+ F GP   S  Q  +KFS  S  S +                 QP  N    AF  
Subjt:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL

Query:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGAS
        Q + N+++ W+PDSGA+NHVT++  NL    EY  DN+V +GNG S
Subjt:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGAS

XP_022152240.1 uncharacterized protein LOC111020007 [Momordica charantia]4.7e-2645.78Show/hide
Query:  LLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI-----VNNDPNRRGKNSGQR--------------------------FERNFQGPYAHSPHQNT
        ++L   +RI+HHS+IN +GSLPSVNL+T +   Q S+      +ND N+ G+N G +                          F+R+FQG  A+S H N+
Subjt:  LLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI-----VNNDPNRRGKNSGQR--------------------------FERNFQGPYAHSPHQNT

Query:  NKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGA
        ++FSQ +    S   FNA TLQH+LNKE+QWFPDSG SNHV +D+ NL I  EYL DNKVL+GNGA
Subjt:  NKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGA

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]7.3e-2736.76Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI---------
        +IKNLVD+LA+AG+K+S  DH++HIL GLG E+D+++SVIT ++   +LQ+V SLLL QE R E  + IN +GSLPSVNL+ ++  K+ ++         
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI---------

Query:  -----------VNNDPNRR---GKNSGQ----------------RFERNFQGPYAHSPHQNTNKFS--------------------------QISRQSSS
                    N   NRR   G N  Q                RFERNF G     P+ N N FS                            S  S+S
Subjt:  -----------VNNDPNRR---GKNSGQ----------------RFERNFQGPYAHSPHQNTNKFS--------------------------QISRQSSS

Query:  QPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG
             A  +  + N++S W+ DSG +NHVTN+ GN ++G EY  D K+ VGNG
Subjt:  QPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein4.4e-2234.62Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLST-HNPVKQPSIVNNDPNRR
        ++K  +D LA  G  I   D +LHIL G+G EY+SVV  +T +  S SL +V +LLL  E RIE ++    + + PSVN++T  +  K  +   + P  R
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLST-HNPVKQPSIVNNDPNRR

Query:  GKNSGQ--------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTND
        G+  G+                                RF++ F      S   +     Q +R S S PP    + + E   E  W+PDSGAS+HVTND
Subjt:  GKNSGQ--------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTND

Query:  IGNLTIGFEYLRDNKVLVGNGASLDVLNVGSSFL
        +GNL++  EY   +KV VGNGA L + N+G S L
Subjt:  IGNLTIGFEYLRDNKVLVGNGASLDVLNVGSSFL

A0A6J1C6N9 dr1-associated corepressor homolog isoform X18.7e-2638.52Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--
        ++K LVD+LA AG+K++  DH++HIL GL  E++S VSVI+ +  + +LQ+VYSLLL  E R E +S IN +G+LPSVNL+  T N     SI    P  
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--

Query:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL
         N R KNSG                               RFE+ F GP   S  Q  +KFS  S  S +                 QP  N    AF  
Subjt:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL

Query:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG
        Q + N+++ W+PDSGA+NHVT++  NL    EY  DN+V +GNG
Subjt:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG

A0A6J1C8R2 dr1-associated corepressor homolog isoform X23.9e-2638.62Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--
        ++K LVD+LA AG+K++  DH++HIL GL  E++S VSVI+ +  + +LQ+VYSLLL  E R E +S IN +G+LPSVNL+  T N     SI    P  
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLS--THNPVKQPSIVNNDP--

Query:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL
         N R KNSG                               RFE+ F GP   S  Q  +KFS  S  S +                 QP  N    AF  
Subjt:  -NRRGKNSGQ------------------------------RFERNFQGPYAHSPHQNTNKFSQISRQSSS-----------------QPPFN----AFTL

Query:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGAS
        Q + N+++ W+PDSGA+NHVT++  NL    EY  DN+V +GNG S
Subjt:  QHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGAS

A0A6J1DDD5 uncharacterized protein LOC1110200072.3e-2645.78Show/hide
Query:  LLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI-----VNNDPNRRGKNSGQR--------------------------FERNFQGPYAHSPHQNT
        ++L   +RI+HHS+IN +GSLPSVNL+T +   Q S+      +ND N+ G+N G +                          F+R+FQG  A+S H N+
Subjt:  LLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI-----VNNDPNRRGKNSGQR--------------------------FERNFQGPYAHSPHQNT

Query:  NKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGA
        ++FSQ +    S   FNA TLQH+LNKE+QWFPDSG SNHV +D+ NL I  EYL DNKVL+GNGA
Subjt:  NKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGA

A0A6J1DLT9 uncharacterized protein LOC1110217573.5e-2736.76Show/hide
Query:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI---------
        +IKNLVD+LA+AG+K+S  DH++HIL GLG E+D+++SVIT ++   +LQ+V SLLL QE R E  + IN +GSLPSVNL+ ++  K+ ++         
Subjt:  EIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSI---------

Query:  -----------VNNDPNRR---GKNSGQ----------------RFERNFQGPYAHSPHQNTNKFS--------------------------QISRQSSS
                    N   NRR   G N  Q                RFERNF G     P+ N N FS                            S  S+S
Subjt:  -----------VNNDPNRR---GKNSGQ----------------RFERNFQGPYAHSPHQNTNKFS--------------------------QISRQSSS

Query:  QPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG
             A  +  + N++S W+ DSG +NHVTN+ GN ++G EY  D K+ VGNG
Subjt:  QPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.9e-1126.67Show/hide
Query:  DALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRI--EHHSTINP--NGSLPSVNLSTHNPVKQPSIVNNDPNRRGKN
        D LA+ G+ + H + V  +L+ L  EY  V+  I  KDT P+L +++  LL+ E++I     +T+ P    ++   N +T N     +  N   NR   N
Subjt:  DALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRI--EHHSTINP--NGSLPSVNLSTHNPVKQPSIVNNDPNRRGKN

Query:  SG---QRFERNFQ----------------GPYAHSPHQNTNKFSQISRQSSSQPPFNAFTLQHELN-------KESQWFPDSGASNHVTNDIGNLTIGFE
        +    Q+   NF                 G   HS  + +     +S  +S QPP      Q   N         + W  DSGA++H+T+D  NL++   
Subjt:  SG---QRFERNFQ----------------GPYAHSPHQNTNKFSQISRQSSSQPPFNAFTLQHELN-------KESQWFPDSGASNHVTNDIGNLTIGFE

Query:  YLRDNKVLVGNGASLDVLNVGSSFL
        Y   + V+V +G+++ + + GS+ L
Subjt:  YLRDNKVLVGNGASLDVLNVGSSFL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.4e-1125.66Show/hide
Query:  DALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLP-SVNLSTHNPVKQPSIVNNDPNRRGKN---
        D LA+ G+ + H + V  +L+ L  +Y  V+  I  KDT PSL +++  L+++E+++     +N    +P + N+ TH         NN  + R  N   
Subjt:  DALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLP-SVNLSTHNPVKQPSIVNNDPNRRGKN---

Query:  ----------SGQRFERNFQGPY----------AHSPHQ--NTNKFSQISRQSSSQPPFNAFTLQHELNKES-----QWFPDSGASNHVTNDIGNLTIGF
                  SG R +     PY           HS  +    ++F   + Q  S  PF  +  +  L   S      W  DSGA++H+T+D  NL+   
Subjt:  ----------SGQRFERNFQGPY----------AHSPHQ--NTNKFSQISRQSSSQPPFNAFTLQHELNKES-----QWFPDSGASNHVTNDIGNLTIGF

Query:  EYLRDNKVLVGNGASLDVLNVGSSFL
         Y   + V++ +G+++ + + GS+ L
Subjt:  EYLRDNKVLVGNGASLDVLNVGSSFL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGACTGCAAGACTACTAGTGAGATTAAAAATTTGGTTGATGCCTTGGCTGTTGCTGGACGCAAAATTTCGCATGTGGATCATGTATTACATATTTTGCAAGGCTT
AGGTTATGAGTATGATTCTGTAGTCTCTGTAATTACAGATAAAGATACATCTCCCTCCTTACAGAAAGTTTATTCACTTTTGCTGGATCAAGAAAATAGAATTGAACATC
ACTCTACCATCAATCCTAATGGTTCTCTGCCTTCAGTAAACCTTAGCACTCACAATCCAGTAAAACAGCCATCTATAGTGAACAATGATCCAAATCGAAGAGGGAAAAAT
TCAGGACAGAGATTTGAGCGAAATTTTCAAGGTCCGTATGCTCACTCACCTCATCAGAATACTAATAAGTTTTCTCAAATATCAAGGCAAAGTTCTTCTCAACCTCCTTT
CAATGCCTTTACTCTACAACATGAATTGAACAAGGAAAGTCAGTGGTTCCCAGATTCTGGTGCGTCGAACCATGTTACGAATGATATTGGCAATTTAACAATTGGATTTG
AGTATCTTAGAGATAACAAAGTTTTGGTCGGTAATGGTGCAAGTTTGGATGTTCTTAATGTTGGATCTTCCTTTCTTAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAGACTGCAAGACTACTAGTGAGATTAAAAATTTGGTTGATGCCTTGGCTGTTGCTGGACGCAAAATTTCGCATGTGGATCATGTATTACATATTTTGCAAGGCTT
AGGTTATGAGTATGATTCTGTAGTCTCTGTAATTACAGATAAAGATACATCTCCCTCCTTACAGAAAGTTTATTCACTTTTGCTGGATCAAGAAAATAGAATTGAACATC
ACTCTACCATCAATCCTAATGGTTCTCTGCCTTCAGTAAACCTTAGCACTCACAATCCAGTAAAACAGCCATCTATAGTGAACAATGATCCAAATCGAAGAGGGAAAAAT
TCAGGACAGAGATTTGAGCGAAATTTTCAAGGTCCGTATGCTCACTCACCTCATCAGAATACTAATAAGTTTTCTCAAATATCAAGGCAAAGTTCTTCTCAACCTCCTTT
CAATGCCTTTACTCTACAACATGAATTGAACAAGGAAAGTCAGTGGTTCCCAGATTCTGGTGCGTCGAACCATGTTACGAATGATATTGGCAATTTAACAATTGGATTTG
AGTATCTTAGAGATAACAAAGTTTTGGTCGGTAATGGTGCAAGTTTGGATGTTCTTAATGTTGGATCTTCCTTTCTTAAATAG
Protein sequenceShow/hide protein sequence
MLDCKTTSEIKNLVDALAVAGRKISHVDHVLHILQGLGYEYDSVVSVITDKDTSPSLQKVYSLLLDQENRIEHHSTINPNGSLPSVNLSTHNPVKQPSIVNNDPNRRGKN
SGQRFERNFQGPYAHSPHQNTNKFSQISRQSSSQPPFNAFTLQHELNKESQWFPDSGASNHVTNDIGNLTIGFEYLRDNKVLVGNGASLDVLNVGSSFLK