; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr3:14268979..14270166
RNA-Seq ExpressionMoc03g20880
SyntenyMoc03g20880
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7548157.1 Integrase catalytic core [Arabidopsis suecica]4.0e-3133.1Show/hide
Query:  VLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPS-------SYGNNGY-----KRKHQEVFKGKRDHRI
        +++DEQ V+  +RSL    +HMK+  T ++ +K   D++ +L  +++R+EA R  N   Y +  S       ++ N        K++ +   +GK+  R 
Subjt:  VLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPS-------SYGNNGY-----KRKHQEVFKGKRDHRI

Query:  KKRKQ------------------------------------------------DSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLN
         K K                                                 DSGATDHVA+ R  + ++RRIP   RWLYVG N +V V  IG+C+L+
Subjt:  KKRKQ------------------------------------------------DSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLN

Query:  MRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIVLDV-VETISYNYASFSLIT
        MR   TL+LHDVLYAP+I R+L+SV  L +LG++    +  L +SL+ + +GYG+ ++GFIVLD    T S N   FS +T
Subjt:  MRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIVLDV-VETISYNYASFSLIT

KAG7595359.1 Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis thaliana x Arabidopsis arenosa]4.4e-3034.14Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMR---LDNVETYVAG-PSSYGNNGYKRKHQEVFK-----G
        I ELK    +++DEQ ++  +RSL    +HM+V  T +  +K   D++ HL  +++R+EA +     NV + ++G     GN G K  +Q   K      
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMR---LDNVETYVAG-PSSYGNNGYKRKHQEVFK-----G

Query:  KRDHRIKKRKQ--------------------------------------------------DSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVK
        KR  R  K+ +                                                  DSGATDHVA+DR  +V++RRIP  +RWLYVG N++V V 
Subjt:  KRDHRIKKRKQ--------------------------------------------------DSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVK

Query:  DIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIVLDVVETI--SYNYASFSLIT
         IG+C+L+MR   TL+LHDVL AP+I R+L+SV  L KLG+     +  L ++L ++ +G G+  +GFIVLD    I  S +   FS IT
Subjt:  DIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIVLDVVETI--SYNYASFSLIT

TXG60589.1 hypothetical protein EZV62_015162 [Acer yangbiense]1.3e-2940.67Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQE-------VFKGKR
        I ELK   H L DEQ ++  IRSL    E+MK++ T +  IK  +D++ H+  +++R+EA +    + Y+A  S     G+K   ++         KGK 
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQE-------VFKGKR

Query:  DHRIKKRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTL
           I+  K++SGATDHVA+DR  +V++ RI    RW+YV  N+RV VK IG+CKL M +   L LHDVLYAP I RNL+SV VL  LGY        + +
Subjt:  DHRIKKRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTL

Query:  SLNNIHYGY
         LN++  G+
Subjt:  SLNNIHYGY

XP_020080565.1 uncharacterized protein LOC109704209 [Ananas comosus]1.7e-3442.34Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQEVFKGKRDHRIKKR
        I ELK   H LTD Q  +  IRSL    EHMKV+ T ++ IK   DV  ++  +++R+ A R D V  Y+   SS+  +G       +        I   
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQEVFKGKRDHRIKKR

Query:  KQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHY
          DSGAT H+A DRG +V++RR+ +  +W+YVG NA+V VK IG+CK+ +R   TLL HDVLYAP+I RNL+SV+VL  LGY      N + +   +++Y
Subjt:  KQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHY

Query:  GYGFIFNGFIVLDVVETISYNY
        G G++ NGF VLD    + YNY
Subjt:  GYGFIFNGFIVLDVVETISYNY

XP_022889045.1 uncharacterized protein LOC111404476 [Olea europaea var. sylvestris]2.5e-3336.94Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQEVFKG---------
        I ELK   H+L+DEQ V+  IRSL Y+ EH+KV+ T ++ IK   D  CH+  +++R+ A +   V+ +VA  SS   +G+KRK +  FKG         
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQEVFKG---------

Query:  --------KRDHRIK-----------------------------KRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNEC
                KR+++ K                             K+  DSGA DHV +DR V+V+F R P+  +W+YVG + RV VK IG+CKL++    
Subjt:  --------KRDHRIK-----------------------------KRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNEC

Query:  TLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIVLDVVETISYNY
        T+LLHDVL+A  I RNL+ V VL +L +      NS+ ++  ++ YG G + + FIVLD  ++ +YNY
Subjt:  TLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIVLDVVETISYNY

TrEMBL top hitse value%identityAlignment
A0A151QKT8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-2932.75Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMR-------LDNVETYVAGPSSYGNNGYKRKHQ----EVF
        I ELKE  H L+DEQ V+  IRSL    EHMKV+ T +E I   +DV  HL  +++R+EA +        +   T  A P+S     ++RK +    +  
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMR-------LDNVETYVAGPSSYGNNGYKRKHQ----EVF

Query:  KGKRDHRIKKRKQ--------------------------------------------------------------DSGATDHVAKDRGVYVKFRRIPSRA
        KGK+  + K  K                                                               DSG+TDHV +DRG ++++RR+P+ +
Subjt:  KGKRDHRIKKRKQ--------------------------------------------------------------DSGATDHVAKDRGVYVKFRRIPSRA

Query:  RWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIV
        RWLYVG NAR+ VK +G+C+  +R   TL LHDVLY PDI RNL+SV  L K G+     +  +    N+++YG  F+   F V
Subjt:  RWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFIFNGFIV

A0A2U1LF40 Uncharacterized protein7.6e-2836.36Show/hide
Query:  IRDRLIFKLDTIAISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQE
        +R  LI   + IA  EL      LTDE+ V+  I+SL+     MK++ T +  IK  +DV+ HL  +EDR+ + + D +E YVA  S      +KR    
Subjt:  IRDRLIFKLDTIAISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQE

Query:  VFKGKRDHRIKKRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLT
          KGK            G T H+ +DR  +V +RRIP  ++++Y+  + +     I +CKL MR   TL LHDVLYAPD+ R+L+SV VL  LGY     
Subjt:  VFKGKRDHRIKKRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLT

Query:  ENSLTLSLNNIHYGYGFIFNGFIVLDVV-ETISYNYASFSLITEGIGSRSRSA
        +  + L L   +YG G I +GF+VLD V      N   FS +     + S  A
Subjt:  ENSLTLSLNNIHYGYGFIFNGFIVLDVV-ETISYNYASFSLITEGIGSRSRSA

A0A5C7HUX5 Uncharacterized protein6.2e-3040.67Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQE-------VFKGKR
        I ELK   H L DEQ ++  IRSL    E+MK++ T +  IK  +D++ H+  +++R+EA +    + Y+A  S     G+K   ++         KGK 
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQE-------VFKGKR

Query:  DHRIKKRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTL
           I+  K++SGATDHVA+DR  +V++ RI    RW+YV  N+RV VK IG+CKL M +   L LHDVLYAP I RNL+SV VL  LGY        + +
Subjt:  DHRIKKRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTL

Query:  SLNNIHYGY
         LN++  G+
Subjt:  SLNNIHYGY

A0A6D2J7K8 Uncharacterized protein1.3e-2732.83Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSY----GNNGY--------KRKHQEV
        I ELK    +++DEQ V+  +RSL    +HM++  T +  +K   D++ HL  +++R+EA +  N   Y    S      GNNG         KR+ + V
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSY----GNNGY--------KRKHQEV

Query:  FKGKRDHRIKKRK------------------------------------------------QDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPV
         +GKR  +  K K                                                 DSGATDHVA+ R  +V++RR+P   RWLYVG N++V V
Subjt:  FKGKRDHRIKKRK------------------------------------------------QDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPV

Query:  KDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFI
        + IG+C+L+M    TL+LHDVLYAP+I  +L+SV  L KLG+        L +SL + ++G G++
Subjt:  KDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGYGFI

A0A6P5EGH5 uncharacterized protein LOC1097042098.3e-3542.34Show/hide
Query:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQEVFKGKRDHRIKKR
        I ELK   H LTD Q  +  IRSL    EHMKV+ T ++ IK   DV  ++  +++R+ A R D V  Y+   SS+  +G       +        I   
Subjt:  ISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQEVFKGKRDHRIKKR

Query:  KQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHY
          DSGAT H+A DRG +V++RR+ +  +W+YVG NA+V VK IG+CK+ +R   TLL HDVLYAP+I RNL+SV+VL  LGY      N + +   +++Y
Subjt:  KQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHY

Query:  GYGFIFNGFIVLDVVETISYNY
        G G++ NGF VLD    + YNY
Subjt:  GYGFIFNGFIVLDVVETISYNY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATGATGTCATCCGTGATCGACTTATCTTCAAGCTTGACACTATTGCGATTAGTGAGCTGAAAGAACCTGATCATGTCTTGACTGATGAACAACATGTTAAACC
TGATATTCGATCTCTTTCCTATGACGAAGAGCACATGAAGGTGCATTTTACTCGTGATGAGGTAATTAAACCTCTTAATGATGTTACGTGTCATCTTGGATGGAAAGAAG
ACCGTATGGAGGCAATGCGGCTTGACAATGTTGAAACCTATGTGGCTGGCCCAAGTTCTTATGGGAACAATGGGTACAAGCGTAAACATCAAGAAGTTTTTAAGGGGAAA
AGAGATCACAGGATTAAGAAACGCAAGCAAGACTCAGGAGCAACCGACCACGTAGCTAAGGATCGTGGGGTGTATGTGAAATTTCGTCGAATTCCATCAAGAGCTAGGTG
GCTGTATGTAGGAACCAATGCTCGAGTGCCTGTAAAGGACATTGGTTCCTGCAAGTTGAACATGCGGAATGAATGCACTCTACTTCTGCATGATGTGTTGTATGCACCTG
ATATTTGGCGAAATTTGCTTTCTGTTACTGTCCTTTTCAAACTTGGTTACACTTTTACTTTGACTGAAAACTCGTTAACTCTATCTTTGAACAATATACATTATGGTTAT
GGTTTTATTTTTAATGGTTTTATTGTATTGGATGTTGTTGAGACGATATCTTACAATTATGCAAGTTTTTCCCTTATAACTGAAGGTATCGGATCCCGCAGCAGAAGCGC
GCGAATGCGTGTGCTTCGATCTTGTCTTAGAATTACAACAAAGAAAAGCCACATAAACATGAATAACAAGAAGTTAGTGATACATACCTTTGAAGACACCTCTTCAATGT
AG
mRNA sequenceShow/hide mRNA sequence
ATGAACAATGATGTCATCCGTGATCGACTTATCTTCAAGCTTGACACTATTGCGATTAGTGAGCTGAAAGAACCTGATCATGTCTTGACTGATGAACAACATGTTAAACC
TGATATTCGATCTCTTTCCTATGACGAAGAGCACATGAAGGTGCATTTTACTCGTGATGAGGTAATTAAACCTCTTAATGATGTTACGTGTCATCTTGGATGGAAAGAAG
ACCGTATGGAGGCAATGCGGCTTGACAATGTTGAAACCTATGTGGCTGGCCCAAGTTCTTATGGGAACAATGGGTACAAGCGTAAACATCAAGAAGTTTTTAAGGGGAAA
AGAGATCACAGGATTAAGAAACGCAAGCAAGACTCAGGAGCAACCGACCACGTAGCTAAGGATCGTGGGGTGTATGTGAAATTTCGTCGAATTCCATCAAGAGCTAGGTG
GCTGTATGTAGGAACCAATGCTCGAGTGCCTGTAAAGGACATTGGTTCCTGCAAGTTGAACATGCGGAATGAATGCACTCTACTTCTGCATGATGTGTTGTATGCACCTG
ATATTTGGCGAAATTTGCTTTCTGTTACTGTCCTTTTCAAACTTGGTTACACTTTTACTTTGACTGAAAACTCGTTAACTCTATCTTTGAACAATATACATTATGGTTAT
GGTTTTATTTTTAATGGTTTTATTGTATTGGATGTTGTTGAGACGATATCTTACAATTATGCAAGTTTTTCCCTTATAACTGAAGGTATCGGATCCCGCAGCAGAAGCGC
GCGAATGCGTGTGCTTCGATCTTGTCTTAGAATTACAACAAAGAAAAGCCACATAAACATGAATAACAAGAAGTTAGTGATACATACCTTTGAAGACACCTCTTCAATGT
AG
Protein sequenceShow/hide protein sequence
MNNDVIRDRLIFKLDTIAISELKEPDHVLTDEQHVKPDIRSLSYDEEHMKVHFTRDEVIKPLNDVTCHLGWKEDRMEAMRLDNVETYVAGPSSYGNNGYKRKHQEVFKGK
RDHRIKKRKQDSGATDHVAKDRGVYVKFRRIPSRARWLYVGTNARVPVKDIGSCKLNMRNECTLLLHDVLYAPDIWRNLLSVTVLFKLGYTFTLTENSLTLSLNNIHYGY
GFIFNGFIVLDVVETISYNYASFSLITEGIGSRSRSARMRVLRSCLRITTKKSHINMNNKKLVIHTFEDTSSM