; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015374 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015374
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationtig00003469:836283..836726
RNA-Seq ExpressionSgr015374
SyntenySgr015374
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]3.1e-4864.03Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF  KGVIHQ+SCV  P+QNSVVE+ HQHILN A +L FQS++P+ FWG+C++TAVYLI+RTPS++L  K PFQ L N +PDY+SL+VFGSLC+AS+L H
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        + SKF  +A+ SVF+GYP  +KGY+L+DIE K+ FISRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

KYP64799.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.0e-4763.04Show/hide
Query:  FNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLHD
        F+ KG+IHQFSCV RP+QNSVVER H HILN+A +L+FQS +P+ FWGECV TAV+L+NRTPS +L  KSPF++LY+ +P+Y   RVFGSLC+ASTLL  
Subjt:  FNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLHD

Query:  RSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        R KF H+A+ +VF+GYP   KGY+L D+  K+ FISRD
Subjt:  RSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

TYK16758.1 Copia protein [Cucumis melo var. makuwa]1.1e-4865.47Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF  KGVIHQ+SCV  P+QNSVVER HQHILN A +L FQS++P+ FWG+C+LTA+YLINRTPSK+L  KS FQ L N +PDY+SL+VFGSLC+AS+L +
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        +RSKF  +A+ SVF+GYP  +K Y+L+DIE K+ FISRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

TYK18103.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]3.1e-4864.03Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF  KGVIHQ+SCV  P+QNSVVE+ HQHILN A +L FQS++P+ FWG+C++TAVYLI+RTPS++L  K PFQ L N +PDY+SL+VFGSLC+AS+L H
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        + SKF  +A+ SVF+GYP  +KGY+L+DIE K+ FISRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]6.0e-5271.94Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF+SKGV+HQFSCVG PEQNSVVER HQH+LNVA SL FQSR+P  FWGECVLTA YLINRTP+ VL   +P+  LY    DYSSL+VFG LCF ST   
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        +RSKFH +ALTSVFVGYPP +KGY+L+DIE KRFF+SRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

TrEMBL top hitse value%identityAlignment
A0A151TCM1 Retrovirus-related Pol polyprotein from transposon TNT 1-949.7e-4863.04Show/hide
Query:  FNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLHD
        F+ KG+IHQFSCV RP+QNSVVER H HILN+A +L+FQS +P+ FWGECV TAV+L+NRTPS +L  KSPF++LY+ +P+Y   RVFGSLC+ASTLL  
Subjt:  FNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLHD

Query:  RSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        R KF H+A+ +VF+GYP   KGY+L D+  K+ FISRD
Subjt:  RSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 81.5e-4864.03Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF  KGVIHQ+SCV  P+QNSVVE+ HQHILN A +L FQS++P+ FWG+C++TAVYLI+RTPS++L  K PFQ L N +PDY+SL+VFGSLC+AS+L H
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        + SKF  +A+ SVF+GYP  +KGY+L+DIE K+ FISRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

A0A5D3CZP1 Copia protein5.1e-4965.47Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF  KGVIHQ+SCV  P+QNSVVER HQHILN A +L FQS++P+ FWG+C+LTA+YLINRTPSK+L  KS FQ L N +PDY+SL+VFGSLC+AS+L +
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        +RSKF  +A+ SVF+GYP  +K Y+L+DIE K+ FISRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

A0A5D3D1N3 Cysteine-rich RLK (Receptor-like protein kinase) 81.5e-4864.03Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF  KGVIHQ+SCV  P+QNSVVE+ HQHILN A +L FQS++P+ FWG+C++TAVYLI+RTPS++L  K PFQ L N +PDY+SL+VFGSLC+AS+L H
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        + SKF  +A+ SVF+GYP  +KGY+L+DIE K+ FISRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

A0A6J1DNP7 uncharacterized protein LOC1110220652.9e-5271.94Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        FF+SKGV+HQFSCVG PEQNSVVER HQH+LNVA SL FQSR+P  FWGECVLTA YLINRTP+ VL   +P+  LY    DYSSL+VFG LCF ST   
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
        +RSKFH +ALTSVFVGYPP +KGY+L+DIE KRFF+SRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-1935.46Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVL--AGKSPFQLLYNALPDYSSLRVFGSLCFASTL
        F   KG+ +  +    P+ N V ER  + I   A +++  +++  +FWGE VLTA YLINR PS+ L  + K+P+++ +N  P    LRVFG+  +   +
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVL--AGKSPFQLLYNALPDYSSLRVFGSLCFASTL

Query:  LHDRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
         + + KF  ++  S+FVGY PN  G++L+D   ++F ++RD
Subjt:  LHDRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-2138.13Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        + +S G+ H+ +  G P+ N V ER ++ I+    S+L  +++P +FWGE V TA YLINR+PS  LA + P ++  N    YS L+VFG   FA     
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD
         R+K   +++  +F+GY     GYRL+D  +K+   SRD
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-2236.88Show/hide
Query:  LWIFFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFAST
        LW +F+  G+ H  S    PE N + ER H+HI+    +LL  + IP T+W      AVYLINR P+ +L  +SPFQ L+   P+Y  LRVFG  C+   
Subjt:  LWIFFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFAST

Query:  LLHDRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISR
          +++ K   ++   VF+GY      Y    ++  R +ISR
Subjt:  LLHDRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.1e-1934.06Show/hide
Query:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH
        + +  G+ H  S    PE N + ER H+HI+ +  +LL  + +P T+W      AVYLINR P+ +L  +SPFQ L+   P+Y  L+VFG  C+     +
Subjt:  FFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLH

Query:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISR
        +R K   ++    F+GY      Y    I   R + SR
Subjt:  DRSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCTGAGTTATCTTTTGTGGATTTTTTTTAATTCTAAGGGTGTTATACACCAATTCTCTTGTGTGGGAAGGCCTGAGCAGAACTCGGTAGTTGAGAGGAATCACCA
ACACATCTTGAATGTTGCACACTCTCTTTTATTTCAGTCCAGAATTCCTATTACATTTTGGGGAGAATGTGTCTTGACTGCGGTGTACCTGATTAATAGGACTCCTTCAA
AGGTTTTGGCTGGGAAGTCCCCTTTTCAGCTTCTTTATAATGCCTTGCCTGATTATAGTTCTTTAAGAGTTTTTGGATCACTCTGTTTTGCTTCTACATTATTGCACGAT
AGGTCCAAATTCCATCATCAAGCTCTTACTTCTGTTTTCGTGGGATATCCCCCTAACGTCAAAGGGTATCGTTTGTTTGATATAGAGCAGAAGCGGTTTTTCATCTCCAG
GGAC
mRNA sequenceShow/hide mRNA sequence
ATGTCCCTGAGTTATCTTTTGTGGATTTTTTTTAATTCTAAGGGTGTTATACACCAATTCTCTTGTGTGGGAAGGCCTGAGCAGAACTCGGTAGTTGAGAGGAATCACCA
ACACATCTTGAATGTTGCACACTCTCTTTTATTTCAGTCCAGAATTCCTATTACATTTTGGGGAGAATGTGTCTTGACTGCGGTGTACCTGATTAATAGGACTCCTTCAA
AGGTTTTGGCTGGGAAGTCCCCTTTTCAGCTTCTTTATAATGCCTTGCCTGATTATAGTTCTTTAAGAGTTTTTGGATCACTCTGTTTTGCTTCTACATTATTGCACGAT
AGGTCCAAATTCCATCATCAAGCTCTTACTTCTGTTTTCGTGGGATATCCCCCTAACGTCAAAGGGTATCGTTTGTTTGATATAGAGCAGAAGCGGTTTTTCATCTCCAG
GGAC
Protein sequenceShow/hide protein sequence
MSLSYLLWIFFNSKGVIHQFSCVGRPEQNSVVERNHQHILNVAHSLLFQSRIPITFWGECVLTAVYLINRTPSKVLAGKSPFQLLYNALPDYSSLRVFGSLCFASTLLHD
RSKFHHQALTSVFVGYPPNVKGYRLFDIEQKRFFISRD