CuGenDBv2

Sgr022165 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022165
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: tig00153902:358961..362179
RNA-Seq Expression: Sgr022165
Synteny: Sgr022165
Gene Ontology terms:
GO:0006468 - protein phosphorylation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
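
The genome location in the record above uses a `contig:start..end` notation. A minimal sketch of how such a string could be split into its parts follows; the function name `parse_location` and the `Location` type are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' genome location."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153902:358961..362179'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(contig, start, end)


loc = parse_location("tig00153902:358961..362179")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus for Sgr022165 spans 3219 bp on contig tig00153902.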


Homology
GenBank top hits (e value, % identity, alignment)
KAA8521602.1 hypothetical protein F0562_012275 [Nyssa sinensis]  e value: 6.3e-19  % identity: 64.56
Query:  DTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRR
        D+ +KGWRCMD +TK VVVSRDVVFDEVSSHQ+DA+T +G  D SPFF +DASN+K  N T+ GE +Q +E IGT +RR
Subjt:  DTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRR

KAA8540328.1 hypothetical protein F0562_024753 [Nyssa sinensis]  e value: 3.0e-29  % identity: 71.13
Query:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRRSS
        RTKLD RAR CIFVGYDTHKKGWRCMDL+TK VVVSRDVVFDEVSSHQ+DA+  +G  D SPFF +DAS++K  N T+ GE +Q +E IGT +RRSS
Subjt:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRRSS

KAA8541518.1 hypothetical protein F0562_022670 [Nyssa sinensis]  e value: 2.2e-24  % identity: 67.42
Query:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAI
        RTKLD RARRCIFVGYDTHKKGWRCMDL+TK VVVSRDVVFDE SSHQ+DA+  +G  D SPFF +DAS++   N T+  E +Q +E +
Subjt:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAI

KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis]  e value: 7.6e-25  % identity: 64.89
Query:  TKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDKENNT-AVGENVQSNEAIGTIVRR
        TK D RARRCIFVGY+THKKGWRCMD  TK V+VS DVVFD+VSS+Q++A+T +G+ D SPFFSNDAS++K +NT + GE +Q +E IGT ++R
Subjt:  TKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDKENNT-AVGENVQSNEAIGTIVRR

KAG6424918.1 hypothetical protein SASPL_115341 [Salvia splendens]  e value: 1.7e-19  % identity: 55.67
Query:  NKRTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDKENNTAVGENVQSNEAIGTIVRRS
        + R KLD +A+RCIF+GYDTH+KGWRCMD + K VVVSRDVVFDE+SS  + A+ K    D  P   +  SNDK + T  GENVQ +E I    RRS
Subjt:  NKRTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDKENNTAVGENVQSNEAIGTIVRRS

TrEMBL top hits (e value, % identity, alignment)
A0A443N8T5 Integrase, catalytic core  e value: 1.1e-16  % identity: 66.67
Query:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK
        RTKLD RARRCIFVGYD H+KGW+CMD +TK V VSRDVVFDEVSS Q+D  TK+G +D SPF    +  D+
Subjt:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK

A0A5J4ZW51 CCHC-type domain-containing protein  e value: 3.0e-19  % identity: 64.56
Query:  DTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRR
        D+ +KGWRCMD +TK VVVSRDVVFDEVSSHQ+DA+T +G  D SPFF +DASN+K  N T+ GE +Q +E IGT +RR
Subjt:  DTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRR

A0A5J5BCB3 Uncharacterized protein  e value: 1.5e-29  % identity: 71.13
Query:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRRSS
        RTKLD RAR CIFVGYDTHKKGWRCMDL+TK VVVSRDVVFDEVSSHQ+DA+  +G  D SPFF +DAS++K  N T+ GE +Q +E IGT +RRSS
Subjt:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAIGTIVRRSS

A0A5J5BFR6 Uncharacterized protein  e value: 1.1e-24  % identity: 67.42
Query:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAI
        RTKLD RARRCIFVGYDTHKKGWRCMDL+TK VVVSRDVVFDE SSHQ+DA+  +G  D SPFF +DAS++   N T+  E +Q +E +
Subjt:  RTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDK-ENNTAVGENVQSNEAI

A0A5J5C3K7 Uncharacterized protein  e value: 3.7e-25  % identity: 64.89
Query:  TKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDKENNT-AVGENVQSNEAIGTIVRR
        TK D RARRCIFVGY+THKKGWRCMD  TK V+VS DVVFD+VSS+Q++A+T +G+ D SPFFSNDAS++K +NT + GE +Q +E IGT ++R
Subjt:  TKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDKENNT-AVGENVQSNEAIGTIVRR

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94  e value: 4.5e-04  % identity: 28.3
Query:  KRTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEV---SSHQMDASTKKGIVDPSPFFSNDASNDKENNTAVGENVQSNEAIGTIVRRSS
        +RTKLD ++  CIF+GY   + G+R  D   K V+ SRDVVF E    ++  M    K GI+       + ++N     +   E  +  E  G ++ +  
Subjt:  KRTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEV---SSHQMDASTKKGIVDPSPFFSNDASNDKENNTAVGENVQSNEAIGTIVRRSS

Query:  NANSSL
          +  +
Subjt:  NANSSL

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
AATAAGCGGACTAAACTTGACATAAGGGCAAGACGTTGTATTTTTGTTGGTTATGATACTCACAAAAAAGGATGGAGATGTATGGATTTAGATACAAAGGCAGTTGTTGT
CTCTCGAGATGTGGTGTTTGACGAAGTTTCGTCACATCAAATGGATGCAAGTACAAAGAAAGGCATTGTTGATCCGTCACCTTTCTTTAGTAATGATGCATCGAATGATA
AGGAGAACAATACTGCTGTTGGAGAAAATGTTCAGTCAAATGAAGCTATAGGGACCATTGTTCGAAGGTCTTCAAATGCAAACTCTAGTTTATTTGTTAAAAGGACAGCT
TCGCTACAAGAAGAGATTGTCACTGGAGCGTCTTCATCGGCGTGTTGGCATGTGCAGATTGGATGGTGCGATGGCGTGGTTCTTCTATTTGGGAGTGGGTCTTCTGCTAC
ATCGCTGGTTCGAGTGTGGGCCTTCGCCAAATTGACTCATGGGGTGCCTACTGTGCCGCTTGGGGCTCTGATGACTTTTGGGTCGAGGTATGGTTTGCCAGCGTAG
mRNA sequence
AATAAGCGGACTAAACTTGACATAAGGGCAAGACGTTGTATTTTTGTTGGTTATGATACTCACAAAAAAGGATGGAGATGTATGGATTTAGATACAAAGGCAGTTGTTGT
CTCTCGAGATGTGGTGTTTGACGAAGTTTCGTCACATCAAATGGATGCAAGTACAAAGAAAGGCATTGTTGATCCGTCACCTTTCTTTAGTAATGATGCATCGAATGATA
AGGAGAACAATACTGCTGTTGGAGAAAATGTTCAGTCAAATGAAGCTATAGGGACCATTGTTCGAAGGTCTTCAAATGCAAACTCTAGTTTATTTGTTAAAAGGACAGCT
TCGCTACAAGAAGAGATTGTCACTGGAGCGTCTTCATCGGCGTGTTGGCATGTGCAGATTGGATGGTGCGATGGCGTGGTTCTTCTATTTGGGAGTGGGTCTTCTGCTAC
ATCGCTGGTTCGAGTGTGGGCCTTCGCCAAATTGACTCATGGGGTGCCTACTGTGCCGCTTGGGGCTCTGATGACTTTTGGGTCGAGGTATGGTTTGCCAGCGTAG
Protein sequence
NKRTKLDIRARRCIFVGYDTHKKGWRCMDLDTKAVVVSRDVVFDEVSSHQMDASTKKGIVDPSPFFSNDASNDKENNTAVGENVQSNEAIGTIVRRSSNANSSLFVKRTA
SLQEEIVTGASSSACWHVQIGWCDGVVLLFGSGSSATSLVRVWAFAKLTHGVPTVPLGALMTFGSRYGLPA
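
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and reading frame 1; the function name `translate` is hypothetical, and the page does not say how the database derived its protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("AATAAGCGGACTAAACTTGACATAAGG"))
```

The first nine codons of the CDS translate to NKRTKLDIR, which matches the start of the listed protein sequence. Passing the full CDS (line breaks included) to `translate` allows the same comparison over the whole sequence.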