CuGenDBv2

Sgr004467 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr004467
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Integrase catalytic domain-containing protein
Genome location: tig00002956:451345..452058
RNA-Seq Expression: Sgr004467
Synteny: Sgr004467
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
                  IPR036397 - Ribonuclease H superfamily
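
The genome location in this record follows a `contig:start..end` pattern. The following is a minimal Python sketch for splitting such a string and computing the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00002956:451345..452058")
print(contig, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out to 714 bp.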


Homology
GenBank top hits (E-value, % identity, alignment)
CAN67403.1 hypothetical protein VITISV_025614 [Vitis vinifera] | E-value: 2.5e-15 | Identity: 32.7%
Query:  VNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG------------------------
        V N+F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F T +FLINRLP+ +L                         
Subjt:  VNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG------------------------

Query:  ------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--PLA
                                      GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S  +S   S   PS S  PL 
Subjt:  ------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--PLA

Query:  NCDTAATISSP
        N   ++TIS P
Subjt:  NCDTAATISSP

CAN68489.1 hypothetical protein VITISV_037543 [Vitis vinifera] | E-value: 1.2e-14 | Identity: 31.9%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------
        GEF   +F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F TA+FLINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------

Query:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLAN
                                         GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S         P VS L +
Subjt:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLAN

Query:  CDTAATISSP
          T +T S P
Subjt:  CDTAATISSP

CAN79148.1 hypothetical protein VITISV_004343 [Vitis vinifera] | E-value: 1.0e-13 | Identity: 31.25%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------
        GEF+  +F  YL +HGILHQ SC +TP+QNG     I  L++   L+LM++S+LP+K+W+Y F TA++LIN LP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------

Query:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFASSSSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLANCD
                                         GYLC ++S  + YI  + IF E+ FPF SSS  S          P+  L   T  L+   SP  +  
Subjt:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFASSSSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLANCD

Query:  TAATISSP
        ++  +SSP
Subjt:  TAATISSP

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | E-value: 5.6e-15 | Identity: 33.18%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------
        GEF   +F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F TA+FLINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------

Query:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--
                                         GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S  +S   S   PS S  
Subjt:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--

Query:  PLANCDTAATISSP
        PL N   ++TIS P
Subjt:  PLANCDTAATISSP

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | E-value: 1.6e-14 | Identity: 32.71%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------
        GEF   +F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F T +FLINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------

Query:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--
                                         GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S  +S   S   PS S  
Subjt:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--

Query:  PLANCDTAATISSP
        PL N   ++TIS P
Subjt:  PLANCDTAATISSP

TrEMBL top hits (E-value, % identity, alignment)
A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | E-value: 2.7e-15 | Identity: 33.18%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------
        GEF   +F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F TA+FLINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------

Query:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--
                                         GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S  +S   S   PS S  
Subjt:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--

Query:  PLANCDTAATISSP
        PL N   ++TIS P
Subjt:  PLANCDTAATISSP

A0A438JAU4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | E-value: 7.8e-15 | Identity: 32.71%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------
        GEF   +F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F T +FLINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------

Query:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--
                                         GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S  +S   S   PS S  
Subjt:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--

Query:  PLANCDTAATISSP
        PL N   ++TIS P
Subjt:  PLANCDTAATISSP

A5AYB0 Integrase catalytic domain-containing protein | E-value: 5.1e-14 | Identity: 31.25%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------
        GEF+  +F  YL +HGILHQ SC +TP+QNG     I  L++   L+LM++S+LP+K+W+Y F TA++LIN LP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------

Query:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFASSSSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLANCD
                                         GYLC ++S  + YI  + IF E+ FPF SSS  S          P+  L   T  L+   SP  +  
Subjt:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFASSSSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLANCD

Query:  TAATISSP
        ++  +SSP
Subjt:  TAATISSP

A5B1N8 Integrase catalytic domain-containing protein | E-value: 1.2e-15 | Identity: 32.7%
Query:  VNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG------------------------
        V N+F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F T +FLINRLP+ +L                         
Subjt:  VNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG------------------------

Query:  ------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--PLA
                                      GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S  +S   S   PS S  PL 
Subjt:  ------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDS--LSQFTSGLLPSVS--PLA

Query:  NCDTAATISSP
        N   ++TIS P
Subjt:  NCDTAATISSP

A5C001 Integrase catalytic domain-containing protein | E-value: 6.0e-15 | Identity: 31.9%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------
        GEF   +F+SYL++HGI  Q SC YTPEQNG     +  +++   L+L++ + LP KFW Y F TA+FLINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILG---------------------

Query:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLAN
                                         GY+C N   G+ Y+  H +F E +FPF S+   S S   +P   FLP  S         P VS L +
Subjt:  ---------------------------------GYLCYNMSNGKFYIPHHAIFDENLFPFASS--SSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLAN

Query:  CDTAATISSP
          T +T S P
Subjt:  CDTAATISSP

SwissProt top hits (E-value, % identity, alignment)
P04146 Copia protein | E-value: 2.1e-04 | Identity: 27.48%
Query:  EFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL--GGYLCYNM-SNGKFYIPHHAI
        E+++N    +    GI +  +  +TP+ NG     I T+ ++ A +++S + L   FW     TA +LINR+PS  L       Y M  N K Y+ H  +
Subjt:  EFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL--GGYLCYNM-SNGKFYIPHHAI

Query:  FDENLFPFASSSSFSF----YQLPFVLFLPN
        F   ++    +    F    ++  FV + PN
Subjt:  FDENLFPFASSSSFSF----YQLPFVLFLPN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | E-value: 6.8e-08 | Identity: 38.67%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPS
        GE+ +  F  Y SSHGI H+K+   TP+ NG       T++++   S++  + LP  FW     TA +LINR PS
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | E-value: 9.5e-10 | Identity: 27.16%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------
        GEFV  +   Y S HGI H  S  +TPE NG        +++   L+L+S + +P  +W Y F  A++LINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------

Query:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFAS
                                          YLC ++   + YI  H  FDEN FPF++
Subjt:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFAS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | E-value: 2.8e-09 | Identity: 26.83%
Query:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------
        GEFV      YLS HGI H  S  +TPE NG        +++   L+L+S + +P  +W Y F  A++LINRLP+ +L                      
Subjt:  GEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSIL----------------------

Query:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFASSS
                                          YLC ++  G+ Y   H  FDE  FPF++++
Subjt:  --------------------------------GGYLCYNMSNGKFYIPHHAIFDENLFPFASSS

Arabidopsis top hits (E-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTGGGGTGAATTTGTGAATAATAGTTTTGCTTCTTATCTTAGCTCTCATGGCATCCTTCACCAAAAGTCTTGTGCCTATACTCCAGAGCAAAATGGGTGGCCTAACGC
AAGCATCGCCACATTGTTGAAGAGAGAGGCTCTTTCTCTCATGTCCAAGTCATATCTTCCTACTAAATTTTGGTCCTATACCTTTGGTACTGCCATGTTTCTTATTAATC
GGCTTCCCTCGTCTATTCTTGGTGGTTACCTCTGTTACAACATGAGCAATGGTAAATTTTATATTCCTCACCATGCTATCTTTGATGAGAATTTGTTCCCTTTTGCTTCT
TCTTCTTCTTTTTCTTTCTATCAATTGCCCTTTGTTCTTTTTCTTCCAAATGACTCTTTATCACAATTTACCAGTGGGTTGTTGCCTTCTGTTTCTCCACTTGCGAATTG
TGATACTGCTGCTACTATTTCTTCTCCTGCTTCTTGTGATATTGGACTGCATGCTATTCCTGCTAATGATTCTTCTGGTATTAATGCTAGTGAAATTAAGGGTGCTTTTT
AA
mRNA sequence
ATGTGGGGTGAATTTGTGAATAATAGTTTTGCTTCTTATCTTAGCTCTCATGGCATCCTTCACCAAAAGTCTTGTGCCTATACTCCAGAGCAAAATGGGTGGCCTAACGC
AAGCATCGCCACATTGTTGAAGAGAGAGGCTCTTTCTCTCATGTCCAAGTCATATCTTCCTACTAAATTTTGGTCCTATACCTTTGGTACTGCCATGTTTCTTATTAATC
GGCTTCCCTCGTCTATTCTTGGTGGTTACCTCTGTTACAACATGAGCAATGGTAAATTTTATATTCCTCACCATGCTATCTTTGATGAGAATTTGTTCCCTTTTGCTTCT
TCTTCTTCTTTTTCTTTCTATCAATTGCCCTTTGTTCTTTTTCTTCCAAATGACTCTTTATCACAATTTACCAGTGGGTTGTTGCCTTCTGTTTCTCCACTTGCGAATTG
TGATACTGCTGCTACTATTTCTTCTCCTGCTTCTTGTGATATTGGACTGCATGCTATTCCTGCTAATGATTCTTCTGGTATTAATGCTAGTGAAATTAAGGGTGCTTTTT
AA
Protein sequence
MWGEFVNNSFASYLSSHGILHQKSCAYTPEQNGWPNASIATLLKREALSLMSKSYLPTKFWSYTFGTAMFLINRLPSSILGGYLCYNMSNGKFYIPHHAIFDENLFPFAS
SSSFSFYQLPFVLFLPNDSLSQFTSGLLPSVSPLANCDTAATISSPASCDIGLHAIPANDSSGINASEIKGAF
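
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal Python sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration, and only the first 30 nucleotides of the CDS are used as input.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every three-letter codon to its one-letter amino acid ('*' = stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the Sgr004467 CDS.
cds_start = "ATGTGGGGTGAATTTGTGAATAATAGTTTT"
print(translate(cds_start))  # MWGEFVNNSF
```

The printed residues match the first ten residues of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.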