CuGenDBv2

Sgr018121 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018121
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: tig00153107:369846..370538
RNA-Seq Expression: Sgr018121
Synteny: Sgr018121
Gene Ontology terms:
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
PNX92571.1 histone deacetylase [Trifolium pratense] (e value: 2.8e-24; % identity: 38.46)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSS
        +PS+ IN Q P+ +L  + PDYH L+ FG  C+P LRPY  HK +F +++C+F+ Y+  HKGYRCLSP+ R+Y S+ V  NE++FPY++LF   S    S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSS

Query:  IGNTILSWLPVAAPSSIPQSDLMA-----VPHTSQPLVPISSENSPSYTVQNHSGD
          +      P+    SI  +D+ +      PH S P  PI+  + P+  +   + D
Subjt:  IGNTILSWLPVAAPSSIPQSDLMA-----VPHTSQPLVPISSENSPSYTVQNHSGD

QHO25178.1 Copia-like retrotransposon Hopscotch polyprotein [Arachis hypogaea] (e value: 1.3e-24; % identity: 41.67)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSS--EG
        +PS      +P+ +LN + PDY  L+ FG +C+P LRPYQ HKFDF T KC+F+ Y+ HHKGY+CL P+ ++Y +RHV   E+KFPYQ LF    S  + 
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSS--EG

Query:  SSIGNTILSWLPVAAPSSIPQSDLMAVPHTSQPLVP-----ISSENSPSYTVQNHS
        S    T L  +P+          L+ +PH S P  P     I S +SP+  V + S
Subjt:  SSIGNTILSWLPVAAPSSIPQSDLMAVPHTSQPLVP-----ISSENSPSYTVQNHS

TXG56026.1 hypothetical protein EZV62_017339 [Acer yangbiense] (e value: 2.8e-24; % identity: 44.97)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS
        +PS+V+N  +PF  L  ++P+Y  L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+  HKGY+CL P+ +IY SRHV  NET+FPY  LFS   SS+ S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS

Query:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP
        S G       N   S  P   P   P   L   PH S  +   S  +SP
Subjt:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP

TXG57080.1 hypothetical protein EZV62_018393 [Acer yangbiense] (e value: 2.8e-24; % identity: 44.97)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS
        +PS+V+N  +PF  L  ++P+Y  L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+  HKGY+CL P+ +IY SRHV  NET+FPY  LFS   SS+ S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS

Query:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP
        S G       N   S  P   P   P   L   PH S  +   S  +SP
Subjt:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP

TXG58227.1 hypothetical protein EZV62_016056 [Acer yangbiense] (e value: 2.8e-24; % identity: 44.97)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS
        +PS+V+N  +PF  L  ++P+Y  L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+  HKGY+CL P+ +IY SRHV  NET+FPY  LFS   SS+ S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS

Query:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP
        S G       N   S  P   P   P   L   PH S  +   S  +SP
Subjt:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP

TrEMBL top hits (e value, % identity, alignment)
A0A2K3MP35 Histone deacetylase (e value: 1.4e-24; % identity: 38.46)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSS
        +PS+ IN Q P+ +L  + PDYH L+ FG  C+P LRPY  HK +F +++C+F+ Y+  HKGYRCLSP+ R+Y S+ V  NE++FPY++LF   S    S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSS

Query:  IGNTILSWLPVAAPSSIPQSDLMA-----VPHTSQPLVPISSENSPSYTVQNHSGD
          +      P+    SI  +D+ +      PH S P  PI+  + P+  +   + D
Subjt:  IGNTILSWLPVAAPSSIPQSDLMA-----VPHTSQPLVPISSENSPSYTVQNHSGD

A0A5C7HIM7 Integrase catalytic domain-containing protein (e value: 1.4e-24; % identity: 44.97)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS
        +PS+V+N  +PF  L  ++P+Y  L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+  HKGY+CL P+ +IY SRHV  NET+FPY  LFS   SS+ S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS

Query:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP
        S G       N   S  P   P   P   L   PH S  +   S  +SP
Subjt:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP

A0A5C7HJ99 Integrase catalytic domain-containing protein (e value: 1.4e-24; % identity: 44.97)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS
        +PS+V+N  +PF  L  ++P+Y  L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+  HKGY+CL P+ +IY SRHV  NET+FPY  LFS   SS+ S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS

Query:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP
        S G       N   S  P   P   P   L   PH S  +   S  +SP
Subjt:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP

A0A5C7HMG8 Integrase catalytic domain-containing protein (e value: 1.4e-24; % identity: 44.97)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS
        +PS+V+N  +PF  L  ++P+Y  L+TFG +C+P LR Y +HKFDFH+ KC+F+ Y+  HKGY+CL P+ +IY SRHV  NET+FPY  LFS   SS+ S
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFS-GFSSEGS

Query:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP
        S G       N   S  P   P   P   L   PH S  +   S  +SP
Subjt:  SIG-------NTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPISSENSP

A0A803Q615 Uncharacterized protein (e value: 4.7e-25; % identity: 40.52)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSS
        +P+ V+ G++P  +L  K+PDY  L+TFG TCYPCLRPYQ HKF +H+ KCV + Y+D HKGY+CLS   R+Y SR+V  NE +FP+   F        +
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSS

Query:  IGNTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPI----SSENSPSYTVQNHS
        +  ++ SW   +   +IP S        S+P  P      SE  PS     H+
Subjt:  IGNTILSWLPVAAPSSIPQSDLMAVPHTSQPLVPI----SSENSPSYTVQNHS

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 6.6e-16; % identity: 33.56)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLS-PASRIYNSRHVCINETKFPYQQLFSGFSSEGS
        +P+ ++  ++PF  L    P+Y  LR FG  CYP LRPY QHK D  + +CVF+ Y+     Y CL    SR+Y SRHV  +E  FP+    +  S    
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLS-PASRIYNSRHVCINETKFPYQQLFSGFSSEGS

Query:  SIGNTILSWLP-VAAPSSIPQSDLMAVPHTSQPLVPISSENSPSYTVQN
            +   W P    P+  P   ++  P  S P    +  +SPS   +N
Subjt:  SIGNTILSWLP-VAAPSSIPQSDLMAVPHTSQPLVPISSENSPSYTVQN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.3e-16; % identity: 32.74)
Query:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLS-PASRIYNSRHVCINETKFPYQQLFSGFSSEGS
        +P+ ++  Q+PF  L  + P+Y  L+ FG  CYP LRPY +HK +  +++C F+ Y+     Y CL  P  R+Y SRHV  +E  FP+     G S+   
Subjt:  MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLS-PASRIYNSRHVCINETKFPYQQLFSGFSSEGS

Query:  SIGNTILSW-----LPV------AAPSSIPQSDLMAVPHTS-QPL--VPISSENSPSYTVQNHSGDAP
           ++  +W     LP       A P   P  D    P +S  PL    +SS N PS ++ + S   P
Subjt:  SIGNTILSW-----LPV------AAPSSIPQSDLMAVPHTS-QPL--VPISSENSPSYTVQNHSGDAP

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCTTCAACTGTCATAAATGGGCAGAATCCTTTCAGCATCCTAAATCTCAAGCAACCTGATTATCACAGTTTAAGAACCTTTGGATTAACTTGTTATCCTTGCTTAAG
ACCATATCAGCAGCATAAATTTGATTTTCACACTGAGAAATGTGTCTTCATAAGCTACAATGATCACCATAAAGGCTACCGGTGTCTTAGTCCGGCTAGCAGAATTTATA
ATTCTCGTCATGTGTGCATTAATGAAACTAAATTTCCATATCAACAACTGTTTTCAGGCTTTAGCAGTGAAGGCTCATCCATTGGCAACACAATTCTCTCTTGGCTGCCA
GTTGCTGCTCCGTCTTCCATCCCTCAATCAGATCTTATGGCAGTACCACACACCAGCCAGCCCTTGGTTCCTATATCTTCAGAGAATTCCCCTTCATATACTGTACAAAA
TCACAGTGGAGATGCCCCGCTGGTTCTCCTCAAAATGTTTCTCACTATCAAGCTGCACCTTTACACTCTCCTACCTCTAACTCTGCAATATCTCACCATGGTAGGACAAC
AATTACATCAGCAACATGAGAATATCCTCCTCCTACTACAGCTAGCAATGCTCATCCCATGGTTACACGTGCTAAGGGCTGGAATTTCCAAACCTAAACAGTTCTTTGGT
GGCTTTGCTCAAATCTCTTCTGCTATAGATTGA
mRNA sequence
ATGCCTTCAACTGTCATAAATGGGCAGAATCCTTTCAGCATCCTAAATCTCAAGCAACCTGATTATCACAGTTTAAGAACCTTTGGATTAACTTGTTATCCTTGCTTAAG
ACCATATCAGCAGCATAAATTTGATTTTCACACTGAGAAATGTGTCTTCATAAGCTACAATGATCACCATAAAGGCTACCGGTGTCTTAGTCCGGCTAGCAGAATTTATA
ATTCTCGTCATGTGTGCATTAATGAAACTAAATTTCCATATCAACAACTGTTTTCAGGCTTTAGCAGTGAAGGCTCATCCATTGGCAACACAATTCTCTCTTGGCTGCCA
GTTGCTGCTCCGTCTTCCATCCCTCAATCAGATCTTATGGCAGTACCACACACCAGCCAGCCCTTGGTTCCTATATCTTCAGAGAATTCCCCTTCATATACTGTACAAAA
TCACAGTGGAGATGCCCCGCTGGTTCTCCTCAAAATGTTTCTCACTATCAAGCTGCACCTTTACACTCTCCTACCTCTAACTCTGCAATATCTCACCATGGTAGGACAAC
AATTACATCAGCAACATGAGAATATCCTCCTCCTACTACAGCTAGCAATGCTCATCCCATGGTTACACGTGCTAAGGGCTGGAATTTCCAAACCTAAACAGTTCTTTGGT
GGCTTTGCTCAAATCTCTTCTGCTATAGATTGA
Protein sequence
MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSWLP
VAAPSSIPQSDLMAVPHTSQPLVPISSENSPSYTVQNHSGDAPLVLLKMFLTIKLHLYTLLPLTLQYLTMVGQQLHQQHENILLLLQLAMLIPWLHVLRAGISKPKQFFG
GFAQISSAID
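
The genome location and the protein sequence in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database: it assumes the `contig:start..end` coordinates are 1-based and inclusive, and that the gene is a single exon with no UTRs (the record lists identical CDS and mRNA sequences). The helper names `parse_location` and `span_length` are hypothetical.

```python
import re

# Protein sequence copied from this record (three lines joined).
PROTEIN = (
    "MPSTVINGQNPFSILNLKQPDYHSLRTFGLTCYPCLRPYQQHKFDFHTEKCVFISYNDHHKGYRCLSPASRIYNSRHVCINETKFPYQQLFSGFSSEGSSIGNTILSWLP"
    "VAAPSSIPQSDLMAVPHTSQPLVPISSENSPSYTVQNHSGDAPLVLLKMFLTIKLHLYTLLPLTLQYLTMVGQQLHQQHENILLLLQLAMLIPWLHVLRAGISKPKQFFG"
    "GFAQISSAID"
)


def parse_location(loc):
    """Split 'contig:start..end' into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153107:369846..370538")
genomic_span = span_length(start, end)

# One codon per residue plus one stop codon.
expected_cds_length = 3 * (len(PROTEIN) + 1)

print(contig, genomic_span, expected_cds_length)
```

Under these assumptions the genomic span and the CDS length implied by the protein agree, which is what an intronless gene model would show.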