; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016711 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016711
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationtig00152985:1419606..1422958
RNA-Seq ExpressionSgr016711
SyntenySgr016711
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABG37663.1 CCHC-type integrase [Populus trichocarpa]3.0e-2136.89Show/hide
Query:  KATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETN-LLFMASH-IDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSR
        K T    K    K+K  +   +CNN GH +++C AK+ Q+ Q+  Q AN ++KE E +  LFMAS  I  ++   WLIDSGCT++M K + IFS IDKS 
Subjt:  KATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETN-LLFMASH-IDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSR

Query:  KSKVVLGHGESILTEGRDSYRDFEEQ--KIMRNSPTSIE---DLKQIESS---FDL--IDSNNQDSQDPPL---DDDDLVEATGDDPNYKVRSLADVYES
        + KV LG+G+ +  +GR +     ++  +I   S   ++   D+   E+S   +DL  +D     + +P +    D   +E T D    KVR L+DVYE 
Subjt:  KSKVVLGHGESILTEGRDSYRDFEEQ--KIMRNSPTSIE---DLKQIESS---FDL--IDSNNQDSQDPPL---DDDDLVEATGDDPNYKVRSLADVYES

Query:  CNFVVVEPNSYYEASKSAKWVAAMK
        CN V  +P SY EA++   W+ A+K
Subjt:  CNFVVVEPNSYYEASKSAKWVAAMK

TXG53604.1 hypothetical protein EZV62_018860 [Acer yangbiense]1.1e-1831.07Show/hide
Query:  NEEHVASAFNAKSKGK-KHVEKDDRKATEDQGKRRS-----------------IKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIET-
        N+  +  AF   +KGK K   K D  ++E  GK +                   K K   H +Y    GH++ FC +K NQ    P QQAN  D++ +  
Subjt:  NEEHVASAFNAKSKGK-KHVEKDDRKATEDQGKRRS-----------------IKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIET-

Query:  NLLFMASHIDDNKSTS-WLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDSYR---------------DFEE--QKIMRNSPTSIEDLKQ
        + +FMAS    + S   W +DSGCT+HM +D  +F+ +DK+  + V +G+G+ +   G+DS +               DF +  QK +      + D++ 
Subjt:  NLLFMASHIDDNKSTS-WLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDSYR---------------DFEE--QKIMRNSPTSIEDLKQ

Query:  IESSFDLIDS------NNQDSQDPPLDDDDLVEATGDD---PNYKVRSLADVYESCNFVVVEPNSYYEASKSAKWVAAMK
        I  ++DL D+      + + + +   D D   E+  DD    + K RSLADVYE CN +V EP+ Y EA++  +W  AMK
Subjt:  IESSFDLIDS------NNQDSQDPPLDDDDLVEATGDD---PNYKVRSLADVYESCNFVVVEPNSYYEASKSAKWVAAMK

XP_022150313.1 uncharacterized protein LOC111018511 [Momordica charantia]2.5e-3643.62Show/hide
Query:  KGKKHVEKDDRKATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDIN
        K   HVEKD              KNKQVY+ DYCN DGH ++FCYAKQNQ  Q PQQQANCAD  +ETNLLF+ASHI +N+STSWLIDSGCTTHMAKDIN
Subjt:  KGKKHVEKDDRKATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDIN

Query:  IFSQIDKSRKSKVVLGHGESILTEGRDS--YRDFEEQKIMRN-------SPTSIEDLKQIESSFDLIDSNN-----------------------------
         FSQI+KS +SKVVL HGE+IL EG+ +      + +KI+ N       S   +   + + ++F ++  +                              
Subjt:  IFSQIDKSRKSKVVLGHGESILTEGRDS--YRDFEEQKIMRN-------SPTSIEDLKQIESSFDLIDSNN-----------------------------

Query:  ------------------QDSQDPPLDDDDLVEATGDDPNYKV
                           D+QDP  +DDDLV+ATGD  NYK+
Subjt:  ------------------QDSQDPPLDDDDLVEATGDDPNYKV

XP_025015337.1 uncharacterized protein LOC112536724 [Ricinus communis]1.0e-1632.42Show/hide
Query:  MGNEEHVASAFNAKSKGKKHVEKDDR------KATEDQ------------GKRRSIKNKQVYHSD---------YCNNDGHKKRFCYAKQNQTTQYPQQQ
        M +EE   SAF    KGKK   ++ R      +A E+             GK+ +   K  ++ D         +CN  GH ++FC AK+ Q  Q   Q+
Subjt:  MGNEEHVASAFNAKSKGKKHVEKDDR------KATEDQ------------GKRRSIKNKQVYHSD---------YCNNDGHKKRFCYAKQNQTTQYPQQQ

Query:  ANCADKEIETN---LLFMASHIDDNKST---SWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDS--------YRDFEEQKIMRNSPTS
           A KE +++    LFMAS    NK+T   +WLID+GCT+HM  D   F+ +DKS  ++V LG G ++   GR S        Y+ FE    + N    
Subjt:  ANCADKEIETN---LLFMASHIDDNKST---SWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDS--------YRDFEEQKIMRNSPTS

Query:  I-EDLKQIESSFDLIDSNNQDSQD------PPLD-------DDDL--VEATGDDPNYKVRSLADVYESCNFVVVEPNSYYEASKSAKWVAAMK
        +  D+   ESSF   +S+  +  +      P  +       +D+L  V  T D    K R LA+VY+ CNF+  EP ++ EASK  +W+ AMK
Subjt:  I-EDLKQIESSFDLIDSNNQDSQD------PPLD-------DDDL--VEATGDDPNYKVRSLADVYESCNFVVVEPNSYYEASKSAKWVAAMK

XP_034674366.1 LOW QUALITY PROTEIN: uncharacterized protein LOC117905578 [Vitis riparia]1.7e-1934.62Show/hide
Query:  GHKKRFCYAKQNQTTQYPQQQANCA--DKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGR-----DSYR
        GH +++C AK+ Q+ Q P+Q A+    DK  + +L   +  +  ++  +WLIDSGCT+HM K ++IF+ ID+S + KV LG+GE +  +G+      + R
Subjt:  GHKKRFCYAKQNQTTQYPQQQANCA--DKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGR-----DSYR

Query:  DFEEQKIMRNSPTSIEDLKQIESSF---DLIDSNNQDSQDP--------------PLDDDDLVEATGDDPNYKVRSLADVYESCNFVVVEPNSYYEASKS
         +    + R       D+   E+S+   DL   +  D   P              PLD    VEAT D    K+R L+DVYE CN V  EP  Y EA++ 
Subjt:  DFEEQKIMRNSPTSIEDLKQIESSF---DLIDSNNQDSQDP--------------PLDDDDLVEATGDDPNYKVRSLADVYESCNFVVVEPNSYYEASKS

Query:  AKWVAAMK
         +W+ AMK
Subjt:  AKWVAAMK

TrEMBL top hitse value%identityAlignment
A0A151QWM2 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-1332.93Show/hide
Query:  NEEHVASAFNAKSKGKKHVEKD--DRKATEDQGKRRS--------------------IKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADK-
        ++E V  AF AK  G+   +    ++   +D+G  R                      K+K ++H ++CN +GH +++C  K+NQ  Q  ++QAN  ++ 
Subjt:  NEEHVASAFNAKSKGKKHVEKD--DRKATEDQGKRRS--------------------IKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADK-

Query:  -EIETNLLFMAS-HIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGR
         + E   LFMAS   + ++  +WLIDSGCT+HM K ++ F+ IDKS K KV +G+GE +   G+
Subjt:  -EIETNLLFMAS-HIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGR

A0A5C7H9I3 DUF4219 domain-containing protein5.2e-1931.07Show/hide
Query:  NEEHVASAFNAKSKGK-KHVEKDDRKATEDQGKRRS-----------------IKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIET-
        N+  +  AF   +KGK K   K D  ++E  GK +                   K K   H +Y    GH++ FC +K NQ    P QQAN  D++ +  
Subjt:  NEEHVASAFNAKSKGK-KHVEKDDRKATEDQGKRRS-----------------IKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIET-

Query:  NLLFMASHIDDNKSTS-WLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDSYR---------------DFEE--QKIMRNSPTSIEDLKQ
        + +FMAS    + S   W +DSGCT+HM +D  +F+ +DK+  + V +G+G+ +   G+DS +               DF +  QK +      + D++ 
Subjt:  NLLFMASHIDDNKSTS-WLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDSYR---------------DFEE--QKIMRNSPTSIEDLKQ

Query:  IESSFDLIDS------NNQDSQDPPLDDDDLVEATGDD---PNYKVRSLADVYESCNFVVVEPNSYYEASKSAKWVAAMK
        I  ++DL D+      + + + +   D D   E+  DD    + K RSLADVYE CN +V EP+ Y EA++  +W  AMK
Subjt:  IESSFDLIDS------NNQDSQDPPLDDDDLVEATGDD---PNYKVRSLADVYESCNFVVVEPNSYYEASKSAKWVAAMK

A0A5J5ARX1 Uncharacterized protein5.9e-1529.11Show/hide
Query:  CNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDS-----
        CN  GH+   C  K     Q  + +A  AD+E+E  L         + S SWLIDSGCT HM     +F +++ +  +KV +G+G+ I  +G  +     
Subjt:  CNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESILTEGRDS-----

Query:  ----------------YRDFEEQKI----MRNSPTSIEDLKQIESSFDLIDSNNQDSQDPPLDDDDLVEATGDDPNYKVRSLADVYESCNFVVVEPNSYY
                         +D   Q+I    MR    S++ +++ +++F + +S  Q +Q+   + +DLV+   D P    R L+D+Y+ CN  V EP  Y 
Subjt:  ----------------YRDFEEQKI----MRNSPTSIEDLKQIESSFDLIDSNNQDSQDPPLDDDDLVEATGDDPNYKVRSLADVYESCNFVVVEPNSYY

Query:  EASKSAKWVAAMK
        +A K  +W+ AM+
Subjt:  EASKSAKWVAAMK

A0A6J1D946 uncharacterized protein LOC1110185111.2e-3643.62Show/hide
Query:  KGKKHVEKDDRKATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDIN
        K   HVEKD              KNKQVY+ DYCN DGH ++FCYAKQNQ  Q PQQQANCAD  +ETNLLF+ASHI +N+STSWLIDSGCTTHMAKDIN
Subjt:  KGKKHVEKDDRKATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETNLLFMASHIDDNKSTSWLIDSGCTTHMAKDIN

Query:  IFSQIDKSRKSKVVLGHGESILTEGRDS--YRDFEEQKIMRN-------SPTSIEDLKQIESSFDLIDSNN-----------------------------
         FSQI+KS +SKVVL HGE+IL EG+ +      + +KI+ N       S   +   + + ++F ++  +                              
Subjt:  IFSQIDKSRKSKVVLGHGESILTEGRDS--YRDFEEQKIMRN-------SPTSIEDLKQIESSFDLIDSNN-----------------------------

Query:  ------------------QDSQDPPLDDDDLVEATGDDPNYKV
                           D+QDP  +DDDLV+ATGD  NYK+
Subjt:  ------------------QDSQDPPLDDDDLVEATGDDPNYKV

Q0ZCC5 CCHC-type integrase1.5e-2136.89Show/hide
Query:  KATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETN-LLFMASH-IDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSR
        K T    K    K+K  +   +CNN GH +++C AK+ Q+ Q+  Q AN ++KE E +  LFMAS  I  ++   WLIDSGCT++M K + IFS IDKS 
Subjt:  KATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETN-LLFMASH-IDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSR

Query:  KSKVVLGHGESILTEGRDSYRDFEEQ--KIMRNSPTSIE---DLKQIESS---FDL--IDSNNQDSQDPPL---DDDDLVEATGDDPNYKVRSLADVYES
        + KV LG+G+ +  +GR +     ++  +I   S   ++   D+   E+S   +DL  +D     + +P +    D   +E T D    KVR L+DVYE 
Subjt:  KSKVVLGHGESILTEGRDSYRDFEEQ--KIMRNSPTSIE---DLKQIESS---FDL--IDSNNQDSQDPPL---DDDDLVEATGDDPNYKVRSLADVYES

Query:  CNFVVVEPNSYYEASKSAKWVAAMK
        CN V  +P SY EA++   W+ A+K
Subjt:  CNFVVVEPNSYYEASKSAKWVAAMK

SwissProt top hitse value%identityAlignment
P25601 Putative transposon Ty5-1 protein YCL075W3.5e-0440.38Show/hide
Query:  LLFMASHIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESI
        L  ++S +   KS+ W+ D+GCT+HM  D +IFS   +S +   V G G SI
Subjt:  LLFMASHIDDNKSTSWLIDSGCTTHMAKDINIFSQIDKSRKSKVVLGHGESI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAATGAAGAGCATGTTGCGAGTGCATTTAATGCCAAGTCCAAAGGCAAGAAACATGTTGAAAAGGATGATAGAAAGGCAACCGAAGATCAAGGGAAAAGAAGAAG
CATCAAGAACAAACAAGTATATCATTCTGATTATTGCAACAATGATGGTCATAAAAAGAGGTTTTGTTATGCCAAACAAAACCAAACGACCCAATATCCCCAGCAACAAG
CAAATTGTGCAGACAAAGAGATAGAGACAAATTTATTGTTTATGGCTTCTCATATTGATGACAATAAGTCAACCTCATGGCTTATTGACAGTGGATGTACCACCCACATG
GCTAAAGATATCAATATTTTTAGCCAAATTGACAAATCTAGAAAGTCTAAGGTGGTCCTTGGTCATGGAGAGTCAATACTGACTGAAGGTAGAGACTCTTATAGGGATTT
TGAGGAGCAAAAAATTATGAGAAACTCACCCACCAGCATTGAAGATTTGAAACAAATTGAATCTTCCTTTGATTTAATTGATTCTAATAATCAAGATAGTCAAGATCCTC
CTTTGGATGATGATGACCTTGTTGAAGCAACTGGCGACGATCCAAATTATAAGGTAAGATCACTTGCTGATGTATATGAAAGCTGTAATTTTGTTGTTGTTGAACCAAAT
AGCTATTATGAAGCTTCAAAATCTGCTAAATGGGTTGCTGCTATGAAAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTAATGAAGAGCATGTTGCGAGTGCATTTAATGCCAAGTCCAAAGGCAAGAAACATGTTGAAAAGGATGATAGAAAGGCAACCGAAGATCAAGGGAAAAGAAGAAG
CATCAAGAACAAACAAGTATATCATTCTGATTATTGCAACAATGATGGTCATAAAAAGAGGTTTTGTTATGCCAAACAAAACCAAACGACCCAATATCCCCAGCAACAAG
CAAATTGTGCAGACAAAGAGATAGAGACAAATTTATTGTTTATGGCTTCTCATATTGATGACAATAAGTCAACCTCATGGCTTATTGACAGTGGATGTACCACCCACATG
GCTAAAGATATCAATATTTTTAGCCAAATTGACAAATCTAGAAAGTCTAAGGTGGTCCTTGGTCATGGAGAGTCAATACTGACTGAAGGTAGAGACTCTTATAGGGATTT
TGAGGAGCAAAAAATTATGAGAAACTCACCCACCAGCATTGAAGATTTGAAACAAATTGAATCTTCCTTTGATTTAATTGATTCTAATAATCAAGATAGTCAAGATCCTC
CTTTGGATGATGATGACCTTGTTGAAGCAACTGGCGACGATCCAAATTATAAGGTAAGATCACTTGCTGATGTATATGAAAGCTGTAATTTTGTTGTTGTTGAACCAAAT
AGCTATTATGAAGCTTCAAAATCTGCTAAATGGGTTGCTGCTATGAAAGTTTGA
Protein sequenceShow/hide protein sequence
MGNEEHVASAFNAKSKGKKHVEKDDRKATEDQGKRRSIKNKQVYHSDYCNNDGHKKRFCYAKQNQTTQYPQQQANCADKEIETNLLFMASHIDDNKSTSWLIDSGCTTHM
AKDINIFSQIDKSRKSKVVLGHGESILTEGRDSYRDFEEQKIMRNSPTSIEDLKQIESSFDLIDSNNQDSQDPPLDDDDLVEATGDDPNYKVRSLADVYESCNFVVVEPN
SYYEASKSAKWVAAMKV