; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024591 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024591
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationtig00001291:4720862..4721329
RNA-Seq ExpressionSgr024591
SyntenySgr024591
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047880.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.2e-1447.31Show/hide
Query:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDE-LYDNFLSEFVLPRPL
        ADY+ +K F CL +A TLP N + F P V P V    PS MK Y+L+DI K+KFF+S D++F E + PFHS   +   +  +FL +FV+P PL
Subjt:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDE-LYDNFLSEFVLPRPL

MCH80704.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium medium]1.5e-1338.97Show/hide
Query:  DYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRP-------LNV
        DY+ L+VFGC+CFAST+P   S F+P  IP V    P G+K Y+L  IE +K  I+ DV F ES FPFHS ++ D L  N  S+ VLP+P       ++ 
Subjt:  DYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRP-------LNV

Query:  EISLLSPLASSNSGCTDGFSYDAATSSSSPNASAPN
          ++++P +  + G       D   S S+ ++S  N
Subjt:  EISLLSPLASSNSGCTDGFSYDAATSSSSPNASAPN

TYK16758.1 Copia protein [Cucumis melo var. makuwa]6.8e-1441.91Show/hide
Query:  DYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISL---
        DY+ LKVFG LC+AS+LP+N S F    IP V    P GMKAY+L+DIE +K FIS DV+FHE+ FPFH+   ++++  + L  F LP+P + E +L   
Subjt:  DYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISL---

Query:  --LSPLASSNSGCTDGFSYDAATSSSSPNASAPNQP
          L P  +  +  T     D + S+   N     QP
Subjt:  --LSPLASSNSGCTDGFSYDAATSSSSPNASAPNQP

XP_022150855.1 uncharacterized protein LOC111018899 [Momordica charantia]1.5e-1650.93Show/hide
Query:  MADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLL
        + D+  L+VFGCLCFASTL  N S F    +P +    P G+KAYRL+DI  +KFFIS DVVFHE VFPFH  T +D + D F   FV P+ L+   S L
Subjt:  MADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLL

Query:  SPLASSNS
        S L +++S
Subjt:  SPLASSNS

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]5.9e-1841.06Show/hide
Query:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLLS
        ADYS LKVFGCLCF ST P N S F P  +  V    P GMK Y+L+DIE ++FF+S DV+FHES+FPFH+ ++   + D F    V+P+  +       
Subjt:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLLS

Query:  PLASSNSGCTDGFSYDAATSSSSPNASAPNQPTDVLDAFAPISATDGIPSN
         L  ++SG  D   ++ AT S+    SA   PT V+   +PI   +   +N
Subjt:  PLASSNSGCTDGFSYDAATSSSSPNASAPNQPTDVLDAFAPISATDGIPSN

TrEMBL top hitse value%identityAlignment
A0A2N9E374 Integrase catalytic domain-containing protein5.6e-1437.8Show/hide
Query:  YSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVE---ISLL
        +S LK+FGCLC+ASTL HN + FSP     V    P  +K Y++ D+   K FIS DV FHES+FPFH+        D F S  VLP  ++ E      +
Subjt:  YSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVE---ISLL

Query:  SPLASSNS--------GCTDGFSYDAATSSSSPNASAPNQ-PTDVLDAFAPISATDGIP-SNLV
         P + SNS           D  ++    S SSP+  + N  P   LD+ A I  +  +P SN V
Subjt:  SPLASSNS--------GCTDGFSYDAATSSSSPNASAPNQ-PTDVLDAFAPISATDGIP-SNLV

A0A5A7TXG6 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-1447.31Show/hide
Query:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDE-LYDNFLSEFVLPRPL
        ADY+ +K F CL +A TLP N + F P V P V    PS MK Y+L+DI K+KFF+S D++F E + PFHS   +   +  +FL +FV+P PL
Subjt:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDE-LYDNFLSEFVLPRPL

A0A5D3CZP1 Copia protein3.3e-1441.91Show/hide
Query:  DYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISL---
        DY+ LKVFG LC+AS+LP+N S F    IP V    P GMKAY+L+DIE +K FIS DV+FHE+ FPFH+   ++++  + L  F LP+P + E +L   
Subjt:  DYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISL---

Query:  --LSPLASSNSGCTDGFSYDAATSSSSPNASAPNQP
          L P  +  +  T     D + S+   N     QP
Subjt:  --LSPLASSNSGCTDGFSYDAATSSSSPNASAPNQP

A0A6J1D9M2 uncharacterized protein LOC1110188997.1e-1750.93Show/hide
Query:  MADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLL
        + D+  L+VFGCLCFASTL  N S F    +P +    P G+KAYRL+DI  +KFFIS DVVFHE VFPFH  T +D + D F   FV P+ L+   S L
Subjt:  MADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLL

Query:  SPLASSNS
        S L +++S
Subjt:  SPLASSNS

A0A6J1DNP7 uncharacterized protein LOC1110220652.9e-1841.06Show/hide
Query:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLLS
        ADYS LKVFGCLCF ST P N S F P  +  V    P GMK Y+L+DIE ++FF+S DV+FHES+FPFH+ ++   + D F    V+P+  +       
Subjt:  ADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFV----PSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLLS

Query:  PLASSNSGCTDGFSYDAATSSSSPNASAPNQPTDVLDAFAPISATDGIPSN
         L  ++SG  D   ++ AT S+    SA   PT V+   +PI   +   +N
Subjt:  PLASSNSGCTDGFSYDAATSSSSPNASAPNQPTDVLDAFAPISATDGIPSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGACTACTCGTTTTTAAAAGTCTTTGGCTGCCTTTGCTTTGCCTCAACCCTTCCGCACAATTACTCTAATTTTTCTCCGCATGTCATACCTTTTGTTCCCTCTGG
GATGAAAGCTTATCGGTTGTTTGACATTGAAAAGAGAAAATTCTTCATCTCCGGTGATGTGGTTTTTCATGAATCTGTCTTTCCCTTTCATTCTGCTACCTTGGAAGATG
AACTTTATGATAATTTTTTGTCTGAGTTTGTTCTTCCTCGACCGCTTAATGTTGAGATCTCCCTTCTTTCTCCTCTTGCCTCCTCAAATTCTGGTTGCACTGATGGTTTC
TCATATGATGCTGCAACTAGCTCTAGTTCTCCAAATGCTTCTGCTCCAAATCAACCAACTGATGTTCTTGATGCTTTTGCTCCTATTTCTGCTACTGATGGCATCCCTTC
TAACCTTGTTAGAATTGACCCTATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGACTACTCGTTTTTAAAAGTCTTTGGCTGCCTTTGCTTTGCCTCAACCCTTCCGCACAATTACTCTAATTTTTCTCCGCATGTCATACCTTTTGTTCCCTCTGG
GATGAAAGCTTATCGGTTGTTTGACATTGAAAAGAGAAAATTCTTCATCTCCGGTGATGTGGTTTTTCATGAATCTGTCTTTCCCTTTCATTCTGCTACCTTGGAAGATG
AACTTTATGATAATTTTTTGTCTGAGTTTGTTCTTCCTCGACCGCTTAATGTTGAGATCTCCCTTCTTTCTCCTCTTGCCTCCTCAAATTCTGGTTGCACTGATGGTTTC
TCATATGATGCTGCAACTAGCTCTAGTTCTCCAAATGCTTCTGCTCCAAATCAACCAACTGATGTTCTTGATGCTTTTGCTCCTATTTCTGCTACTGATGGCATCCCTTC
TAACCTTGTTAGAATTGACCCTATTTGA
Protein sequenceShow/hide protein sequence
MADYSFLKVFGCLCFASTLPHNYSNFSPHVIPFVPSGMKAYRLFDIEKRKFFISGDVVFHESVFPFHSATLEDELYDNFLSEFVLPRPLNVEISLLSPLASSNSGCTDGF
SYDAATSSSSPNASAPNQPTDVLDAFAPISATDGIPSNLVRIDPI