; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018929 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018929
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationtig00153228:800687..801055
RNA-Seq ExpressionSgr018929
SyntenySgr018929
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8530341.1 hypothetical protein F0562_005050 [Nyssa sinensis]3.4e-0842.61Show/hide
Query:  DVTLPLSSNDVVPA---DVSSSTT-----IISYCGTSNVA----HKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQNYLSYSRLSP
        D+ LP S  DV PA     SSS T      + +     VA      A +  I  RKSTR IKPPSY  D+HCSL+   ++P S    PL  +LSY+ LS 
Subjt:  DVTLPLSSNDVVPA---DVSSSTT-----IISYCGTSNVA----HKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQNYLSYSRLSP

Query:  TFRNFVLNVSTHYEP
        + R FVL +S+H+EP
Subjt:  TFRNFVLNVSTHYEP

PNX84823.1 retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]4.8e-0735.25Show/hide
Query:  HSSITTHVSDDVTLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLG-IPIRKSTRPIKPPSYPNDFHCSLLDCS--------SMPLSCFKNPLQNYL
        ++ + THVSD      S +   P               +  +H +P+L  IP+R+STRP  PP Y  DFHCSLL  S        S+  S  K PL +++
Subjt:  HSSITTHVSDDVTLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLG-IPIRKSTRPIKPPSYPNDFHCSLLDCS--------SMPLSCFKNPLQNYL

Query:  SYSRLSPTFRNFVLNVSTHYEP
        SY  LS + ++F  N+ST  EP
Subjt:  SYSRLSPTFRNFVLNVSTHYEP

XP_022147774.1 uncharacterized protein LOC111016631 isoform X1 [Momordica charantia]3.7e-0736.29Show/hide
Query:  ILASGVDHSSITTHVSDDVTLPLSSNDV----VPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQN
        IL +   ++SI  H  D V +     DV      +D  SST+  +  G              I +S+R ++ PSY  D+HCSL   + +  S  K  LQ 
Subjt:  ILASGVDHSSITTHVSDDVTLPLSSNDV----VPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQN

Query:  YLSYSRLSPTFRNFVLNVSTHYEP
        YLSY +LSP ++ F+LNVSTH+EP
Subjt:  YLSYSRLSPTFRNFVLNVSTHYEP

XP_022147777.1 uncharacterized protein LOC111016631 isoform X2 [Momordica charantia]3.7e-0736.29Show/hide
Query:  ILASGVDHSSITTHVSDDVTLPLSSNDV----VPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQN
        IL +   ++SI  H  D V +     DV      +D  SST+  +  G              I +S+R ++ PSY  D+HCSL   + +  S  K  LQ 
Subjt:  ILASGVDHSSITTHVSDDVTLPLSSNDV----VPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQN

Query:  YLSYSRLSPTFRNFVLNVSTHYEP
        YLSY +LSP ++ F+LNVSTH+EP
Subjt:  YLSYSRLSPTFRNFVLNVSTHYEP

XP_031259419.1 uncharacterized protein LOC116117549 [Pistacia vera]3.7e-0738.79Show/hide
Query:  HSSITTHVSDDVTLPLSSNDVVPADVSSST--TIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKN-PLQNYLSYSRLS
        HSS+  H   D+      +  VP+ V+S++   I      +N    +P  G+ +RKSTR I  P Y  D+HC+LL  SS   S F + P+ NY+SY  LS
Subjt:  HSSITTHVSDDVTLPLSSNDVVPADVSSST--TIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKN-PLQNYLSYSRLS

Query:  PTFRNFVLNVSTHYEP
         + R FVL+VS+  EP
Subjt:  PTFRNFVLNVSTHYEP

TrEMBL top hitse value%identityAlignment
A0A2N9F376 Integrase catalytic domain-containing protein1.9e-0941.23Show/hide
Query:  SITTHVSDDVTLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSS---MPLSCFKNPLQNYLSYSRLSPT
        S+   +SD   +P      +P D S+ T   S    S +A  +PTL  P+RKS+R IKPPSY  D+H +L+  SS    P S   +P+QN LSYS LS +
Subjt:  SITTHVSDDVTLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSS---MPLSCFKNPLQNYLSYSRLSPT

Query:  FRNFVLNVSTHYEP
         + F L +ST  EP
Subjt:  FRNFVLNVSTHYEP

A0A2N9FBS5 Uncharacterized protein3.6e-0842.31Show/hide
Query:  TLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSL---LDCSSMPLSCFKNPLQNYLSYSRLSPTFRNFVLNVST
        +LPL+S+   P D  S+ +I      S +   AP+  +PIR+S+R +KPPSY  D+HCSL   L  S+MP +    P+Q+ LSYS LS + + F L +ST
Subjt:  TLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSL---LDCSSMPLSCFKNPLQNYLSYSRLSPTFRNFVLNVST

Query:  HYEP
          EP
Subjt:  HYEP

A0A2N9FYF9 Reverse transcriptase2.1e-0840.5Show/hide
Query:  ASGVDHSSITTHVSDDVTLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSS---MPLSCFKNPLQNYLS
        AS   H +    VS   T P+SS        S S+T       S +A  +PTL  P+RKS R +KPPSY  D+HC+L+  SS    P S   +P+QN LS
Subjt:  ASGVDHSSITTHVSDDVTLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSS---MPLSCFKNPLQNYLS

Query:  YSRLSPTFRNFVLNVSTHYEP
        YS LS + + F L + T  EP
Subjt:  YSRLSPTFRNFVLNVSTHYEP

A0A2N9GV85 Integrase catalytic domain-containing protein3.6e-0841.35Show/hide
Query:  TLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSL---LDCSSMPLSCFKNPLQNYLSYSRLSPTFRNFVLNVST
        +LPL+S+   P D  S+ +I      S +   AP+  +PIR+S+R +KPPSY  D+HC+L   L  S+MP +    P+Q+ LSYS LS + + F L +ST
Subjt:  TLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSL---LDCSSMPLSCFKNPLQNYLSYSRLSPTFRNFVLNVST

Query:  HYEP
          EP
Subjt:  HYEP

A0A5J5AMM8 Uncharacterized protein1.6e-0842.61Show/hide
Query:  DVTLPLSSNDVVPA---DVSSSTT-----IISYCGTSNVA----HKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQNYLSYSRLSP
        D+ LP S  DV PA     SSS T      + +     VA      A +  I  RKSTR IKPPSY  D+HCSL+   ++P S    PL  +LSY+ LS 
Subjt:  DVTLPLSSNDVVPA---DVSSSTT-----IISYCGTSNVA----HKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQNYLSYSRLSP

Query:  TFRNFVLNVSTHYEP
        + R FVL +S+H+EP
Subjt:  TFRNFVLNVSTHYEP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACATTCTAGCTTCTGGTGTTGATCATTCTTCTATTACTACACATGTTTCTGATGATGTTACTTTACCACTCTCATCTAATGATGTTGTTCCTGCTGATGTTAGCTC
ATCTACTACTATTATTTCATATTGTGGCACTTCTAATGTTGCTCATAAAGCACCTACTCTGGGTATACCAATTCGCAAGTCTACTCGACCAATCAAACCGCCTTCTTATC
CCAACGACTTTCATTGCAGCTTACTTGACTGTTCTAGCATGCCTCTTTCTTGTTTTAAAAATCCATTACAAAATTATCTATCTTATTCGAGGCTATCACCTACTTTTCGG
AATTTTGTTTTAAATGTTTCTACTCACTATGAGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACATTCTAGCTTCTGGTGTTGATCATTCTTCTATTACTACACATGTTTCTGATGATGTTACTTTACCACTCTCATCTAATGATGTTGTTCCTGCTGATGTTAGCTC
ATCTACTACTATTATTTCATATTGTGGCACTTCTAATGTTGCTCATAAAGCACCTACTCTGGGTATACCAATTCGCAAGTCTACTCGACCAATCAAACCGCCTTCTTATC
CCAACGACTTTCATTGCAGCTTACTTGACTGTTCTAGCATGCCTCTTTCTTGTTTTAAAAATCCATTACAAAATTATCTATCTTATTCGAGGCTATCACCTACTTTTCGG
AATTTTGTTTTAAATGTTTCTACTCACTATGAGCCTTAA
Protein sequenceShow/hide protein sequence
MHILASGVDHSSITTHVSDDVTLPLSSNDVVPADVSSSTTIISYCGTSNVAHKAPTLGIPIRKSTRPIKPPSYPNDFHCSLLDCSSMPLSCFKNPLQNYLSYSRLSPTFR
NFVLNVSTHYEP