; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018090 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018090
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationtig00153092:1354003..1354350
RNA-Seq ExpressionSgr018090
SyntenySgr018090
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN75698.1 hypothetical protein VITISV_035985 [Vitis vinifera]1.2e-3672Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S SH++QITTIRLNGDNFL WSQSVRMYIRG+ K+GYLTR KKAP  +DP++  WDA N++V++WLVNSM E+IS NYMCYP A+ELW+NVNQMY +LGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

RVW64704.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.2e-3672Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S SH++QITTIRLNGDNFL WSQSVRMYIRG+ K+GYLTR KKAP  +DP++  WDA N++V++WLVNSM E+IS NYMCYP A+ELW+NVNQMY +LGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

RVX21861.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.2e-3672Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S SH++QITTIRLNGDNFL WSQSVRMYIRG+ K+GYLTR KKAP  +DP++  WDA N++V++WLVNSM E+IS NYMCYP A+ELW+NVNQMY +LGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

XP_006471430.1 uncharacterized protein LOC102629445 [Citrus sinensis]2.9e-3875Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S+SH++QITTIRLNGDNFL WSQSVRMYIRGQ KIGY+T  KK P  +DP + TWDA N++V+TWLVNSM E+IS NYMCYP+AKELWDNV+QMY DLGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

XP_024032212.1 uncharacterized protein LOC112094741 [Morus notabilis]1.2e-3672.73Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLG
        +DSH++QITTIRLNGDNFL WSQ VRMYIRG+ KIGYLT   KAP E DP++ TWDA N++V+TWLVNSM E+I  NYMCYP+A+ELW+NVNQMY DLG
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLG

TrEMBL top hitse value%identityAlignment
A0A2N9EE05 Uncharacterized protein3.2e-3874Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S+SH++QITTIRLNGDNFL WSQSVRMYIRG+ K+GYLT  K AP E DP++ TWDA N++V+TWLVNSM E+IS NYMCYP+A+ELW+NVNQMY DLGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

A0A2N9G446 Uncharacterized protein3.2e-3874Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S+SH++QITTIRLNGDNFL WSQSVRMYIRG+ K+GYLT  K AP E DP++ TWDA N++V+TWLVNSM E+IS NYMCYP+A+ELW+NVNQMY DLGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

A0A2N9GKJ5 Uncharacterized protein3.5e-3773Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S+SH++QITTIRLN DNFL WSQSVRMYIRG+ K+GYLT  K AP E DP++ TWDA N++V+TWLVNSM E+IS NYMCYP+A+ELW+NVNQMY DLGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

A0A2N9GQ49 Uncharacterized protein3.2e-3874Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S+SH++QITTIRLNGDNFL WSQSVRMYIRG+ K+GYLT  K AP E DP++ TWDA N++V+TWLVNSM E+IS NYMCYP+A+ELW+NVNQMY DLGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

A0A2N9I543 Uncharacterized protein3.2e-3874Show/hide
Query:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN
        S+SH++QITTIRLNGDNFL WSQSVRMYIRG+ K+GYLT  K AP E DP++ TWDA N++V+TWLVNSM E+IS NYMCYP+A+ELW+NVNQMY DLGN
Subjt:  SDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYLDLGN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.5e-0824.69Show/hide
Query:  DNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYL
        DN+++W    R ++R   K G++      P    P +  W+  N +V+ WL+NSM +++  + M   +A ++W+++ ++++
Subjt:  DNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMYL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAGCCATCTTTTGCTAAAGTTTTTGACACCCGCATCGGTTCCGATAGTCATACTCTCCAAATTACCACTATCCGACTTAATGGGGATAATTTCCTTAGTTGGTC
CCAAAGTGTTCGGATGTATATTCGTGGTCAATGGAAGATTGGCTATCTTACCAGAGTGAAAAAGGCACCCATTGAGGAGGACCCTTCGTTTACCACTTGGGATGCTAGAA
ACAACGTGGTAATAACTTGGCTTGTGAATTCTATGGTTGAGGAAATTAGCGGTAACTACATGTGTTACCCTAGTGCAAAGGAATTATGGGATAACGTGAATCAAATGTAT
TTAGATTTGGGCAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAAGCCATCTTTTGCTAAAGTTTTTGACACCCGCATCGGTTCCGATAGTCATACTCTCCAAATTACCACTATCCGACTTAATGGGGATAATTTCCTTAGTTGGTC
CCAAAGTGTTCGGATGTATATTCGTGGTCAATGGAAGATTGGCTATCTTACCAGAGTGAAAAAGGCACCCATTGAGGAGGACCCTTCGTTTACCACTTGGGATGCTAGAA
ACAACGTGGTAATAACTTGGCTTGTGAATTCTATGGTTGAGGAAATTAGCGGTAACTACATGTGTTACCCTAGTGCAAAGGAATTATGGGATAACGTGAATCAAATGTAT
TTAGATTTGGGCAATTAG
Protein sequenceShow/hide protein sequence
MSKPSFAKVFDTRIGSDSHTLQITTIRLNGDNFLSWSQSVRMYIRGQWKIGYLTRVKKAPIEEDPSFTTWDARNNVVITWLVNSMVEEISGNYMCYPSAKELWDNVNQMY
LDLGN