CuGenDBv2

Sgr018364 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018364
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: tig00153197:535207..536252
RNA-Seq Expression: Sgr018364
Synteny: Sgr018364
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
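
The genome location field follows a `contig:start..end` pattern. A minimal sketch of splitting it into parts is shown here; the `Location` type and `parse_location` function are hypothetical names, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153197:535207..536252")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the span for this gene comes out to 1046 positions.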


Homology
GenBank top hits (e value, % identity, alignment)
CAD1825734.1 unnamed protein product [Ananas comosus var. bracteatus] (e value: 4.4e-51; % identity: 72.93)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HIDG  PAP D TQL+QWK+KDARVMSWI GSCD Q+VLNLR Y +A+ MW YLKK+Y QT+SARRFQLECEI+NYTQ  LSIQDY+S FQ LWAE
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSDIVCAT+SK+S  DVLA+++++K DQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

DAD32765.1 TPA_asm: hypothetical protein HUJ06_011616 [Nelumbo nucifera] (e value: 2.0e-59; % identity: 84.96)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HI GTTP P DATQL QWKIKDARVMSWITGSCD QIVLNLR Y SAQ MW YLKK+Y QTNSARRFQLECEI+NYTQGSLSIQDYYSGFQNLWAE
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSDIVCA VSK+SL DVL +HEI+KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

DAD42694.1 TPA_asm: hypothetical protein HUJ06_000924 [Nelumbo nucifera] (e value: 7.0e-57; % identity: 81.95)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HIDGTTPAP DAT+LA+WKIKDARVM WITGSCD +IVLNLR Y SAQ MW YLKK+Y QTNSARRFQLECEI++YTQGSLSIQDYYS FQNLWAE
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSDIVCA VSK SL DVL ++EI+KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

XP_006346877.1 PREDICTED: uncharacterized protein LOC102591997 [Solanum tuberosum] (e value: 2.0e-48; % identity: 70.68)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HIDG+ PAPTDAT+L +WKIKDARVM+WI GS DP IVLNLR Y +A+AMW+YL+K+Y Q NSARRFQLE EI+NY+QG LS+QDY+SGFQNLWAE
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        F+DIV A +  ESL+ + A+HE +KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

XP_010266766.1 PREDICTED: uncharacterized protein LOC104604200 [Nelumbo nucifera] (e value: 5.5e-54; % identity: 80.45)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HIDGTTPAP DAT+LA+WKIKDARVMSWITGSCD +IVLNL  Y SAQ M  YLKK+Y QTNSARRFQLECEI +YTQGSLSIQDYYS FQNLW E
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSDIVCA VSK SL DVL ++EI+KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
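
The % identity column can be related to the alignment blocks: in BLAST-style pairwise output, the middle line shows a letter where query and subject residues are identical, `+` for a positively scoring substitution, and a space otherwise. A minimal sketch of that calculation follows; `percent_identity` is a hypothetical helper, the example midline is made up for illustration and is not taken from this record, and the alignment length is passed in explicitly because trailing spaces in a midline may be lost when a page is copied.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline mark identical positions; '+' and spaces
    mark non-identical positions and are not counted.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / alignment_length, 2)


# Toy midline (illustrative only): 6 identical positions out of 8.
print(percent_identity("QLW HI+G", 8))
```

For an alignment split across several blocks, as in the hits listed here, the midlines would be concatenated before counting.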

TrEMBL top hits (e value, % identity, alignment)
A0A1U8AL74 uncharacterized protein LOC104604200 (e value: 2.7e-54; % identity: 80.45)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HIDGTTPAP DAT+LA+WKIKDARVMSWITGSCD +IVLNL  Y SAQ M  YLKK+Y QTNSARRFQLECEI +YTQGSLSIQDYYS FQNLW E
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSDIVCA VSK SL DVL ++EI+KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

A0A5C7HJ24 CCHC-type domain-containing protein (e value: 4.0e-42; % identity: 66.17)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +L  HIDG+  APT+  +LA WK+KDARVMSWI G  DP IVLNLR Y +A+ MW YL K+Y Q N+ARRFQLE EI+NYTQG+LSIQDY+S FQNLW E
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSD+V A V   SL+ V A+HE +KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

A0A5C7IEH8 Uncharacterized protein (e value: 1.5e-44; % identity: 67.67)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HIDG+ PAPT+  +LA WK+KDARVMSWI GS DP IVLNLR Y +A+ MW YL K+Y Q N+A RFQLE EI+NYTQG+LSIQDY+S FQNLW E
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSD+V A V   SL+ V A+HE +KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

A0A5J5AIJ4 Uncharacterized protein (e value: 5.0e-45; % identity: 63.91)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW H+DG+ PAPTD  +L QWK+KDARVM+WI GS DP ++LNL+ + +A++MW YLKK+Y Q +SARRFQLE +++ Y+QG+LS+Q+Y+ GFQNLWAE
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSDIV A VS ESL+ V A+HE +KRDQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

A0A6V7P508 Uncharacterized protein (e value: 2.1e-51; % identity: 72.93)
Query:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE
        +LW HIDG  PAP D TQL+QWK+KDARVMSWI GSCD Q+VLNLR Y +A+ MW YLKK+Y QT+SARRFQLECEI+NYTQ  LSIQDY+S FQ LWAE
Subjt:  QLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSGFQNLWAE

Query:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
        FSDIVCAT+SK+S  DVLA+++++K DQFLMKL
Subjt:  FSDIVCATVSKESLTDVLAIHEINKRDQFLMKL

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.1e-04; % identity: 27.91)
Query:  GTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSG
        GT  AP       +WK +D  + S + G+    +   +   ++A  +W  L+KIYA  +     QL  ++  +T+G+ +I DY  G
Subjt:  GTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEISNYTQGSLSIQDYYSG

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCCTACCTTTGGCCACTTGAAAGTAATGCAACATTCGGCCTCCGCCATCCAGAGGCACATGTCCTCGACTCTTCTAAGATTGGCACTCACCCTTGGCAATTATGGGA
ACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGTGATGTCTTGGATTACTGGGTCATGTGATCCTCAAA
TTGTTCTTAATTTACGTTCCTATAGCAGCGCTCAAGCCATGTGGAACTATTTAAAAAAGATTTATGCTCAAACAAATTCAGCCAGGAGATTTCAATTGGAGTGTGAAATT
TCAAATTATACACAGGGGAGTCTCTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTACAGTGTCTAAAGAATCTCT
TACTGATGTTTTGGCTATTCATGAGATTAACAAGCGTGATCAGTTCTTGATGAAGCTATGA
mRNA sequence
ATGCCCTACCTTTGGCCACTTGAAAGTAATGCAACATTCGGCCTCCGCCATCCAGAGGCACATGTCCTCGACTCTTCTAAGATTGGCACTCACCCTTGGCAATTATGGGA
ACATATTGATGGTACTACTCCAGCACCAACAGATGCTACTCAGTTGGCTCAATGGAAGATCAAAGATGCTAGGGTGATGTCTTGGATTACTGGGTCATGTGATCCTCAAA
TTGTTCTTAATTTACGTTCCTATAGCAGCGCTCAAGCCATGTGGAACTATTTAAAAAAGATTTATGCTCAAACAAATTCAGCCAGGAGATTTCAATTGGAGTGTGAAATT
TCAAATTATACACAGGGGAGTCTCTCTATTCAGGATTACTATTCTGGTTTTCAAAATTTATGGGCTGAATTTTCTGATATAGTGTGTGCTACAGTGTCTAAAGAATCTCT
TACTGATGTTTTGGCTATTCATGAGATTAACAAGCGTGATCAGTTCTTGATGAAGCTATGA
Protein sequence
MPYLWPLESNATFGLRHPEAHVLDSSKIGTHPWQLWEHIDGTTPAPTDATQLAQWKIKDARVMSWITGSCDPQIVLNLRSYSSAQAMWNYLKKIYAQTNSARRFQLECEI
SNYTQGSLSIQDYYSGFQNLWAEFSDIVCATVSKESLTDVLAIHEINKRDQFLMKL
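
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one way to do that with the standard genetic code, assuming the CDS starts in frame and contains only A, C, G, and T; `translate` is a hypothetical helper written for this illustration, not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as line wraps) is removed first. Any trailing
    partial codon is ignored. A codon with characters other than
    A, C, G, T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nucleotides of the CDS sequence listed in this record.
print(translate("ATGCCCTACCTTTGGCCACTTGAA"))  # MPYLWPLE
```

Passing the full wrapped CDS text works as well, since line breaks are stripped before translation, and the result can be compared with the protein sequence shown above.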