; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020907 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020907
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationtig00153577:187925..191799
RNA-Seq ExpressionSgr020907
SyntenySgr020907
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]3.0e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]3.9e-3240Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLI-LRSHNEPPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC------------------
        N+VLWKFQ++ ILKAHKLFG++DG+   P      +   PPQ NP YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                   
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLI-LRSHNEPPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC------------------

Query:  ----------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-------
                              IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS       
Subjt:  ----------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-------

Query:  SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
          S  S L+    FNN          G G G+++G GR S
Subjt:  SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]3.0e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]3.0e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]3.0e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.4e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X31.4e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.4e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

A0A5D3CLI6 T4.51.4e-3240.08Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------
        N+VLWKFQ++ ILKAHKL+G+IDG+   P     S +    PPQ NP+YE W  KDQAL+ +INA+LS  ALAYVVG  +S+Q                 
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNE---PPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC----------------

Query:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----
                                IK+I ++LA VS  I++ED +IY +NGLP+ +N F+T++RTRSQ +TFE+LHVL+ +EE A+ +  K +DS     
Subjt:  ------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDS-----

Query:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS
            S  S L+    F+N          G G G+H+G GR S
Subjt:  --SLSMNSALAMFANFNNKGRGKGRCGSGGGRGRHFGGGRGS

A0A6J1E049 uncharacterized protein LOC1110251504.2e-3243.16Show/hide
Query:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLIL-----RSHNEPPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC--------------
        N+VLWKFQ++ ILKAHKL+G+IDGS   P   L      S + PP  NPA+ +W  KD AL+ L+NA LS +ALAYVVGC +SQQ               
Subjt:  NYVLWKFQISPILKAHKLFGYIDGSIATPDLIL-----RSHNEPPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQC--------------

Query:  --------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQ
                                  IK++ ++LA V V++D ED +IY +N LP  FN F+T++RTRSQ ++FE+LHVL+ SEE AI++
Subjt:  --------------------------IKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.6e-0626.53Show/hide
Query:  SEYSIGYTNSFDDNYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNEPPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQCIKDI
        S++SI   +  +DNYV WK +    L+  K FG+IDG++  PD            +P Y+ W   +  ++  +  S++   L  V+   T+ +  +D+
Subjt:  SEYSIGYTNSFDDNYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNEPPQVNPAYEQWYIKDQALIMLINASLSQTALAYVVGCHTSQQCIKDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATTTGGCAGAAAACAGGGTTCGAAGACTGAACGAGAACACACACAACAATTTCCGCGAGAAAAATTTTCAGAATATTCCATAGGATATACGAATTCCTTCGATGA
CAATTATGTGCTTTGGAAGTTTCAAATCTCTCCGATTTTGAAGGCTCACAAACTTTTTGGTTACATCGATGGTTCTATTGCTACGCCTGATCTGATTTTGAGGTCTCATA
ATGAACCTCCTCAAGTTAATCCAGCTTATGAGCAGTGGTATATCAAAGATCAAGCTCTTATCATGTTGATCAATGCATCTTTATCTCAAACGGCACTTGCTTATGTGGTT
GGTTGTCACACTTCTCAGCAATGCATCAAGGATATTGTCAATCGATTAGCTGCTGTTTCAGTTGTTATCGATCAAGAAGATCCTATTATTTACATTGTTAATGGCCTTCC
TTCCACTTTTAATGTCTTCAAGACTACTTTGCGCACTCGTTCACAGGATCTTACCTTTGAAGATTTACATGTTCTTATGAATTCTGAGGAGTGTGCTATTGAGCAGCACT
TTAAGACCGAAGATTCCTCTCTATCTATGAATTCTGCACTTGCTATGTTTGCAAATTTCAACAATAAAGGTCGTGGTAAAGGACGATGTGGATCTGGCGGTGGTCGTGGA
CGACATTTTGGTGGTGGTCGTGGTTCTATTGTTCCGTCTTCGAATCCATCCCGTTCCTTCGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAATTTGGCAGAAAACAGGGTTCGAAGACTGAACGAGAACACACACAACAATTTCCGCGAGAAAAATTTTCAGAATATTCCATAGGATATACGAATTCCTTCGATGA
CAATTATGTGCTTTGGAAGTTTCAAATCTCTCCGATTTTGAAGGCTCACAAACTTTTTGGTTACATCGATGGTTCTATTGCTACGCCTGATCTGATTTTGAGGTCTCATA
ATGAACCTCCTCAAGTTAATCCAGCTTATGAGCAGTGGTATATCAAAGATCAAGCTCTTATCATGTTGATCAATGCATCTTTATCTCAAACGGCACTTGCTTATGTGGTT
GGTTGTCACACTTCTCAGCAATGCATCAAGGATATTGTCAATCGATTAGCTGCTGTTTCAGTTGTTATCGATCAAGAAGATCCTATTATTTACATTGTTAATGGCCTTCC
TTCCACTTTTAATGTCTTCAAGACTACTTTGCGCACTCGTTCACAGGATCTTACCTTTGAAGATTTACATGTTCTTATGAATTCTGAGGAGTGTGCTATTGAGCAGCACT
TTAAGACCGAAGATTCCTCTCTATCTATGAATTCTGCACTTGCTATGTTTGCAAATTTCAACAATAAAGGTCGTGGTAAAGGACGATGTGGATCTGGCGGTGGTCGTGGA
CGACATTTTGGTGGTGGTCGTGGTTCTATTGTTCCGTCTTCGAATCCATCCCGTTCCTTCGATTAG
Protein sequenceShow/hide protein sequence
MQFGRKQGSKTEREHTQQFPREKFSEYSIGYTNSFDDNYVLWKFQISPILKAHKLFGYIDGSIATPDLILRSHNEPPQVNPAYEQWYIKDQALIMLINASLSQTALAYVV
GCHTSQQCIKDIVNRLAAVSVVIDQEDPIIYIVNGLPSTFNVFKTTLRTRSQDLTFEDLHVLMNSEECAIEQHFKTEDSSLSMNSALAMFANFNNKGRGKGRCGSGGGRG
RHFGGGRGSIVPSSNPSRSFD