; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028357 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028357
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationtig00153057:2232515..2236362
RNA-Seq ExpressionSgr028357
SyntenySgr028357
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB38291.1 hypothetical protein L484_013924 [Morus notabilis]8.7e-3546.91Show/hide
Query:  GEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQD
        G I  +   +  EMP+FDG +P  W   AE YF ++++T  +KL V+V+S EG+AL W+QW + R   ++W  L+  LL  FRP++EG   E+FL+++Q+
Subjt:  GEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQD

Query:  STVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK
        +TV DY R+FE +A  L +LSE +++STFVKGL  +IRAEIR MKP  L  IME AQ +E++
Subjt:  STVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK

KAA0037917.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.5e-3441.71Show/hide
Query:  SENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEI----AEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAE
        SE  G+ +  RK  V    G   EGE +G++   +     +KR K K  EMP+F+G DP+ W + AEHYF++H L   +KL ++V+S EG  L W++WAE
Subjt:  SENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEI----AEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAE

Query:  NRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK
        NRK F++W +L++R+   FR    G    +FLAIKQ+ TV +Y ++FE ++  LP+++E +++ TF  GL  +IR E+  M+ V LE++ME AQL E+K
Subjt:  NRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK

KAA0038377.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa]4.3e-3438.76Show/hide
Query:  KKQVSHSENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQW
        ++Q   SE  G+ +  RK+   +      EGE          + R K K  EMP+F+G DP  WI+ AEHYF++H L   +KL ++++S EG  L W++W
Subjt:  KKQVSHSENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQW

Query:  AENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIED
        AENRK F++W++L++RL T FR  + G    +FLAIKQ+ +V +Y +RFE ++  LP+++E ++   F  GL   IR E+  M+ V LE++M+ A+L E+
Subjt:  AENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIED

Query:  KEAIDSPAH
        K  I   +H
Subjt:  KEAIDSPAH

TYK03081.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.3e-3437.39Show/hide
Query:  SENDGLRSTDRK-------VDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQ
        SE  G+ +  RK       ++ GE       GE  G       ++R K K  EMP+F+  DP+ W + AEHYF++H L+  +KL ++V+S EG  L W++
Subjt:  SENDGLRSTDRK-------VDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQ

Query:  WAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIE
        WAENRK F++W +L++R+   FR    G    +FLAIKQ+ TV +Y ++FE ++  LP+++E +++ TF  GL  +IR E+  M+ V LE++ME AQL E
Subjt:  WAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIE

Query:  DKE--AIDSPAHEATSNRAYKKMIFVGPVSKTHESALT
        +K   +   P     S    K  +  GP  KT+ S  T
Subjt:  DKE--AIDSPAHEATSNRAYKKMIFVGPVSKTHESALT

XP_022897442.1 uncharacterized protein LOC111411108 [Olea europaea var. sylvestris]1.0e-3538.5Show/hide
Query:  RAMKALRLKKQVSHSENDGLRSTDRKVD--VGESSGNLFEGECQGKSMGEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFE
        R  K +  +K  + S   GL  +    +  VG S+G+ + GE + + +            EMP+F+GNDP  W+F  E YF +++L+  +KL  + + F+
Subjt:  RAMKALRLKKQVSHSENDGLRSTDRKVD--VGESSGNLFEGECQGKSMGEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFE

Query:  GDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEI
        G+AL W+QW E R+  + WEDL+  LL  FRP +EG    QFL+++Q +TV +Y RRFET+A  L  +SE +++ +F+ GL ++IRAE++ ++P+ LE+I
Subjt:  GDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEI

Query:  MEKAQLIEDKEAI
        ME AQ++ED+  I
Subjt:  MEKAQLIEDKEAI

TrEMBL top hitse value%identityAlignment
A0A5A7T8L1 Ty3-gypsy retrotransposon protein1.2e-3441.71Show/hide
Query:  SENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEI----AEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAE
        SE  G+ +  RK  V    G   EGE +G++   +     +KR K K  EMP+F+G DP+ W + AEHYF++H L   +KL ++V+S EG  L W++WAE
Subjt:  SENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEI----AEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAE

Query:  NRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK
        NRK F++W +L++R+   FR    G    +FLAIKQ+ TV +Y ++FE ++  LP+++E +++ TF  GL  +IR E+  M+ V LE++ME AQL E+K
Subjt:  NRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK

A0A5D3BD16 Ty3/gypsy retrotransposon protein2.1e-3438.76Show/hide
Query:  KKQVSHSENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQW
        ++Q   SE  G+ +  RK+   +      EGE          + R K K  EMP+F+G DP  WI+ AEHYF++H L   +KL ++++S EG  L W++W
Subjt:  KKQVSHSENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQW

Query:  AENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIED
        AENRK F++W++L++RL T FR  + G    +FLAIKQ+ +V +Y +RFE ++  LP+++E ++   F  GL   IR E+  M+ V LE++M+ A+L E+
Subjt:  AENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIED

Query:  KEAIDSPAH
        K  I   +H
Subjt:  KEAIDSPAH

A0A5D3BTN5 Ty3-gypsy retrotransposon protein1.6e-3437.39Show/hide
Query:  SENDGLRSTDRK-------VDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQ
        SE  G+ +  RK       ++ GE       GE  G       ++R K K  EMP+F+  DP+ W + AEHYF++H L+  +KL ++V+S EG  L W++
Subjt:  SENDGLRSTDRK-------VDVGESSGNLFEGECQGKSMGEIAEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQ

Query:  WAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIE
        WAENRK F++W +L++R+   FR    G    +FLAIKQ+ TV +Y ++FE ++  LP+++E +++ TF  GL  +IR E+  M+ V LE++ME AQL E
Subjt:  WAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIE

Query:  DKE--AIDSPAHEATSNRAYKKMIFVGPVSKTHESALT
        +K   +   P     S    K  +  GP  KT+ S  T
Subjt:  DKE--AIDSPAHEATSNRAYKKMIFVGPVSKTHESALT

A0A5D3C5T9 Ty3-gypsy retrotransposon protein2.1e-3440Show/hide
Query:  KKQVSHSENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEI----AEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALK
        ++Q   SE  G+ +  RK  V    G   EGE +G++   +     ++R K K  EMP+F+G DP+ W + AEHYF++H L   +KL ++V+S EG  L 
Subjt:  KKQVSHSENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEI----AEKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALK

Query:  WYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQ
        W++WAENRK F++W +L++R+   FR    G    +FLAI+Q  TV +Y ++FE ++  LP+++E +++ TF  GL  +IR E+  M+ V LE++ME AQ
Subjt:  WYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQ

Query:  LIEDK
        L E+K
Subjt:  LIEDK

W9QTX5 Mediator of RNA polymerase II transcription subunit 254.2e-3546.91Show/hide
Query:  GEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQD
        G I  +   +  EMP+FDG +P  W   AE YF ++++T  +KL V+V+S EG+AL W+QW + R   ++W  L+  LL  FRP++EG   E+FL+++Q+
Subjt:  GEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQD

Query:  STVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK
        +TV DY R+FE +A  L +LSE +++STFVKGL  +IRAEIR MKP  L  IME AQ +E++
Subjt:  STVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein3.9e-0935.14Show/hide
Query:  EMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLK
        EMP+FDG+    W    E +F + +   + KL +  +S EG ALKW+    +   F++W    +RLL  F P+K
Subjt:  EMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEGDALKWYQWAENRKHFQNWEDLRKRLLTHFRPLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCCAAAGATGGAATGGTTTGCCATTGCCGTGCTATGAAGGCACTTAGATTGAAGAAACAAGTTTCTCATTCTGAAAATGATGGTTTGCGAAGCACCGACAGAAA
AGTCGACGTTGGTGAGTCTTCTGGAAATCTTTTCGAAGGGGAGTGTCAGGGTAAGAGTATGGGAGAGATTGCTGAGAAGCGCTATAAAAAAAATGAGGAAATGCCTATTT
TCGATGGTAATGATCCAAAACCTTGGATATTTTGCGCCGAACATTATTTCGAAATTCATCAACTCACTGCTGCAAAAAAATTATCAGTGTCAGTAATAAGTTTTGAGGGT
GATGCTCTAAAATGGTACCAGTGGGCTGAAAATCGTAAACATTTTCAAAATTGGGAAGATTTACGCAAGAGATTACTTACTCATTTTCGACCTTTAAAGGAAGGGGATCC
GTATGAACAGTTCTTAGCAATCAAACAGGACAGCACGGTACCAGATTATTCACGAAGGTTTGAAACGATGGCAGGACTACTACCCGACCTGTCTGAAAGCATAGTAAAGA
GTACTTTTGTTAAAGGGCTTCTTTCAAAAATTCGCGCTGAAATCAGATGCATGAAACCCGTCACATTGGAAGAGATTATGGAGAAGGCCCAACTAATTGAAGACAAAGAG
GCGATAGATTCCCCAGCCCACGAGGCAACTTCGAATCGGGCCTATAAGAAGATGATTTTTGTCGGTCCAGTGAGTAAAACCCATGAGAGTGCCTTGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCCAAAGATGGAATGGTTTGCCATTGCCGTGCTATGAAGGCACTTAGATTGAAGAAACAAGTTTCTCATTCTGAAAATGATGGTTTGCGAAGCACCGACAGAAA
AGTCGACGTTGGTGAGTCTTCTGGAAATCTTTTCGAAGGGGAGTGTCAGGGTAAGAGTATGGGAGAGATTGCTGAGAAGCGCTATAAAAAAAATGAGGAAATGCCTATTT
TCGATGGTAATGATCCAAAACCTTGGATATTTTGCGCCGAACATTATTTCGAAATTCATCAACTCACTGCTGCAAAAAAATTATCAGTGTCAGTAATAAGTTTTGAGGGT
GATGCTCTAAAATGGTACCAGTGGGCTGAAAATCGTAAACATTTTCAAAATTGGGAAGATTTACGCAAGAGATTACTTACTCATTTTCGACCTTTAAAGGAAGGGGATCC
GTATGAACAGTTCTTAGCAATCAAACAGGACAGCACGGTACCAGATTATTCACGAAGGTTTGAAACGATGGCAGGACTACTACCCGACCTGTCTGAAAGCATAGTAAAGA
GTACTTTTGTTAAAGGGCTTCTTTCAAAAATTCGCGCTGAAATCAGATGCATGAAACCCGTCACATTGGAAGAGATTATGGAGAAGGCCCAACTAATTGAAGACAAAGAG
GCGATAGATTCCCCAGCCCACGAGGCAACTTCGAATCGGGCCTATAAGAAGATGATTTTTGTCGGTCCAGTGAGTAAAACCCATGAGAGTGCCTTGACCTGA
Protein sequenceShow/hide protein sequence
MEPKDGMVCHCRAMKALRLKKQVSHSENDGLRSTDRKVDVGESSGNLFEGECQGKSMGEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAKKLSVSVISFEG
DALKWYQWAENRKHFQNWEDLRKRLLTHFRPLKEGDPYEQFLAIKQDSTVPDYSRRFETMAGLLPDLSESIVKSTFVKGLLSKIRAEIRCMKPVTLEEIMEKAQLIEDKE
AIDSPAHEATSNRAYKKMIFVGPVSKTHESALT