; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr000242 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr000242
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationtig00000119:19570..20163
RNA-Seq ExpressionSgr000242
SyntenySgr000242
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0051716 - cellular response to stimulus (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037097.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]5.8e-3446.25Show/hide
Query:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        A++   K  EMP+F G DP  W+F AE YF+IH+L+ +EK+ VS ISF+G AL WYR  E R+ F +W +L++RLL RFR S++    E+FL +KQ++TV
Subjt:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE
         DY   F+ +   L D+ + +VK TF+ G+   IRA++R  +P  L E+ME AQL+E++E
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE

KAA0037917.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.0e-3445.28Show/hide
Query:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        +KR K K  EMP+F+G DP+ W + AEHYF++H L   EKL ++V+S EG  L W+RWAE+RK F++W +L++R+  RFR    G    +FLAIKQ+ TV
Subjt:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK
         +Y  +FE ++  LP+++E +++ TF  G+  +IR ++  M+ V LE++ME AQL E+K
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK

TYK06549.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]5.8e-3446.25Show/hide
Query:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        A++   K  EMP+F G DP  W+F AE YF+IH+L+ +EK+ VS ISF+G AL WYR  E R+ F +W +L++RLL RFR S++    E+FL +KQ++TV
Subjt:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE
         DY   F+ +   L D+ + +VK TF+ G+   IRA++R  +P  L E+ME AQL+E++E
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE

TYK18994.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.8e-3444.65Show/hide
Query:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        +KR K K  EMP+F+G DP+ W + AEHYF++H L   EKL ++V+S EG  L W+RWAE+RK F++W +L++R+  RFR    G    +FLAIKQ+ TV
Subjt:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK
         +Y  +FE ++  LP+++E +++ TF  G+  +IR ++  M+ V L+++ME AQL E+K
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK

XP_022897442.1 uncharacterized protein LOC111411108 [Olea europaea var. sylvestris]3.1e-3545.1Show/hide
Query:  EMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTVPDYSLRFET
        EMP+F+GNDP  W+F  E YF +++L+  EKL  + + F+G+AL W++W E R+  + WEDL+  LL RFRPS+EG    QFL+++Q  TV +Y  RFET
Subjt:  EMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTVPDYSLRFET

Query:  MAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKEAI
        +A  L  +SE +++ +F+ G+ ++IRA+++ ++P+ LE+IME AQ++ED+  I
Subjt:  MAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKEAI

TrEMBL top hitse value%identityAlignment
A0A5A7T6B1 Transposon Ty3-G Gag-Pol polyprotein2.8e-3446.25Show/hide
Query:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        A++   K  EMP+F G DP  W+F AE YF+IH+L+ +EK+ VS ISF+G AL WYR  E R+ F +W +L++RLL RFR S++    E+FL +KQ++TV
Subjt:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE
         DY   F+ +   L D+ + +VK TF+ G+   IRA++R  +P  L E+ME AQL+E++E
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE

A0A5A7T8L1 Ty3-gypsy retrotransposon protein9.6e-3545.28Show/hide
Query:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        +KR K K  EMP+F+G DP+ W + AEHYF++H L   EKL ++V+S EG  L W+RWAE+RK F++W +L++R+  RFR    G    +FLAIKQ+ TV
Subjt:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK
         +Y  +FE ++  LP+++E +++ TF  G+  +IR ++  M+ V LE++ME AQL E+K
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK

A0A5D3BTN5 Ty3-gypsy retrotransposon protein6.2e-3439.57Show/hide
Query:  KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTVPDYSLR
        K  EMP+F+  DP+ W + AEHYF++H L+  EKL ++V+S EG  L W+RWAE+RK F++W +L++R+  RFR    G    +FLAIKQ+ TV +Y  +
Subjt:  KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTVPDYSLR

Query:  FETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKEAIDSPAHEATSNRAYKKMISVGPVSKTHESALT
        FE ++  LP+++E +++ TF  G+  +IR ++  M+ V LE++ME AQL E+K          +         ++GP  KT+ S  T
Subjt:  FETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKEAIDSPAHEATSNRAYKKMISVGPVSKTHESALT

A0A5D3C860 Transposon Tf2-1 polyprotein isoform X12.8e-3446.25Show/hide
Query:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        A++   K  EMP+F G DP  W+F AE YF+IH+L+ +EK+ VS ISF+G AL WYR  E R+ F +W +L++RLL RFR S++    E+FL +KQ++TV
Subjt:  AEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE
         DY   F+ +   L D+ + +VK TF+ G+   IRA++R  +P  L E+ME AQL+E++E
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKE

A0A5D3D605 Ty3-gypsy retrotransposon protein2.8e-3444.65Show/hide
Query:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV
        +KR K K  EMP+F+G DP+ W + AEHYF++H L   EKL ++V+S EG  L W+RWAE+RK F++W +L++R+  RFR    G    +FLAIKQ+ TV
Subjt:  EKRYK-KNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTV

Query:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK
         +Y  +FE ++  LP+++E +++ TF  G+  +IR ++  M+ V L+++ME AQL E+K
Subjt:  PDYSLRFETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein1.8e-0936.49Show/hide
Query:  EMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSK
        EMP+FDG+    W    E +F + +   ++KL +  +S EG ALKW+        F++W    +RLL RF P K
Subjt:  EMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGATTGCTGAGAAGCGCTATAAAAAAAATGAGGAAATGCCTATTTTCGATGGTAATGATCCAAAACCTTGGATATTTTGCGCCGAACATTATTTCGAAATTCA
TCAACTCACTGCTGCAGAAAAATTATCAGTGTCAGTAATAAGTTTTGAGGGTGATGCCCTAAAATGGTACCGGTGGGCTGAACATCGTAAACATTTTCAAAATTGGGAAG
ATTTACGCAAGAGATTACTTACTCGTTTTCGACCTTCAAAGGAAGGGGATCCGTATGAACAGTTCTTAGCAATCAAACAAGACAACACGGTACCAGATTATTCACTAAGG
TTTGAAACGATGGCAGGACTACTACCCGACCTGTCTGAAAGCATAGTAAAGAGTACTTTTGTTAAAGGGATTCTTTCAAAAATTCGGGCTAAAATCAGATGCATGAAACC
CGTCACATTGGAAGAGATTATGGAGAAGGCCCAACTAATTGAAGACAAAGAGGCGATAGATTCCCCAGCCCACGAGGCAACTTCGAATCGGGCCTATAAGAAGATGATTT
CTGTCGGTCCAGTGAGTAAAACCCATGAGAGTGCCTTGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAGATTGCTGAGAAGCGCTATAAAAAAAATGAGGAAATGCCTATTTTCGATGGTAATGATCCAAAACCTTGGATATTTTGCGCCGAACATTATTTCGAAATTCA
TCAACTCACTGCTGCAGAAAAATTATCAGTGTCAGTAATAAGTTTTGAGGGTGATGCCCTAAAATGGTACCGGTGGGCTGAACATCGTAAACATTTTCAAAATTGGGAAG
ATTTACGCAAGAGATTACTTACTCGTTTTCGACCTTCAAAGGAAGGGGATCCGTATGAACAGTTCTTAGCAATCAAACAAGACAACACGGTACCAGATTATTCACTAAGG
TTTGAAACGATGGCAGGACTACTACCCGACCTGTCTGAAAGCATAGTAAAGAGTACTTTTGTTAAAGGGATTCTTTCAAAAATTCGGGCTAAAATCAGATGCATGAAACC
CGTCACATTGGAAGAGATTATGGAGAAGGCCCAACTAATTGAAGACAAAGAGGCGATAGATTCCCCAGCCCACGAGGCAACTTCGAATCGGGCCTATAAGAAGATGATTT
CTGTCGGTCCAGTGAGTAAAACCCATGAGAGTGCCTTGACCTGA
Protein sequenceShow/hide protein sequence
MGEIAEKRYKKNEEMPIFDGNDPKPWIFCAEHYFEIHQLTAAEKLSVSVISFEGDALKWYRWAEHRKHFQNWEDLRKRLLTRFRPSKEGDPYEQFLAIKQDNTVPDYSLR
FETMAGLLPDLSESIVKSTFVKGILSKIRAKIRCMKPVTLEEIMEKAQLIEDKEAIDSPAHEATSNRAYKKMISVGPVSKTHESALT