; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015474 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015474
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationtig00003991:663373..665167
RNA-Seq ExpressionSgr015474
SyntenySgr015474
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]4.1e-2549.63Show/hide
Query:  QKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKN
        Q++   L+ V   +T  +N  L  L  +  E++  TL QMF TKA G+DG  ALFFQKYW IVG K  + CL +LNG  SVR+ N T IALI KVK P  
Subjt:  QKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKN

Query:  VGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        V +F+PI LC   YK+I K IAN L  +LP +I+E
Subjt:  VGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]4.1e-2543.53Show/hide
Query:  WRWGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGT
        WR  +  V +TE  +  + + +    +F +T    Q++   L+ V   +T  +N  L  L  +  E++  TL QMF TKA G+DG  ALFFQKYW IVG 
Subjt:  WRWGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGT

Query:  KTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        K  + CL +LNG  SVR+ N T IALI KVK P  V +F+PI LC   YK+I K IAN L  +LP +I+E
Subjt:  KTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

XP_014758739.1 uncharacterized protein LOC106866882 [Brachypodium distachyon]2.8e-2641.24Show/hide
Query:  EIQVSLRSYSNSHIDAGLTMGDRNWRWGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMF
        +++  L S   S+ DAG  +G++    G   V E E    +E  I+  + +F++   N       L +V  RVT E+N FLS   PY E D+R+ +  + 
Subjt:  EIQVSLRSYSNSHIDAGLTMGDRNWRWGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMF

Query:  TTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
          KA G DG  A+F++ YW+IVG +     LNV+NGG      N T IALI KVK+P N+ D +PI LCNV YKII K +A  L  ILP IISE
Subjt:  TTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.2e-3051.82Show/hide
Query:  NLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQP
        N   I A ++ +  R+T EVN  L  L PY +E+I   + QMF TKALG DGF ALF+Q YW +VG KT+EACLN LN G  ++  N T IALI K+KQP
Subjt:  NLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQP

Query:  KNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        +++ DF+PI LCNV YKII+K+I N L  ++  +IS+
Subjt:  KNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]1.4e-2541.24Show/hide
Query:  GLTMGDRNWRWGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQ
        G+     NW    +E  E   F+F +++ N    +F T++PN  +I AALS ++ RV+ E+N  L   +P+  E++ + L QM  TKA G DG  A+FFQ
Subjt:  GLTMGDRNWRWGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQ

Query:  KYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIIS
        K+W+ V    +  CL++LN    V   N T I LI K  +P+ V DF+PI LCNV Y+I+ KAIAN L  +LP +IS
Subjt:  KYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIIS

TrEMBL top hitse value%identityAlignment
A0A2N9GPZ7 Reverse transcriptase domain-containing protein5.6e-2842.33Show/hide
Query:  VVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACL
        V +TE  K  E  ++    IF ++ P+ + I   L  + + VT  +N  L A   + ++++   L QM+ TKA G DG  A+F+Q YW+IVG +  +A L
Subjt:  VVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACL

Query:  NVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        ++L+ G  +R +N T+IALI KVK P+N+ DF+PI LCNV YKI++K +AN L K+LP +ISE
Subjt:  NVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

A0A2N9I335 Reverse transcriptase domain-containing protein5.6e-2841.57Show/hide
Query:  NLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVE
        N  V++T+  K     ++    IF ++ P  + I + L  +   VT+E+N  L  L  +  E++ + L QM+ TKA G DG  A+F+Q YW+IVG +  +
Subjt:  NLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVE

Query:  ACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        A L++L+ G  V  +N T+IALI KVK P+ + DF+PI LCNV YKI++K +AN L K+LP +ISE
Subjt:  ACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

A0A2N9IBV7 Reverse transcriptase domain-containing protein2.1e-2744.05Show/hide
Query:  WGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKT
        W +  V  T+  + E+  ++  D IF T+ P    +   L++VN+RVT EVN  L  L P+  +++R  L QM  +KA G DG  + FFQKYW IVG   
Subjt:  WGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKT

Query:  VEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        V A L+VLN G  +R +N T+I+LI K K P+ + +++PI LCNV YKII+K +AN L  +LP IIS+
Subjt:  VEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

A0A2N9IPS8 Reverse transcriptase domain-containing protein5.6e-2842.33Show/hide
Query:  VVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACL
        V +TE  K  E  ++    IF ++ P+ + I   L  + + VT  +N  L A   + ++++   L QM+ TKA G DG  A+F+Q YW+IVG +  +A L
Subjt:  VVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACL

Query:  NVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        ++L+ G  +R +N T+IALI KVK P+N+ DF+PI LCNV YKI++K +AN L K+LP +ISE
Subjt:  NVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

A0A6J1DX30 uncharacterized protein LOC1110248741.6e-3051.82Show/hide
Query:  NLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQP
        N   I A ++ +  R+T EVN  L  L PY +E+I   + QMF TKALG DGF ALF+Q YW +VG KT+EACLN LN G  ++  N T IALI K+KQP
Subjt:  NLQKIRAALSSVNARVTKEVNAFLSALVPYREEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQP

Query:  KNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE
        +++ DF+PI LCNV YKII+K+I N L  ++  +IS+
Subjt:  KNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.1e-0734.83Show/hide
Query:  EEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIIT
        +++I   +  M   KA G D F A FF + W +V   T+ A       G  ++  N T I LI KV     +  F+P+  C V YKIIT
Subjt:  EEDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIIT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGGACGAGCGGACCCCTTAGACTATGTCTTCCATTACATGCAGTTCATGTCGATGGATGAGATCCAAGTATCATTGCGATCTTATTCCAATAGTCATATTGATGC
TGGGCTAACGATGGGAGACCGAAACTGGAGATGGGGGAACCTAGAGGTGGTAGAGACAGAAACCTTCAAATTTGAAGAATTTTGGATCAATGCCAAGGATTACATCTTCA
AAACAACGGAACCAAACTTACAAAAGATTAGAGCAGCATTATCTTCCGTCAATGCTAGAGTAACGAAAGAAGTGAATGCTTTTCTGTCAGCACTCGTACCTTATAGGGAG
GAAGACATCAGAAAGACTCTTTTACAAATGTTCACAACAAAGGCTCTAGGTTATGATGGTTTTCTGGCCCTATTCTTTCAGAAATATTGGGAGATAGTTGGAACCAAAAC
AGTAGAAGCTTGCCTTAATGTCTTAAATGGTGGTATTTCAGTAAGGGACCTGAATAAAACTAACATTGCTTTGATTCTTAAAGTAAAGCAGCCCAAAAATGTGGGTGATT
TCAAACCAATCATTCTGTGTAATGTCTTCTACAAGATCATTACCAAGGCCATTGCTAACCATCTAAGTAAAATTCTCCCAGCTATCATCTCAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGGACGAGCGGACCCCTTAGACTATGTCTTCCATTACATGCAGTTCATGTCGATGGATGAGATCCAAGTATCATTGCGATCTTATTCCAATAGTCATATTGATGC
TGGGCTAACGATGGGAGACCGAAACTGGAGATGGGGGAACCTAGAGGTGGTAGAGACAGAAACCTTCAAATTTGAAGAATTTTGGATCAATGCCAAGGATTACATCTTCA
AAACAACGGAACCAAACTTACAAAAGATTAGAGCAGCATTATCTTCCGTCAATGCTAGAGTAACGAAAGAAGTGAATGCTTTTCTGTCAGCACTCGTACCTTATAGGGAG
GAAGACATCAGAAAGACTCTTTTACAAATGTTCACAACAAAGGCTCTAGGTTATGATGGTTTTCTGGCCCTATTCTTTCAGAAATATTGGGAGATAGTTGGAACCAAAAC
AGTAGAAGCTTGCCTTAATGTCTTAAATGGTGGTATTTCAGTAAGGGACCTGAATAAAACTAACATTGCTTTGATTCTTAAAGTAAAGCAGCCCAAAAATGTGGGTGATT
TCAAACCAATCATTCTGTGTAATGTCTTCTACAAGATCATTACCAAGGCCATTGCTAACCATCTAAGTAAAATTCTCCCAGCTATCATCTCAGAATAG
Protein sequenceShow/hide protein sequence
MMGRADPLDYVFHYMQFMSMDEIQVSLRSYSNSHIDAGLTMGDRNWRWGNLEVVETETFKFEEFWINAKDYIFKTTEPNLQKIRAALSSVNARVTKEVNAFLSALVPYRE
EDIRKTLLQMFTTKALGYDGFLALFFQKYWEIVGTKTVEACLNVLNGGISVRDLNKTNIALILKVKQPKNVGDFKPIILCNVFYKIITKAIANHLSKILPAIISE