; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011656 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011656
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationtig00153024:188889..195178
RNA-Seq ExpressionSgr011656
SyntenySgr011656
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4316864.1 unnamed protein product [Prunus armeniaca]1.1e-1436.36Show/hide
Query:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE
        NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+K L +EEIYW QRS+ HW++  DRNT +FH +A+ RRK N ++G+  E
Subjt:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE

Query:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD
        N +W +    I  +   +F  +++   +G AD
Subjt:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD

OMO99000.1 reverse transcriptase [Corchorus capsularis]6.6e-1538.73Show/hide
Query:  KKIKEVDLKFGGSRHKLIS-AETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSV
        K + E D ++G  R K     E L ++L +EE YW QRS+ +W+   DRNT +FH +AS RRK N I GLEG++GQWT +   I +I+SNYF K++ SS 
Subjt:  KKIKEVDLKFGGSRHKLIS-AETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSV

Query:  LGDADFVFGSLCFASTLKHNRHKFHPQTIVSIFVDICLMSKV
            D +  ++  + T + N H     T   IF  +  M  +
Subjt:  LGDADFVFGSLCFASTLKHNRHKFHPQTIVSIFVDICLMSKV

XP_008237273.1 PREDICTED: uncharacterized protein LOC103336015 [Prunus mume]1.1e-1436.36Show/hide
Query:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE
        NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+  L +EEIYW QRS+ HW++  DRNT +FH +A+ RRK N ++G+  E
Subjt:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE

Query:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD
        N +W +    I  +   +F  +++S  +G AD
Subjt:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]4.9e-1840.46Show/hide
Query:  GMLERNCLSTLMDWGKDKFGGYPKKIK--EV-------DLKFGGSRHKLISAET-LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEII
        GM    C+ +L+ WG++K G +  ++K  EV       DL F  +R     A T + ++L +EEI+W+QRS++ W K  DRNT+WFH KAS+RR+ NEI 
Subjt:  GMLERNCLSTLMDWGKDKFGGYPKKIK--EV-------DLKFGGSRHKLISAET-LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEII

Query:  GLEGENGQWTQNKKPIADISSNYFVKIYSSS
        GL  + G W +NK  +  +  +YF +++SSS
Subjt:  GLEGENGQWTQNKKPIADISSNYFVKIYSSS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.2e-0540.28Show/hide
Query:  EDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNVVAHLLSKMALEKRWTNVWLESWPDSFMSCLVAECADVL
        ED+SE G I+ K K+    S    F F + EGN  AH+L++ AL     ++W+E WP    SCL  EC + L
Subjt:  EDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNVVAHLLSKMALEKRWTNVWLESWPDSFMSCLVAECADVL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]5.0e-1531.11Show/hide
Query:  VLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKI
        V  F  + +  ++ L++W K +F G  K+++++  +  G + + +  E+          ++ I++ EEIYW+QRS+  W+K  D+NT++FH+KAS R+K 
Subjt:  VLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKI

Query:  NEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSS
        N I G+E  +G W +  K + D   NYF ++++++
Subjt:  NEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSS

TrEMBL top hitse value%identityAlignment
A0A6J1DAR4 uncharacterized protein LOC1110189542.4e-1840.46Show/hide
Query:  GMLERNCLSTLMDWGKDKFGGYPKKIK--EV-------DLKFGGSRHKLISAET-LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEII
        GM    C+ +L+ WG++K G +  ++K  EV       DL F  +R     A T + ++L +EEI+W+QRS++ W K  DRNT+WFH KAS+RR+ NEI 
Subjt:  GMLERNCLSTLMDWGKDKFGGYPKKIK--EV-------DLKFGGSRHKLISAET-LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEII

Query:  GLEGENGQWTQNKKPIADISSNYFVKIYSSS
        GL  + G W +NK  +  +  +YF +++SSS
Subjt:  GLEGENGQWTQNKKPIADISSNYFVKIYSSS

A0A6J1DAR4 uncharacterized protein LOC1110189546.0e-0640.28Show/hide
Query:  EDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNVVAHLLSKMALEKRWTNVWLESWPDSFMSCLVAECADVL
        ED+SE G I+ K K+    S    F F + EGN  AH+L++ AL     ++W+E WP    SCL  EC + L
Subjt:  EDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNVVAHLLSKMALEKRWTNVWLESWPDSFMSCLVAECADVL

A0A6J1DAR4 uncharacterized protein LOC1110189543.2e-1538.73Show/hide
Query:  KKIKEVDLKFGGSRHKLIS-AETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSV
        K + E D ++G  R K     E L ++L +EE YW QRS+ +W+   DRNT +FH +AS RRK N I GLEG++GQWT +   I +I+SNYF K++ SS 
Subjt:  KKIKEVDLKFGGSRHKLIS-AETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSV

Query:  LGDADFVFGSLCFASTLKHNRHKFHPQTIVSIFVDICLMSKV
            D +  ++  + T + N H     T   IF  +  M  +
Subjt:  LGDADFVFGSLCFASTLKHNRHKFHPQTIVSIFVDICLMSKV

A0A6J5TIF9 Reverse transcriptase domain-containing protein1.6e-1436.36Show/hide
Query:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE
        NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+K L +EEIYW QRS+  W++  DRNT +FH +A+ RRK N ++G+  E
Subjt:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE

Query:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD
        N +W      I  +   +F  +++S  +G AD
Subjt:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD

A0A6J5Y0D5 Uncharacterized protein5.4e-1536.36Show/hide
Query:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE
        NC S L  W  +K G  PKK+KE+ L+    +    S +T          L+K L +EEIYW QRS+ HW++  DRNT +FH +A+ RRK N ++G+  E
Subjt:  NCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAET----------LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNKASYRRKINEIIGLEGE

Query:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD
        N +W +    I  +   +F  +++   +G AD
Subjt:  NGQWTQNKKPIADISSNYFVKIYSSSVLGDAD

A0A803QF96 Uncharacterized protein1.6e-1437.67Show/hide
Query:  PHREPLVLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLK----------FGGSRHKLISAET-LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNK
        P  +P+V+    LE  C S L  W  DK+G   KKI +  LK             + + L ++E  L+++L +EEIYWQQRSQ  W+   DRNT++FH K
Subjt:  PHREPLVLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLK----------FGGSRHKLISAET-LEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNK

Query:  ASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSVLGD
        AS R+  N I  +  ENG     K  IA    +YF +I+++S L +
Subjt:  ASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSVLGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAAAGATCAGGAGATGTCCTTTCTGGATATCCTTTCAACAGTGGGAGCTGGAGTCCAACCCCACCGAGAGCCATTAGTACTTTATTTTGGAATGTTGGAAAGAAA
TTGCTTGAGCACACTTATGGATTGGGGAAAGGACAAATTTGGGGGATATCCAAAAAAGATCAAAGAGGTAGATTTGAAGTTTGGGGGAAGCAGACACAAGTTGATTTCCG
CAGAGACTTTAGAGAAAATATTAATGAAAGAAGAGATTTATTGGCAACAAAGATCTCAAGAGCATTGGGTAAAATGGGATGACAGGAATACTCGGTGGTTTCATAATAAA
GCGTCTTATAGAAGGAAGATAAATGAAATAATAGGCTTGGAGGGTGAAAATGGGCAGTGGACGCAAAATAAGAAGCCTATAGCAGATATTAGTTCTAACTACTTTGTTAA
GATTTATTCTTCTTCTGTACTTGGTGATGCTGATTTTGTGTTTGGATCACTCTGCTTTGCTTCCACATTGAAGCATAACAGGCATAAGTTTCATCCTCAAACCATTGTTT
CTATTTTTGTGGATATCTGCCTGATGTCAAAGGTTACCGTCTTCACTAATCTTTCTGTTCAGCCTGATCCACTTTCAGATCTTGTCTTGCCAAGGTCCTTTGACTTGCCT
GAGGATAGTCCTCAAGTTACTTCTAGAGCTCCAGTGCAGGAAACTGTTGCAAGTGCACCTGACATTGCTGACTCTTCCCTAGGCACTCGTTTAGCTAATATAGATCATGT
TGAGGTAGTTTTGTTTGCTGCAGATCAGGTTAATGTTCCTCCCACGACAATACCAGTCACAAGGAAGGCTTATGAGACAGATTCTTTGAGGATTTTCCATGTGCTTATCA
CAGAGAATGAGGATATTTCAGAGGTGGGGAATATAATTTCAAAGGTAAAGCATAAGATTGTTGGTTCTTCTTCTACCTTCTTCACTTTCACGAGGCCAGAAGGCAATGTT
GTTGCTCATTTGCTTTCAAAGATGGCTTTGGAGAAACGATGGACGAACGTGTGGTTGGAAAGCTGGCCAGATTCTTTTATGTCATGTCTGGTAGCTGAGTGTGCGGATGT
GTTGTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAAAGATCAGGAGATGTCCTTTCTGGATATCCTTTCAACAGTGGGAGCTGGAGTCCAACCCCACCGAGAGCCATTAGTACTTTATTTTGGAATGTTGGAAAGAAA
TTGCTTGAGCACACTTATGGATTGGGGAAAGGACAAATTTGGGGGATATCCAAAAAAGATCAAAGAGGTAGATTTGAAGTTTGGGGGAAGCAGACACAAGTTGATTTCCG
CAGAGACTTTAGAGAAAATATTAATGAAAGAAGAGATTTATTGGCAACAAAGATCTCAAGAGCATTGGGTAAAATGGGATGACAGGAATACTCGGTGGTTTCATAATAAA
GCGTCTTATAGAAGGAAGATAAATGAAATAATAGGCTTGGAGGGTGAAAATGGGCAGTGGACGCAAAATAAGAAGCCTATAGCAGATATTAGTTCTAACTACTTTGTTAA
GATTTATTCTTCTTCTGTACTTGGTGATGCTGATTTTGTGTTTGGATCACTCTGCTTTGCTTCCACATTGAAGCATAACAGGCATAAGTTTCATCCTCAAACCATTGTTT
CTATTTTTGTGGATATCTGCCTGATGTCAAAGGTTACCGTCTTCACTAATCTTTCTGTTCAGCCTGATCCACTTTCAGATCTTGTCTTGCCAAGGTCCTTTGACTTGCCT
GAGGATAGTCCTCAAGTTACTTCTAGAGCTCCAGTGCAGGAAACTGTTGCAAGTGCACCTGACATTGCTGACTCTTCCCTAGGCACTCGTTTAGCTAATATAGATCATGT
TGAGGTAGTTTTGTTTGCTGCAGATCAGGTTAATGTTCCTCCCACGACAATACCAGTCACAAGGAAGGCTTATGAGACAGATTCTTTGAGGATTTTCCATGTGCTTATCA
CAGAGAATGAGGATATTTCAGAGGTGGGGAATATAATTTCAAAGGTAAAGCATAAGATTGTTGGTTCTTCTTCTACCTTCTTCACTTTCACGAGGCCAGAAGGCAATGTT
GTTGCTCATTTGCTTTCAAAGATGGCTTTGGAGAAACGATGGACGAACGTGTGGTTGGAAAGCTGGCCAGATTCTTTTATGTCATGTCTGGTAGCTGAGTGTGCGGATGT
GTTGTCCTAA
Protein sequenceShow/hide protein sequence
MSKDQEMSFLDILSTVGAGVQPHREPLVLYFGMLERNCLSTLMDWGKDKFGGYPKKIKEVDLKFGGSRHKLISAETLEKILMKEEIYWQQRSQEHWVKWDDRNTRWFHNK
ASYRRKINEIIGLEGENGQWTQNKKPIADISSNYFVKIYSSSVLGDADFVFGSLCFASTLKHNRHKFHPQTIVSIFVDICLMSKVTVFTNLSVQPDPLSDLVLPRSFDLP
EDSPQVTSRAPVQETVASAPDIADSSLGTRLANIDHVEVVLFAADQVNVPPTTIPVTRKAYETDSLRIFHVLITENEDISEVGNIISKVKHKIVGSSSTFFTFTRPEGNV
VAHLLSKMALEKRWTNVWLESWPDSFMSCLVAECADVLS