; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011882 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011882
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationtig00153113:11895..15557
RNA-Seq ExpressionSgr011882
SyntenySgr011882
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PNY05189.1 hypothetical protein L195_g001632 [Trifolium pratense]2.5e-1441.82Show/hide
Query:  VQVSYVVDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKS
        V V+ + D   E +++I      LN   ++ N  + +EI E+ AK+E  + +  P+LKQL  HL+Y F G+ +++P IIS+ L++VEEEKLL+VLR++K 
Subjt:  VQVSYVVDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKS

Query:  ALGWSIADIK
        A+GW I+D+K
Subjt:  ALGWSIADIK

XP_019163015.1 PREDICTED: uncharacterized protein LOC109159310 [Ipomoea nil]5.5e-1451.65Show/hide
Query:  ALNHLPIEHNK-NVKDEIMELSAKQEEP--IATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK
        +LNH P  H + NV    +  S+++  P  +  P P LK L +HL+YAF GE+ T PVIIS  LS +EEEKL+QVL+ HK+A+GW+IADIK
Subjt:  ALNHLPIEHNK-NVKDEIMELSAKQEEP--IATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK

XP_019171993.1 PREDICTED: uncharacterized protein LOC109167434 [Ipomoea nil]2.5e-1452.17Show/hide
Query:  ALNHLPIEHNKNVKDEIMELSAKQEE----PIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK
        +LNH P +  +  K   M L A  E+     +  P P LK LL+HL+YAF GE+ T PVIIS  LS +EEEKL+QVL+ HK+A+GW+IADIK
Subjt:  ALNHLPIEHNKNVKDEIMELSAKQEE----PIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK

XP_019183635.1 PREDICTED: uncharacterized protein LOC109178455 [Ipomoea nil]1.4e-1452.75Show/hide
Query:  ALNHLPIEHNK-NVKDEIMELSAKQEEP--IATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK
        +LNH P  H + NV    + +S K+  P  +  P P+LK L +HL+YAF GE+ T PVIIS  LS +EEEKL+QVL+ HK+A+GW+IADIK
Subjt:  ALNHLPIEHNK-NVKDEIMELSAKQEEP--IATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]1.8e-2562.5Show/hide
Query:  VDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSI
        ++ +GELDQ++SFY DALN LPI H+KNVK ++MEL+ K +EP+  PA +LK+L   L   F  E STFPVIIS SLSQVEEEKLL +L  H SALGW I
Subjt:  VDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSI

Query:  ADIK
        ADIK
Subjt:  ADIK

TrEMBL top hitse value%identityAlignment
A0A2K3NQ71 Reverse transcriptase1.2e-1441.82Show/hide
Query:  VQVSYVVDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKS
        V V+ + D   E +++I      LN   ++ N  + +EI E+ AK+E  + +  P+LKQL  HL+Y F G+ +++P IIS+ L++VEEEKLL+VLR++K 
Subjt:  VQVSYVVDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKS

Query:  ALGWSIADIK
        A+GW I+D+K
Subjt:  ALGWSIADIK

A0A392PFI9 Uncharacterized protein (Fragment)1.5e-1240Show/hide
Query:  VQVSYVVDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKS
        V V+ + D   E +++I      LN   ++ N   +D I E+  K+E+ +     +LKQL  HL+Y F GE +++  IIS+SL++VEEEKLL+VL+++K 
Subjt:  VQVSYVVDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKS

Query:  ALGWSIADIK
        A+GW I+D+K
Subjt:  ALGWSIADIK

A0A540KZF2 Reverse transcriptase domain-containing protein6.5e-1346.67Show/hide
Query:  ALNHLPIEHNKNVKDEIMELSAKQEEPIATPAP--KLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK
        AL  LP  H K      + +S  +  P  T AP  +LK L DHL+Y F G++ T PVI+S+SL+ +EEEKL++VL+ HK+A+GW++ADI+
Subjt:  ALNHLPIEHNKNVKDEIMELSAKQEEPIATPAP--KLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK

A0A5N5F2K1 Integrase catalytic domain-containing protein5.0e-1346.67Show/hide
Query:  ALNHLPIEHNKNVKDEIMELSAKQEEP--IATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK
        AL  LP  H+K      + +S  +  P  I  P  +LK L DHL+Y F G++ TFPVI+S+SL+ +EEEKL++VL+ HK+A+GW++ADI+
Subjt:  ALNHLPIEHNKNVKDEIMELSAKQEEP--IATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIADIK

A0A6J1DU19 uncharacterized protein LOC1110243618.8e-2662.5Show/hide
Query:  VDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSI
        ++ +GELDQ++SFY DALN LPI H+KNVK ++MEL+ K +EP+  PA +LK+L   L   F  E STFPVIIS SLSQVEEEKLL +L  H SALGW I
Subjt:  VDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSI

Query:  ADIK
        ADIK
Subjt:  ADIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGTTCAAGTTAGTTACGTTGTTGATAAATTAGGGGAGTTAGATCAGGATATTTCATTTTATTGTGATGCTTTAAATCATTTGCCAATTGAACATAATAAGAATGT
TAAAGATGAGATTATGGAGCTAAGTGCAAAACAAGAGGAACCTATTGCAACTCCTGCGCCAAAACTAAAGCAATTGCTTGATCATCTTCGATATGCCTTTCACGGTGAGT
CTTCTACTTTTCCAGTCATTATATCTGCCTCTTTAAGTCAAGTAGAAGAGGAAAAATTACTACAAGTCTTGCGTGCACATAAATCTGCTTTAGGTTGGTCCATTGCTGAC
ATTAAAGATGAAGAAATGTGGAATCATCCCAGCGCCAACACACCAAACACACAGCTAGGAGCAATGATGAATCAGACTAATATTGAAGAACCTACTAACTATTCCTCCAC
CCAAACTCCTATAAATCGAGAGCCAATAAGAGCCTTGTCAAGTGACAGTGAGACCTTAGTAGGTGAGCCTCGAATGACTGCTTCAGAATTGCAATCTCCACTGCCACCTG
CCCTTAACAATAGTGATGAATTTTACATTGAGCTATCCATTGAAATACTTGCTAATTCTCTAAATGAGTTGTCAAGTGCAAGAAGGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGTTCAAGTTAGTTACGTTGTTGATAAATTAGGGGAGTTAGATCAGGATATTTCATTTTATTGTGATGCTTTAAATCATTTGCCAATTGAACATAATAAGAATGT
TAAAGATGAGATTATGGAGCTAAGTGCAAAACAAGAGGAACCTATTGCAACTCCTGCGCCAAAACTAAAGCAATTGCTTGATCATCTTCGATATGCCTTTCACGGTGAGT
CTTCTACTTTTCCAGTCATTATATCTGCCTCTTTAAGTCAAGTAGAAGAGGAAAAATTACTACAAGTCTTGCGTGCACATAAATCTGCTTTAGGTTGGTCCATTGCTGAC
ATTAAAGATGAAGAAATGTGGAATCATCCCAGCGCCAACACACCAAACACACAGCTAGGAGCAATGATGAATCAGACTAATATTGAAGAACCTACTAACTATTCCTCCAC
CCAAACTCCTATAAATCGAGAGCCAATAAGAGCCTTGTCAAGTGACAGTGAGACCTTAGTAGGTGAGCCTCGAATGACTGCTTCAGAATTGCAATCTCCACTGCCACCTG
CCCTTAACAATAGTGATGAATTTTACATTGAGCTATCCATTGAAATACTTGCTAATTCTCTAAATGAGTTGTCAAGTGCAAGAAGGTGGTGA
Protein sequenceShow/hide protein sequence
MHVQVSYVVDKLGELDQDISFYCDALNHLPIEHNKNVKDEIMELSAKQEEPIATPAPKLKQLLDHLRYAFHGESSTFPVIISASLSQVEEEKLLQVLRAHKSALGWSIAD
IKDEEMWNHPSANTPNTQLGAMMNQTNIEEPTNYSSTQTPINREPIRALSSDSETLVGEPRMTASELQSPLPPALNNSDEFYIELSIEILANSLNELSSARRW