CuGenDBv2

Sgr019838 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019838
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Transposon Ty3-G Gag-Pol polyprotein
Genome location: tig00153419:774632..778286
RNA-Seq Expression: Sgr019838
Synteny: Sgr019838
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
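The genome location field can be split into a contig name and a coordinate pair. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153419:774632..778286")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_length = end - start + 1
print(contig, start, end, span_length)
```

Under that assumption the gene spans 3655 bp on contig tig00153419.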


Homology
GenBank top hits
KAA0039086.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa] | e value: 7.6e-10 | % identity: 38.89
Query:  EAPIEPTNYRRNTAK-ASLDKGKQVNTWPPNNQTLESHKADCSNQKTVLK--QQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLE
        +A +  + YR+   +  S DKGKQ++T   N  T+  +    S Q+  ++    KN N YA+   I CYKCNQ+GH SS+ PLRK V ++EE  +  N+ 
Subjt:  EAPIEPTNYRRNTAK-ASLDKGKQVNTWPPNNQTLESHKADCSNQKTVLK--QQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLE

Query:  TREE-TPNQGDDELVYGDEGERVSCV
        + +E  PN+ ++ +   DEG+RVSCV
Subjt:  TREE-TPNQGDDELVYGDEGERVSCV

XP_010247056.1 PREDICTED: uncharacterized protein LOC104590196 [Nelumbo nucifera] | e value: 4.3e-05 | % identity: 32.28
Query:  MLEAPIEPTNYRR----NTAKASLDKGKQVN----TWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEET
        M +      NYRR    +    ++DKGK       + PPN             +   L   K  NPYAKPAP  C+KCN+ GH SS  P RK V ++   
Subjt:  MLEAPIEPTNYRR----NTAKASLDKGKQVN----TWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEET

Query:  NTATNLETREETPNQGDDELVYGDEGE
                +++    GDDE+  G +GE
Subjt:  NTATNLETREETPNQGDDELVYGDEGE

XP_012844444.1 PREDICTED: uncharacterized protein LOC105964483 [Erythranthe guttata] | e value: 1.0e-06 | % identity: 35.43
Query:  TNYRRNTAKAS--LDKGKQV------NTWPPNNQTLESHKADC-SNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLE
        +NYRR+T + +  +DKGK V      NT PP  Q +++   +   +      +  N NPYA+PAP  CY+C++ GH S   P RK V I++        E
Subjt:  TNYRRNTAKAS--LDKGKQV------NTWPPNNQTLESHKADC-SNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLE

Query:  TREETPNQGDDEL--VYGDEGERVSCV
           E  NQ D +   +  +EG+RV+CV
Subjt:  TREETPNQGDDEL--VYGDEGERVSCV

XP_020262272.1 uncharacterized protein LOC109838225 [Asparagus officinalis] | e value: 8.7e-06 | % identity: 38.61
Query:  KQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDDELVYGDEGERVSC
        KQ  T P  + T E  K   +   T  K+   ANPYAKP PI CY+C Q GH S++ P R  V I++  +  T+ +  E+     D  L   D+GE++SC
Subjt:  KQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDDELVYGDEGERVSC

Query:  V
        V
Subjt:  V

XP_020271469.1 uncharacterized protein LOC109846634 [Asparagus officinalis] | e value: 2.5e-05 | % identity: 34.86
Query:  AKASLDKGKQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDDELVYG
        ++ S + GKQ  T    + T +  K   +   T  K+    NPYAKP PI CY+C Q GH S++ P R  V I++  +   + +  E+     DD L   
Subjt:  AKASLDKGKQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDDELVYG

Query:  DEGERVSCV
        D+GE++SCV
Subjt:  DEGERVSCV

TrEMBL top hits
A0A1U7Z4C2 uncharacterized protein LOC104590196 | e value: 2.1e-05 | % identity: 32.28
Query:  MLEAPIEPTNYRR----NTAKASLDKGKQVN----TWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEET
        M +      NYRR    +    ++DKGK       + PPN             +   L   K  NPYAKPAP  C+KCN+ GH SS  P RK V ++   
Subjt:  MLEAPIEPTNYRR----NTAKASLDKGKQVN----TWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEET

Query:  NTATNLETREETPNQGDDELVYGDEGE
                +++    GDDE+  G +GE
Subjt:  NTATNLETREETPNQGDDELVYGDEGE

A0A5A7TAL8 Transposon Ty3-G Gag-Pol polyprotein | e value: 3.7e-10 | % identity: 38.89
Query:  EAPIEPTNYRRNTAK-ASLDKGKQVNTWPPNNQTLESHKADCSNQKTVLK--QQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLE
        +A +  + YR+   +  S DKGKQ++T   N  T+  +    S Q+  ++    KN N YA+   I CYKCNQ+GH SS+ PLRK V ++EE  +  N+ 
Subjt:  EAPIEPTNYRRNTAK-ASLDKGKQVNTWPPNNQTLESHKADCSNQKTVLK--QQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLE

Query:  TREE-TPNQGDDELVYGDEGERVSCV
        + +E  PN+ ++ +   DEG+RVSCV
Subjt:  TREE-TPNQGDDELVYGDEGERVSCV

A0A5B7BER3 Uncharacterized protein | e value: 2.5e-06 | % identity: 33.85
Query:  PIEPTNYRR---NTAKASLDKGKQVNTWPPNNQ--TLESHKADCSNQKT-VLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEET-NTATN
        P+   N  R   ++++   ++ KQ+    P  Q  T     +   NQ T +   QK+ NPYA+P P  C++C Q GH S++ P R+ V ++  T + + +
Subjt:  PIEPTNYRR---NTAKASLDKGKQVNTWPPNNQ--TLESHKADCSNQKT-VLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEET-NTATN

Query:  LETREETPNQ---GDDELVYGDEGERVSCV
         E  EE   Q   G  E+  GDEGE VSCV
Subjt:  LETREETPNQ---GDDELVYGDEGERVSCV

A0A6P3Z018 uncharacterized protein LOC107405062 | e value: 7.9e-05 | % identity: 35.58
Query:  KGKQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDD-ELVYGDEGER
        K K ++   P  +T  +  +  +  K   +  + +NPYA+  P+ C+KC QQGH S++ PLRK + I+E     T  ++ EE    GD+ ELV  D+GE 
Subjt:  KGKQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDD-ELVYGDEGER

Query:  VSCV
        V C+
Subjt:  VSCV

A0A6P4AAP5 uncharacterized protein LOC107426238 | e value: 4.6e-05 | % identity: 35.58
Query:  KGKQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDD-ELVYGDEGER
        K K ++   P  +T  +  +  +  K   +  + +NPYA+  P+ C+KC QQGH S++ PLRK + I+E     T  ++ EE    GD+ ELV  D+GE 
Subjt:  KGKQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDD-ELVYGDEGER

Query:  VSCV
        V C+
Subjt:  VSCV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTTAGAAGCACCAATAGAGCCAACCAATTATAGAAGAAACACAGCCAAAGCAAGTCTAGATAAAGGGAAACAAGTAAATACTTGGCCACCAAACAACCAAACATTGGA
AAGCCACAAAGCAGATTGCTCCAACCAGAAAACCGTACTTAAACAACAGAAAAATGCCAATCCATATGCCAAACCAGCACCTATATGTTGCTACAAATGCAACCAACAAG
GACACTGCTCTAGCAAACGCCCATTGCGAAAGCATGTAGCCATTATTGAGGAAACTAATACAGCAACAAATTTGGAAACGAGGGAAGAAACTCCTAATCAAGGAGATGAT
GAACTTGTTTATGGAGATGAGGGAGAAAGAGTTTCTTGCGTCCCAGCTTGTGAGGCCCTAAGCCAATTGGGAATGGACATCAAGGCTCTTGAGTGCGCATCAAGGCTCAA
GGCTCTTGAGTGTATTTTCAAATTGGAAAGGCGCATCTCGAAGCTAAAGAGGGAAGAGCTGAAGAAGAATGTTTGCCAAGTTCAAAGAGAGTGTGATGGAATAACACTCG
ACAAGCATGTTAAGGGGCGCACAACATGGCCCGCAATTCCTTGA
mRNA sequence
ATGTTAGAAGCACCAATAGAGCCAACCAATTATAGAAGAAACACAGCCAAAGCAAGTCTAGATAAAGGGAAACAAGTAAATACTTGGCCACCAAACAACCAAACATTGGA
AAGCCACAAAGCAGATTGCTCCAACCAGAAAACCGTACTTAAACAACAGAAAAATGCCAATCCATATGCCAAACCAGCACCTATATGTTGCTACAAATGCAACCAACAAG
GACACTGCTCTAGCAAACGCCCATTGCGAAAGCATGTAGCCATTATTGAGGAAACTAATACAGCAACAAATTTGGAAACGAGGGAAGAAACTCCTAATCAAGGAGATGAT
GAACTTGTTTATGGAGATGAGGGAGAAAGAGTTTCTTGCGTCCCAGCTTGTGAGGCCCTAAGCCAATTGGGAATGGACATCAAGGCTCTTGAGTGCGCATCAAGGCTCAA
GGCTCTTGAGTGTATTTTCAAATTGGAAAGGCGCATCTCGAAGCTAAAGAGGGAAGAGCTGAAGAAGAATGTTTGCCAAGTTCAAAGAGAGTGTGATGGAATAACACTCG
ACAAGCATGTTAAGGGGCGCACAACATGGCCCGCAATTCCTTGA
Protein sequence
MLEAPIEPTNYRRNTAKASLDKGKQVNTWPPNNQTLESHKADCSNQKTVLKQQKNANPYAKPAPICCYKCNQQGHCSSKRPLRKHVAIIEETNTATNLETREETPNQGDD
ELVYGDEGERVSCVPACEALSQLGMDIKALECASRLKALECIFKLERRISKLKREELKKNVCQVQRECDGITLDKHVKGRTTWPAIP
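The protein sequence is the translation of the CDS sequence listed above. As a minimal sketch, the Python below translates the first 60 nucleotides of the CDS with the standard genetic code and compares the result with the first 20 residues of the protein. The names `CODON_TABLE`, `translate`, `cds_prefix` and `protein_prefix` are illustrative, and the use of the standard genetic code is an assumption (the record does not name a translation table).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record does not state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 60 nt of the CDS and first 20 residues of the protein, copied from this record.
cds_prefix = "ATGTTAGAAGCACCAATAGAGCCAACCAATTATAGAAGAAACACAGCCAAAGCAAGTCTA"
protein_prefix = "MLEAPIEPTNYRRNTAKASL"

print(translate(cds_prefix) == protein_prefix)
```

The same function can be applied to the full CDS after joining its lines into one string.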