CuGenDBv2

Sgr022149 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022149
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Transposon Tf2-2 polyprotein
Genome location: tig00153894:648890..655343
RNA-Seq Expression: Sgr022149
Synteny: Sgr022149
Gene Ontology terms:
    GO:0006807 - nitrogen compound metabolic process (biological process)
    GO:0043170 - macromolecule metabolic process (biological process)
    GO:0044238 - primary metabolic process (biological process)
    GO:0016787 - hydrolase activity (molecular function)
InterPro domains: NA
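
The genome location above follows a contig:start..end pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00153894:648890..655343")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

The span describes the whole gene model on the contig, so it need not match the length of the CDS sequence listed further down.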


Homology
GenBank top hits (e value, % identity, alignment)
RVW29570.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Vitis vinifera] (e value: 4.9e-22; % identity: 45.27)
Query:  FSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKNSSI------WAQFMSMLSRTK
        F  D+E FFKAA +LD  ++ I + YL GD KLWWRT ++DD  + RP+I  W++L KELK+Q LP N  W+ARE LK+S        WAQ     +  +
Subjt:  FSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKNSSI------WAQFMSMLSRTK

Query:  EQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKG
        +QGV+DLPT  AAAD L D+K+    ++T++ + K     T RE  KG
Subjt:  EQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKG

RVW29570.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Vitis vinifera] (e value: 3.0e-03; % identity: 50)
Query:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN
        +   +RFSKY VFI  P A P   AA+LF  ++VK+F LP+D++SDR+
Subjt:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN

RVW29570.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Vitis vinifera] (e value: 5.4e-21; % identity: 37.22)
Query:  LDSIAEEMAKLSSTINIDFDSLK-GVLEVTGKC----SIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLK
        L   +EEM    S +  D   +K  V E+ G+      ++V + + +   R AK      F  DME +F AA I D  QV   + YLSGD KLWWRT + 
Subjt:  LDSIAEEMAKLSSTINIDFDSLK-GVLEVTGKC----SIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLK

Query:  DDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKN--------------SSIWAQFMSMLSRTK-----------------EQGVKDLPTTFAAAD
        DD SA RP+I  W+ L KE+KDQ LPCN  W+AR+ LK               SS+     +M    K                  Q VKDLP+ FAAAD
Subjt:  DDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKN--------------SSIWAQFMSMLSRTK-----------------EQGVKDLPTTFAAAD

Query:  ALSDFKVSNSSTSTKKKKDKEKG
         L DFK+S  ++  K K  + KG
Subjt:  ALSDFKVSNSSTSTKKKKDKEKG

RVW34161.1 hypothetical protein CK203_092847 [Vitis vinifera] (e value: 1.0e-19; % identity: 34.85)
Query:  LEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLKGVLEVTGK-CSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKL
        ++A  S    H +   E++AK  + +    D  K  L+  G+  +I  +  + +   R AK      F  D+E FFKAA + D  +V I + YL+GD KL
Subjt:  LEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLKGVLEVTGK-CSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKL

Query:  WWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKN--------------SSIWA------------QFMSML-----SRTKEQGVKDLP
        WWRT ++DD  + RP+I  W++L KELKDQ LP N  W+ARE LK               SS+               FMS L     +  + QGV+DLP
Subjt:  WWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKN--------------SSIWA------------QFMSML-----SRTKEQGVKDLP

Query:  TTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK
        T  AAAD L D+K+  + ++T++ K  + G R + EG   K
Subjt:  TTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK

TXG47935.1 hypothetical protein EZV62_027229 [Acer yangbiense] (e value: 6.2e-25; % identity: 40.79)
Query:  MEMSLEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLK--GVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLS
        +E   E   S  T+  + +A ++  L   I +   +L      E      I+V + + +   R AK      F  DME +FKAA I D  QV I + YLS
Subjt:  MEMSLEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLK--GVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLS

Query:  GDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK-----NSSIWAQFMSMLSRTKEQGVKDLPTTFAAADALSDFKVSNS-STS
        GD KLWWRT + DD SA RP IV W+SL K+LKDQ LPCN  W+ARE LK         WAQ     +  + QGVKD+P+  A A+ L DF++S+S S++
Subjt:  GDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK-----NSSIWAQFMSMLSRTKEQGVKDLPTTFAAADALSDFKVSNS-STS

Query:  TKKKK--DKEKGTRT----RREGTKGKG
        T+KKK  D +KG  T    + EG K +G
Subjt:  TKKKK--DKEKGTRT----RREGTKGKG

XP_025888389.1 uncharacterized protein LOC112942051 [Solanum lycopersicum] (e value: 3.0e-19; % identity: 33.73)
Query:  FMSQTTEHLDSIAEEMAKLSSTINI---DFDSLKGVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLW
        F + TT+ L+ + +E   L + + +      +L   L  + K  +++ D + +   R AK      F  DME +F AA + D  ++ I T YLSGDTKLW
Subjt:  FMSQTTEHLDSIAEEMAKLSSTINI---DFDSLKGVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLW

Query:  WRTCLKDDESARRPKIVRW-KSLKELKDQHLPCNIEWIARELLKN-------SSIWAQFMSML------------------------SRTKEQGVKDLPT
        WRT   DD SA RP+I  W K +KE++DQ LP N  W+AR+ LK             +F SM+                        +  + Q VKDLP 
Subjt:  WRTCLKDDESARRPKIVRW-KSLKELKDQHLPCNIEWIARELLKN-------SSIWAQFMSML------------------------SRTKEQGVKDLPT

Query:  TFAAADALSDFKVSN-----SSTSTKKKKDKEKG---TRTRREGTKGKG
          AAAD+L+DF+ +       STS  KKK+++KG     +R+E    KG
Subjt:  TFAAADALSDFKVSN-----SSTSTKKKKDKEKG---TRTRREGTKGKG

TrEMBL top hits (e value, % identity, alignment)
A0A438D275 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 2.4e-22; % identity: 45.27)
Query:  FSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKNSSI------WAQFMSMLSRTK
        F  D+E FFKAA +LD  ++ I + YL GD KLWWRT ++DD  + RP+I  W++L KELK+Q LP N  W+ARE LK+S        WAQ     +  +
Subjt:  FSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKNSSI------WAQFMSMLSRTK

Query:  EQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKG
        +QGV+DLPT  AAAD L D+K+    ++T++ + K     T RE  KG
Subjt:  EQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKG

A0A438D275 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 1.4e-03; % identity: 50)
Query:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN
        +   +RFSKY VFI  P A P   AA+LF  ++VK+F LP+D++SDR+
Subjt:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN

A0A438D275 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 4.9e-20; % identity: 34.85)
Query:  LEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLKGVLEVTGK-CSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKL
        ++A  S    H +   E++AK  + +    D  K  L+  G+  +I  +  + +   R AK      F  D+E FFKAA + D  +V I + YL+GD KL
Subjt:  LEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLKGVLEVTGK-CSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKL

Query:  WWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKN--------------SSIWA------------QFMSML-----SRTKEQGVKDLP
        WWRT ++DD  + RP+I  W++L KELKDQ LP N  W+ARE LK               SS+               FMS L     +  + QGV+DLP
Subjt:  WWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLKN--------------SSIWA------------QFMSML-----SRTKEQGVKDLP

Query:  TTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK
        T  AAAD L D+K+  + ++T++ K  + G R + EG   K
Subjt:  TTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK

A0A5C7GTM9 Retrotrans_gag domain-containing protein (e value: 3.0e-25; % identity: 40.79)
Query:  MEMSLEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLK--GVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLS
        +E   E   S  T+  + +A ++  L   I +   +L      E      I+V + + +   R AK      F  DME +FKAA I D  QV I + YLS
Subjt:  MEMSLEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLK--GVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLS

Query:  GDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK-----NSSIWAQFMSMLSRTKEQGVKDLPTTFAAADALSDFKVSNS-STS
        GD KLWWRT + DD SA RP IV W+SL K+LKDQ LPCN  W+ARE LK         WAQ     +  + QGVKD+P+  A A+ L DF++S+S S++
Subjt:  GDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK-----NSSIWAQFMSMLSRTKEQGVKDLPTTFAAADALSDFKVSNS-STS

Query:  TKKKK--DKEKGTRT----RREGTKGKG
        T+KKK  D +KG  T    + EG K +G
Subjt:  TKKKK--DKEKGTRT----RREGTKGKG

A0A7N2LH98 Uncharacterized protein (e value: 1.4e-19; % identity: 38.78)
Query:  IEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK
        ++V + + +   R AK      F  DME +FKAA +  +  V I + YLSGD KLWWRT ++DD  A R KI  W++L KELKDQ LP N  W+AR+ LK
Subjt:  IEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK

Query:  N--------------SSIWA------------QFMSMLS-----RTKEQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK
                       SS+               FMS L        K Q V+DLPT  +AADAL D+K S  S   +K+K K+KG   +++  K K
Subjt:  N--------------SSIWA------------QFMSMLS-----RTKEQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK

A0A7N2LH98 Uncharacterized protein (e value: 4.5e-05; % identity: 54.17)
Query:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN
        +   +RFSKYA+F++AP    A VAA+LF K++VKYF LP+D++SDR+
Subjt:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN

A0A7N2LH98 Uncharacterized protein (e value: 1.4e-19; % identity: 38.78)
Query:  IEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK
        ++V + + +   R AK      F  DME +FKAA +  +  V I + YLSGD KLWWRT ++DD  A R KI  W++L KELKDQ LP N  W+AR+ LK
Subjt:  IEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSGDTKLWWRTCLKDDESARRPKIVRWKSL-KELKDQHLPCNIEWIARELLK

Query:  N--------------SSIWA------------QFMSMLS-----RTKEQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK
                       SS+               FMS L        K Q V+DLPT  +AADAL D+K S  S   +K+K K+KG   +++  K K
Subjt:  N--------------SSIWA------------QFMSMLS-----RTKEQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGTKGK

A0A7N2R543 Uncharacterized protein (e value: 4.5e-05; % identity: 54.17)
Query:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN
        +   +RFSKYA+F++AP    A VAA+LF K++VKYF LP+D++SDR+
Subjt:  YASTNRFSKYAVFISAPHAYPAAVAAELFLKHIVKYFELPKDVISDRN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G62410.1 structural maintenance of chromosomes 2 (e value: 9.9e-05; % identity: 29.41)
Query:  QGMEMSLEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLKGVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDM-EGFFKAALILDEGQ---VGIET
        Q  E S+   +   +E +DS+A+EM + SS +N   D+L G  E   K    +ED +  + +R A  + + E + D+ + F + +  L+E +    G+  
Subjt:  QGMEMSLEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLKGVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDM-EGFFKAALILDEGQ---VGIET

Query:  KYLSGDTKLWWRTCLKDDESARRPKIVRWKSLKELKDQHLPCNIEWIARELLKNSSIWAQFMSMLSRTKE
           SGD +     CL+D    R  KI    +  ELK   L   IE   +EL +  S   Q MS L    E
Subjt:  KYLSGDTKLWWRTCLKDDESARRPKIVRWKSLKELKDQHLPCNIEWIARELLKNSSIWAQFMSMLSRTKE


Sequences
CDS sequence
ATGGGTGGTGCAACAGAGGTTAGATTTCAAGGGATGGAGATGTCACTAGAGGCCTTTATGAGTCAAACCACGGAGCATCTAGACTCTATAGCCGAGGAGATGGCAAAGCT
TTCATCCACCATCAACATCGATTTTGACTCATTGAAGGGCGTACTCGAGGTGACAGGAAAATGTAGTATAGAAGTTGAAGATACCAGAACTTACATCGTTCAAAGGTTCG
CGAAATGTCAAGGAACTAGAGAATTTTCTTTAGACATGGAAGGATTCTTCAAGGCCGCATTAATTCTAGATGAAGGACAAGTTGGGATAGAGACCAAGTACCTTTCAGGA
GACACCAAATTATGGTGGCGGACCTGCTTGAAGGATGATGAGAGTGCGAGGAGACCGAAGATAGTAAGATGGAAATCACTAAAGGAGTTGAAGGATCAACACCTACCATG
CAATATCGAATGGATTGCAAGGGAATTGCTAAAGAACTCAAGCATATGGGCACAGTTCATGAGTATGCTCAGCAGAACTAAGGAGCAGGGAGTTAAAGATCTACCTACAA
CATTTGCCGCCGCAGATGCCCTCTCAGACTTCAAAGTCAGCAATTCTTCCACTTCTACTAAGAAGAAGAAAGATAAGGAGAAAGGCACAAGGACTCGAAGGGAGGGAACC
AAAGGCAAAGGTGTCCATAATGCCTTACTTCAGTGGGGTGCTGATCTTGGAGGAGATGACCCCATACTTCGTGTAGATTGTGAGTATAGGAGAAGATGGCTTGCTGGGAC
AAATAATCTTTGTTCAATAGATATCGGATTGTATGTTCCAAGCAGAGGGGACTACGTAGAGAATTGTTTAGGGCGACGCATGACCCTTAATGGGCCGGACACCCTAGAGT
GGAAAACAGACCATGGCAATCGGTCTCTATGGACTTTATTGTTAGTGTACAGAAGGTGGAGGTATGCTTCTACCAACAGGTTCTCAAAATATGCTGTGTTTATTTCGGCC
CCACATGCTTATCCTGCAGCAGTGGCAGCAGAACTTTTCTTGAAGCACATTGTCAAATATTTCGAACTACCGAAGGATGTTATAAGCGACAGAAACCCTGCAAAAGATCA
AGACAAGATTGACTTTCAAGAACGCTCAAGGCAGAACCAGAAGCCAAATGTCTTGAAGGGGCAGAACACCGCAACAAGGCCTTCCACTGACAACCATCGATGCCTCAAAT
CCATCAACCAACTTCCCAACTCGAAGTTGGTCAACCTCGAGATAATTATTCTTGTAGAAGAAGAATGA
mRNA sequence
ATGGGTGGTGCAACAGAGGTTAGATTTCAAGGGATGGAGATGTCACTAGAGGCCTTTATGAGTCAAACCACGGAGCATCTAGACTCTATAGCCGAGGAGATGGCAAAGCT
TTCATCCACCATCAACATCGATTTTGACTCATTGAAGGGCGTACTCGAGGTGACAGGAAAATGTAGTATAGAAGTTGAAGATACCAGAACTTACATCGTTCAAAGGTTCG
CGAAATGTCAAGGAACTAGAGAATTTTCTTTAGACATGGAAGGATTCTTCAAGGCCGCATTAATTCTAGATGAAGGACAAGTTGGGATAGAGACCAAGTACCTTTCAGGA
GACACCAAATTATGGTGGCGGACCTGCTTGAAGGATGATGAGAGTGCGAGGAGACCGAAGATAGTAAGATGGAAATCACTAAAGGAGTTGAAGGATCAACACCTACCATG
CAATATCGAATGGATTGCAAGGGAATTGCTAAAGAACTCAAGCATATGGGCACAGTTCATGAGTATGCTCAGCAGAACTAAGGAGCAGGGAGTTAAAGATCTACCTACAA
CATTTGCCGCCGCAGATGCCCTCTCAGACTTCAAAGTCAGCAATTCTTCCACTTCTACTAAGAAGAAGAAAGATAAGGAGAAAGGCACAAGGACTCGAAGGGAGGGAACC
AAAGGCAAAGGTGTCCATAATGCCTTACTTCAGTGGGGTGCTGATCTTGGAGGAGATGACCCCATACTTCGTGTAGATTGTGAGTATAGGAGAAGATGGCTTGCTGGGAC
AAATAATCTTTGTTCAATAGATATCGGATTGTATGTTCCAAGCAGAGGGGACTACGTAGAGAATTGTTTAGGGCGACGCATGACCCTTAATGGGCCGGACACCCTAGAGT
GGAAAACAGACCATGGCAATCGGTCTCTATGGACTTTATTGTTAGTGTACAGAAGGTGGAGGTATGCTTCTACCAACAGGTTCTCAAAATATGCTGTGTTTATTTCGGCC
CCACATGCTTATCCTGCAGCAGTGGCAGCAGAACTTTTCTTGAAGCACATTGTCAAATATTTCGAACTACCGAAGGATGTTATAAGCGACAGAAACCCTGCAAAAGATCA
AGACAAGATTGACTTTCAAGAACGCTCAAGGCAGAACCAGAAGCCAAATGTCTTGAAGGGGCAGAACACCGCAACAAGGCCTTCCACTGACAACCATCGATGCCTCAAAT
CCATCAACCAACTTCCCAACTCGAAGTTGGTCAACCTCGAGATAATTATTCTTGTAGAAGAAGAATGA
Protein sequence
MGGATEVRFQGMEMSLEAFMSQTTEHLDSIAEEMAKLSSTINIDFDSLKGVLEVTGKCSIEVEDTRTYIVQRFAKCQGTREFSLDMEGFFKAALILDEGQVGIETKYLSG
DTKLWWRTCLKDDESARRPKIVRWKSLKELKDQHLPCNIEWIARELLKNSSIWAQFMSMLSRTKEQGVKDLPTTFAAADALSDFKVSNSSTSTKKKKDKEKGTRTRREGT
KGKGVHNALLQWGADLGGDDPILRVDCEYRRRWLAGTNNLCSIDIGLYVPSRGDYVENCLGRRMTLNGPDTLEWKTDHGNRSLWTLLLVYRRWRYASTNRFSKYAVFISA
PHAYPAAVAAELFLKHIVKYFELPKDVISDRNPAKDQDKIDFQERSRQNQKPNVLKGQNTATRPSTDNHRCLKSINQLPNSKLVNLEIIILVEEE
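
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the record does not say which translation table was used. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Only the first and last codons of the CDS are shown here, so paste the full sequence to check the whole protein.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Ambiguous bases such as N raise KeyError in this simple sketch.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGGTGGTGCAACAGAGGTTAGATTTCAA"  # first 30 nt of the CDS above
cds_end = "GAAGAAGAATGA"  # last 12 nt of the CDS above
print(translate(cds_start))  # MGGATEVRFQ
print(translate(cds_end))  # EEE*
```

The first ten residues and the final EEE agree with the listed protein sequence, and the CDS ends in a TGA stop codon.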