CuGenDBv2

Sgr018821 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018821
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DNA-directed RNA polymerase I subunit like
Genome location: tig00153210:1000883..1003610
RNA-Seq Expression: Sgr018821
Synteny: Sgr018821
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment (shown below each hit)
KAG6601207.1 hypothetical protein SDJN03_06440, partial [Cucurbita argyrosperma subsp. sororia] | 4.4e-41 | 73.88
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGFLT+P QW  +  P L S+S+P S STRS  + P +K  LHY    ++SQIPG R RFTA + NN NGLGGNIKEREGERNGAKGSNG DDLRKERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQ
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQ
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQ

KAG7032002.1 hypothetical protein SDJN02_06044, partial [Cucurbita argyrosperma subsp. argyrosperma] | 9.8e-41 | 67.76
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGFLT+P QW  +  P L S+S+P S STRS  +   +K  LHY     +SQIPG R RFTA + NN NGLGGNIKEREGERNGAKGSNG DDLRKERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

XP_022139143.1 uncharacterized protein LOC111010120 [Momordica charantia] | 8.8e-50 | 74.34
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGF TLPCQWSS S+  LSST TPSS+S  SLR  P +K +LHYA   T+SQIP NR RFTAFSGN  NGLGGNIKEREGER GAKGSNGGDDL+KERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

XP_022957096.1 uncharacterized protein LOC111458579 [Cucurbita moschata] | 2.8e-40 | 67.11
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGFLT+P QW  +  P L S+S+P S STRS  +   +K  LHY    ++SQIPG R RFTA + NN NGLGGNIKEREGERNGAKGS G DDLRKERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

XP_023549272.1 uncharacterized protein LOC111807677 [Cucurbita pepo subsp. pepo] | 2.7e-43 | 69.74
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGFLT+P QW  +  P LSS+S+P S STRS  + P +K  LHY    ++SQIPG R RFTA + NN NGLGGNIKEREGERNGAKGSNGGDDLRKERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

TrEMBL top hits | e value | % identity | Alignment (shown below each hit)
A0A1S3BFX0 uncharacterized protein LOC103489407 | 3.1e-32 | 61.04
Query:  MLGFLTLPCQWS-STSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAK-GSNGGDDLRKE
        MLGFLT+P Q   S SL  L S S+PSS         P YK  LH+ F  ++  I  NR RFTA + N     GG+IKEREGERNGAK  SNGGDDL+KE
Subjt:  MLGFLTLPCQWS-STSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAK-GSNGGDDLRKE

Query:  RGPVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        RGPV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  RGPVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

A0A5D3CD16 Uncharacterized protein | 3.1e-32 | 61.04
Query:  MLGFLTLPCQWS-STSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAK-GSNGGDDLRKE
        MLGFLT+P Q   S SL  L S S+PSS         P YK  LH+ F  ++  I  NR RFTA + N     GG+IKEREGERNGAK  SNGGDDL+KE
Subjt:  MLGFLTLPCQWS-STSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAK-GSNGGDDLRKE

Query:  RGPVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        RGPV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  RGPVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

A0A6J1CF05 uncharacterized protein LOC111010120 | 4.3e-50 | 74.34
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGF TLPCQWSS S+  LSST TPSS+S  SLR  P +K +LHYA   T+SQIP NR RFTAFSGN  NGLGGNIKEREGER GAKGSNGGDDL+KERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

A0A6J1GYA2 uncharacterized protein LOC111458579 | 1.4e-40 | 67.11
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGFLT+P QW  +  P L S+S+P S STRS  +   +K  LHY    ++SQIPG R RFTA + NN NGLGGNIKEREGERNGAKGS G DDLRKERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

A0A6J1JWP5 uncharacterized protein LOC111489528 | 3.1e-40 | 66.45
Query:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG
        MLGFLT+P QW  +  P LSS+S+P   STRS  + P +K  LHY    ++SQI G R RF A + NN NGLGGNIKEREGERNGAKGS G DDLRKERG
Subjt:  MLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAFPETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERG

Query:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK
        PV NIKWAELLIDPDPDNILAVALTGLLAWASVQV W +   S ++L   +K
Subjt:  PVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV-WCVIMPSPSVLWKVVK

SwissProt top hits | e value | % identity | Alignment (shown below each hit)
No hits found

Arabidopsis top hits | e value | % identity | Alignment (shown below each hit)
AT4G40045.1 unknown protein | 4.6e-04 | 39.44
Query:  SGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV
        +  NGN    + KE  G  N    ++ G+  +K++    + KW ELL +PD DN +AV L G+L WAS+QV
Subjt:  SGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVLNIKWAELLIDPDPDNILAVALTGLLAWASVQV


Sequences
CDS sequence
ATGAACGCATGTGAACCGAACCCGGACCCGGAGCCGAACCCGAACTATAATGGGTTCCAAACTTCCAACCGTCTTAGATTTTTCACAGCCTTGTCTCCCTCAGCCGAAAG
ACATGGATGCCGCGAACGGCTGCAGATTCCGGTATATTTTCCGCACGCCGCACGCGGGAGAAAATTGAAGAGAAAAAATATGTTAGGGTTTCTTACACTCCCATGCCAAT
GGAGTTCCACATCTCTCCCCTTTCTCTCCTCAACTTCCACACCTTCTTCTGCCTCAACAAGATCTCTCCGCGCCCCTCCAAGCTATAAACTCTCCCTCCATTACGCGTTT
CCCGAAACGCAGTCTCAAATTCCTGGAAACCGCACAAGATTTACGGCCTTTTCGGGCAATAATGGTAATGGTTTGGGCGGAAATATCAAGGAGAGAGAAGGAGAAAGAAA
TGGGGCGAAGGGCTCCAATGGCGGCGACGATTTGAGGAAAGAACGGGGGCCGGTTCTCAATATCAAATGGGCCGAACTTCTAATCGACCCGGATCCGGACAACATCTTGG
CGGTTGCATTGACTGGTTTGCTTGCTTGGGCAAGCGTTCAGGTCTGGTGTGTCATTATGCCTTCTCCCTCTGTTTTGTGGAAAGTTGTTAAACAAATTCTACGTTACTCA
GAAGGTACTCTTGATGGTGGTTTGTTATATCCAAACCAAGCAATATATCTTTGGAAGGTTCTGCTGATTCCATTTGGGTATCAGATCCTTAATGACAGCATATCAACTTT
TGATTTTCGTTTCTGCTATGGTCACCTAGTGCGTCAAGCCTCTACTATCTCATGTTCGAGGCTGCATATAGTTATTGTGATATTGCGAATGCTACTCCGAGATTGTTACG
TTAACATATCAGGAGAATTATGCTCAG
mRNA sequence
ATGAACGCATGTGAACCGAACCCGGACCCGGAGCCGAACCCGAACTATAATGGGTTCCAAACTTCCAACCGTCTTAGATTTTTCACAGCCTTGTCTCCCTCAGCCGAAAG
ACATGGATGCCGCGAACGGCTGCAGATTCCGGTATATTTTCCGCACGCCGCACGCGGGAGAAAATTGAAGAGAAAAAATATGTTAGGGTTTCTTACACTCCCATGCCAAT
GGAGTTCCACATCTCTCCCCTTTCTCTCCTCAACTTCCACACCTTCTTCTGCCTCAACAAGATCTCTCCGCGCCCCTCCAAGCTATAAACTCTCCCTCCATTACGCGTTT
CCCGAAACGCAGTCTCAAATTCCTGGAAACCGCACAAGATTTACGGCCTTTTCGGGCAATAATGGTAATGGTTTGGGCGGAAATATCAAGGAGAGAGAAGGAGAAAGAAA
TGGGGCGAAGGGCTCCAATGGCGGCGACGATTTGAGGAAAGAACGGGGGCCGGTTCTCAATATCAAATGGGCCGAACTTCTAATCGACCCGGATCCGGACAACATCTTGG
CGGTTGCATTGACTGGTTTGCTTGCTTGGGCAAGCGTTCAGGTCTGGTGTGTCATTATGCCTTCTCCCTCTGTTTTGTGGAAAGTTGTTAAACAAATTCTACGTTACTCA
GAAGGTACTCTTGATGGTGGTTTGTTATATCCAAACCAAGCAATATATCTTTGGAAGGTTCTGCTGATTCCATTTGGGTATCAGATCCTTAATGACAGCATATCAACTTT
TGATTTTCGTTTCTGCTATGGTCACCTAGTGCGTCAAGCCTCTACTATCTCATGTTCGAGGCTGCATATAGTTATTGTGATATTGCGAATGCTACTCCGAGATTGTTACG
TTAACATATCAGGAGAATTATGCTCAG
Protein sequence
MNACEPNPDPEPNPNYNGFQTSNRLRFFTALSPSAERHGCRERLQIPVYFPHAARGRKLKRKNMLGFLTLPCQWSSTSLPFLSSTSTPSSASTRSLRAPPSYKLSLHYAF
PETQSQIPGNRTRFTAFSGNNGNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVLNIKWAELLIDPDPDNILAVALTGLLAWASVQVWCVIMPSPSVLWKVVKQILRYS
EGTLDGGLLYPNQAIYLWKVLLIPFGYQILNDSISTFDFRFCYGHLVRQASTISCSRLHIVIVILRMLLRDCYVNISGELCSX