; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012477 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012477
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUPF0690 protein C1orf52 homolog
Genome locationtig00153403:133096..135826
RNA-Seq ExpressionSgr012477
SyntenySgr012477
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575926.1 hypothetical protein SDJN03_26565, partial [Cucurbita argyrosperma subsp. sororia]3.6e-7189.02Show/hide
Query:  MPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGKETRE
        MPWSDE++NSSSKESSLSQSDSDADEDSG+ KA FR KAGRSSKEK TEVKSGKRK+ A+DFDTLKRHGYKGGPS+L VPPPKENEKQDWSWSTG+ETRE
Subjt:  MPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGKETRE

Query:  KNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
         NR+TEESYEERQKTRAALENGE LLTA T+KEKKNISFSQKEKRKRELGQASRGKNYVEEEKR LRESGIYS
Subjt:  KNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

XP_022150145.1 UPF0690 protein C1orf52 homolog [Momordica charantia]2.4e-7591.57Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKA-GRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG
        MKRPMPWSDE++NSSSK+SSLSQSDSDADEDSGE KASFR KA GRSSKEKDT+VKS KRKSTA+DF+ LKRHGYKGGPS+LNVPPPKENEKQDWSWS G
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKA-GRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG

Query:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        KETRE NRETEESYEERQKTRAALENGE LLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
Subjt:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

XP_022953507.1 UPF0690 protein C1orf52 homolog [Cucurbita moschata]1.1e-7288.7Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK
        MKR MPWSDE++NSSSKESSLSQSDSDADEDSG+ KA FR KAGRSSKEK TEVKSGKRK+ A+DFDTLKRHGYKGGPS+L VPPPKENEKQDWSWSTG+
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK

Query:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        ETRE NR+TEESYEERQKTRAALENGE LLTA T+KEKKNISFSQKEKRKRELGQASRGKNYVEEEKR LRESGIYS
Subjt:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

XP_022991319.1 UPF0690 protein C1orf52 homolog [Cucurbita maxima]2.2e-7389.83Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK
        MKR MPWSDE++NSSSKESSLSQSDSDADEDSG+ KA FR KAGRSSKEK TEVKSGKRKS A+DFDTLKRHGYKGGPS+L VPPPKENEKQDWSWSTG+
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK

Query:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        ETRE NRETEESYEERQKTRAALENGE LLTA T+KEKKNISFSQKEKRKRELGQASRGKNYVEEEKR LRESGIYS
Subjt:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

XP_038896833.1 UPF0690 protein C1orf52 homolog isoform X1 [Benincasa hispida]7.4e-6984.41Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKS-GKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG
        MKR +PWSDE++NS    SSLSQSDSDADED+GE KASFR KAGRSSKEKD EVKS GKRKS A+DFDTLKRHGYKGGPS+L VPPPKENEKQDWSWSTG
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKS-GKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG

Query:  KETREKNRETEESYEERQKTRAALENGELLLTAQTR--------KEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        +ETRE NRETEESYEERQKTRAALENGE LLTAQTR        KEKKN+SFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
Subjt:  KETREKNRETEESYEERQKTRAALENGELLLTAQTR--------KEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

TrEMBL top hitse value%identityAlignment
A0A1S3BRX7 Uncharacterized protein3.0e-6882.45Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKS-GKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG
        MKR MPWSDE++NS    SSLSQSDSD DED+GE KASFR K GRSSKEKDTEVKS GKRKS A+DFDTL+RHGY+GGPS+L VPPPKENEKQDWSWSTG
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKS-GKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG

Query:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRK----------EKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        +ETRE NRETEESYEERQKTRAALENGE LLTAQTRK          EKKN+SFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
Subjt:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRK----------EKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

A0A5D3DRL7 UPF0690 protein C1orf52-like protein3.0e-6882.45Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKS-GKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG
        MKR MPWSDE++NS    SSLSQSDSD DED+GE KASFR K GRSSKEKDTEVKS GKRKS A+DFDTL+RHGY+GGPS+L VPPPKENEKQDWSWSTG
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKS-GKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG

Query:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRK----------EKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        +ETRE NRETEESYEERQKTRAALENGE LLTAQTRK          EKKN+SFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
Subjt:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRK----------EKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

A0A6J1DAL0 UPF0690 protein C1orf52 homolog1.2e-7591.57Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKA-GRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG
        MKRPMPWSDE++NSSSK+SSLSQSDSDADEDSGE KASFR KA GRSSKEKDT+VKS KRKSTA+DF+ LKRHGYKGGPS+LNVPPPKENEKQDWSWS G
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKA-GRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTG

Query:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        KETRE NRETEESYEERQKTRAALENGE LLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
Subjt:  KETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

A0A6J1GNF0 UPF0690 protein C1orf52 homolog5.3e-7388.7Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK
        MKR MPWSDE++NSSSKESSLSQSDSDADEDSG+ KA FR KAGRSSKEK TEVKSGKRK+ A+DFDTLKRHGYKGGPS+L VPPPKENEKQDWSWSTG+
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK

Query:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        ETRE NR+TEESYEERQKTRAALENGE LLTA T+KEKKNISFSQKEKRKRELGQASRGKNYVEEEKR LRESGIYS
Subjt:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

A0A6J1JUH2 UPF0690 protein C1orf52 homolog1.1e-7389.83Show/hide
Query:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK
        MKR MPWSDE++NSSSKESSLSQSDSDADEDSG+ KA FR KAGRSSKEK TEVKSGKRKS A+DFDTLKRHGYKGGPS+L VPPPKENEKQDWSWSTG+
Subjt:  MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGK

Query:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        ETRE NRETEESYEERQKTRAALENGE LLTA T+KEKKNISFSQKEKRKRELGQASRGKNYVEEEKR LRESGIYS
Subjt:  ETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G04614.1 unknown protein4.7e-2947.73Show/hide
Query:  MPW--SDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGKET
        M W   D+ ++  S E S S S SD++ D+ EI      +  +  K K    +  K+   A D+++L++HGYK    + ++P P   EKQDWSW+TGK+ 
Subjt:  MPW--SDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGKET

Query:  REKNRETEESYEERQKTR-AALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS
        +++  E +ESY+ER+ TR AA+  GE +  AQ R ++KN+SFSQKEK+KR+LGQASRGKNYVEEEKR LRESG+YS
Subjt:  REKNRETEESYEERQKTR-AALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKNYVEEEKRMLRESGIYS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGGCCAATGCCATGGAGTGATGAAAAGGAGAATTCGTCATCTAAAGAGTCATCTTTATCGCAGTCAGATTCAGACGCTGACGAAGATAGTGGCGAGATAAAAGC
AAGCTTTCGGGCTAAAGCTGGCCGGTCCTCTAAAGAAAAAGACACTGAAGATTCGACTTGCTTTGCCATGAAAAGGCCAATGCCATGGAGTGATGAAAAGGAGAATTCGT
CATCTAAAGAGTCATCTTTATCGCAGTCAGATTCAGACGCTGACGAAGATAGTGGCGAGATAAAAGCAAGCTTTCGGGCTAAAGCTGGCCGGTCCTCTAAAGAAAAAGAC
ACTGAAGTCAAATCTGGAAAGCGAAAGAGCACTGCTATAGACTTCGATACATTGAAACGCCATGGCTACAAAGGTGGACCATCAATCTTGAATGTGCCACCACCAAAAGA
GAATGAGAAGCAAGACTGGTCATGGTCTACTGGTAAGGAGACTCGGGAAAAAAACAGGGAGACTGAAGAATCGTATGAAGAGAGACAGAAAACAAGAGCTGCATTAGAAA
ATGGAGAGCTGCTGCTAACCGCGCAAACTCGGAAGGAGAAGAAGAATATTTCCTTCTCCCAAAAGGAAAAGAGGAAAAGAGAGCTTGGTCAAGCAAGCAGGGGGAAAAAC
TATGTTGAGGAAGAAAAGAGAATGTTGAGGGAAAGTGGCATCTACTCTGCTCCCGTATGTGAATGTGACTATCATCGGCACGCATTTAAGGTACATATGACATGGGCATT
AGCTGATTCTGTACCATTTAACGCAAGGGAAGAGCATACTCTGGTTGTTCAGTTGTATGATGTTTCCATGGTCATAGATAAAACAGCTTCAATAACTCAGTTTGTATTTG
ACGGTGAAAGAAGCTGTAACATTGGTGAATTTGCGTTTGAATCCCTTTTGGGGCTGGTTAGGATATTAATGGCTGAAGATGATTGCCTGACGATATTACAGGAGTTGGGG
AAGAGATGGAATCGTGTCTTGAATTTGCATGAAAGGTCCAGCCACTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGGCCAATGCCATGGAGTGATGAAAAGGAGAATTCGTCATCTAAAGAGTCATCTTTATCGCAGTCAGATTCAGACGCTGACGAAGATAGTGGCGAGATAAAAGC
AAGCTTTCGGGCTAAAGCTGGCCGGTCCTCTAAAGAAAAAGACACTGAAGATTCGACTTGCTTTGCCATGAAAAGGCCAATGCCATGGAGTGATGAAAAGGAGAATTCGT
CATCTAAAGAGTCATCTTTATCGCAGTCAGATTCAGACGCTGACGAAGATAGTGGCGAGATAAAAGCAAGCTTTCGGGCTAAAGCTGGCCGGTCCTCTAAAGAAAAAGAC
ACTGAAGTCAAATCTGGAAAGCGAAAGAGCACTGCTATAGACTTCGATACATTGAAACGCCATGGCTACAAAGGTGGACCATCAATCTTGAATGTGCCACCACCAAAAGA
GAATGAGAAGCAAGACTGGTCATGGTCTACTGGTAAGGAGACTCGGGAAAAAAACAGGGAGACTGAAGAATCGTATGAAGAGAGACAGAAAACAAGAGCTGCATTAGAAA
ATGGAGAGCTGCTGCTAACCGCGCAAACTCGGAAGGAGAAGAAGAATATTTCCTTCTCCCAAAAGGAAAAGAGGAAAAGAGAGCTTGGTCAAGCAAGCAGGGGGAAAAAC
TATGTTGAGGAAGAAAAGAGAATGTTGAGGGAAAGTGGCATCTACTCTGCTCCCGTATGTGAATGTGACTATCATCGGCACGCATTTAAGGTACATATGACATGGGCATT
AGCTGATTCTGTACCATTTAACGCAAGGGAAGAGCATACTCTGGTTGTTCAGTTGTATGATGTTTCCATGGTCATAGATAAAACAGCTTCAATAACTCAGTTTGTATTTG
ACGGTGAAAGAAGCTGTAACATTGGTGAATTTGCGTTTGAATCCCTTTTGGGGCTGGTTAGGATATTAATGGCTGAAGATGATTGCCTGACGATATTACAGGAGTTGGGG
AAGAGATGGAATCGTGTCTTGAATTTGCATGAAAGGTCCAGCCACTTTTGA
Protein sequenceShow/hide protein sequence
MKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKDTEDSTCFAMKRPMPWSDEKENSSSKESSLSQSDSDADEDSGEIKASFRAKAGRSSKEKD
TEVKSGKRKSTAIDFDTLKRHGYKGGPSILNVPPPKENEKQDWSWSTGKETREKNRETEESYEERQKTRAALENGELLLTAQTRKEKKNISFSQKEKRKRELGQASRGKN
YVEEEKRMLRESGIYSAPVCECDYHRHAFKVHMTWALADSVPFNAREEHTLVVQLYDVSMVIDKTASITQFVFDGERSCNIGEFAFESLLGLVRILMAEDDCLTILQELG
KRWNRVLNLHERSSHF