; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr001046 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr001046
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMic1 domain-containing protein
Genome locationtig00000680:49034..53163
RNA-Seq ExpressionSgr001046
SyntenySgr001046
Gene Ontology termsGO:0010506 - regulation of autophagy (biological process)
GO:0031902 - late endosome membrane (cellular component)
GO:0035658 - Mon1-Ccz1 complex (cellular component)
InterPro domainsIPR040371 - Regulator of MON1-CCZ1 complex


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651130.1 hypothetical protein Csa_002376 [Cucumis sativus]3.1e-7471.63Show/hide
Query:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK------------------------------VPGPRGLFYDDVHKLLICPT
        MDRK++ATAS+LKR+LV+CASQAKQYGGCVAAKVP+VERDMCLKEFIALK                              V     L Y  V K+ I   
Subjt:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK------------------------------VPGPRGLFYDDVHKLLICPT

Query:  VDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSD
        V  IFSWKTVPFNPAV YT+D ITEGPILS+RYSLDLKIIAIQRSSHEIQFLIRETG+TFSQ+CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY+SD
Subjt:  VDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSD

Query:  SKSLHLVESKKLYVS
        SKSLHLVESKKL VS
Subjt:  SKSLHLVESKKLYVS

KAF4370667.1 hypothetical protein F8388_025046, partial [Cannabis sativa]1.6e-6559.83Show/hide
Query:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK--------------------------------------------VPGPRG
        +D KE+ T S LKRILV C +QAK+YG CVAAKVPEVERDMCLKEF+ALK                                            + G RG
Subjt:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK--------------------------------------------VPGPRG

Query:  LFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVF
        LFYDD  KLLI PT DQ+FSWKTVPF P+++ T+D+I+EGPILSIRYSLD K IAI RSSHEI+F  RE+ ETFSQRCR +SESILGFFWTDCPLC+IV 
Subjt:  LFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVF

Query:  VKTSGLDLFAYNSDSKSLHLVESKKLYVS
        VKTSGLDL +YNS+SKSL LVE+++L VS
Subjt:  VKTSGLDLFAYNSDSKSLHLVESKKLYVS

RXH76517.1 hypothetical protein DVH24_019405 [Malus domestica]3.7e-6760.43Show/hide
Query:  DRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK---------------------------------------------------
        D KE+ + S LKRILVTC +QAK+YGGCVAAKVP+VERDMCLKEF+ALK                                                   
Subjt:  DRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK---------------------------------------------------

Query:  VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCP
        V G RGLFYDD +KLL+ PT DQ+FSWKTVPF+P VT T+D+ITEGPILSIRYSLD K IAIQRS +EIQF  R +GETFSQRC+ ESESILGFFWTDCP
Subjt:  VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCP

Query:  LCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS
        LC+IVFVKTSGLDLFA NS+SKSL LV+++KL VS
Subjt:  LCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS

RXH81244.1 hypothetical protein DVH24_005158 [Malus domestica]5.4e-6658.33Show/hide
Query:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK-----------------------------------------------------
        KE+ + S LKRILVTCA+QAK+YGGCVAAKVP+VERDMCLKEF+ALK                                                     
Subjt:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK-----------------------------------------------------

Query:  -----VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFF
             V G +GLFYDD +KLL+ PT DQ+F WKTVPF+P VT T+D+I+EGPILSIRYSLD K IAIQRS HEIQF  R +GETFSQ C+ ESESILGFF
Subjt:  -----VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFF

Query:  WTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS
        WTDCP+C+IVFVKTSGLD FAYNS+SKSL LVE+KKL VS
Subjt:  WTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS

TQD81110.1 hypothetical protein C1H46_033328 [Malus baccata]6.4e-6761.84Show/hide
Query:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK----------------------------------------------VPGPRGL
        KE+ + S LKRILVTC +QAK+YGGCVAAKVP+VERDMCLKEF+ALK                                              V G RGL
Subjt:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK----------------------------------------------VPGPRGL

Query:  FYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFV
        FYDD +KLL+ PT DQ+FSWKTVPF+P VT T+D+ITEGPILSIRYSLD K IAIQRS +EIQF  R +GETFSQ C+ ESESILGFFWTDCPLC+IVFV
Subjt:  FYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFV

Query:  KTSGLDLFAYNSDSKSLHLVESKKLYVS
        KTSGLDLFA NS+SKSL LVE++KL VS
Subjt:  KTSGLDLFAYNSDSKSLHLVESKKLYVS

TrEMBL top hitse value%identityAlignment
A0A498I3L9 Mic1 domain-containing protein1.8e-6760.43Show/hide
Query:  DRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK---------------------------------------------------
        D KE+ + S LKRILVTC +QAK+YGGCVAAKVP+VERDMCLKEF+ALK                                                   
Subjt:  DRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK---------------------------------------------------

Query:  VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCP
        V G RGLFYDD +KLL+ PT DQ+FSWKTVPF+P VT T+D+ITEGPILSIRYSLD K IAIQRS +EIQF  R +GETFSQRC+ ESESILGFFWTDCP
Subjt:  VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCP

Query:  LCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS
        LC+IVFVKTSGLDLFA NS+SKSL LV+++KL VS
Subjt:  LCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS

A0A498IGI6 Mic1 domain-containing protein2.6e-6658.33Show/hide
Query:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK-----------------------------------------------------
        KE+ + S LKRILVTCA+QAK+YGGCVAAKVP+VERDMCLKEF+ALK                                                     
Subjt:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK-----------------------------------------------------

Query:  -----VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFF
             V G +GLFYDD +KLL+ PT DQ+F WKTVPF+P VT T+D+I+EGPILSIRYSLD K IAIQRS HEIQF  R +GETFSQ C+ ESESILGFF
Subjt:  -----VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFF

Query:  WTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS
        WTDCP+C+IVFVKTSGLD FAYNS+SKSL LVE+KKL VS
Subjt:  WTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS

A0A540L3P8 Mic1 domain-containing protein3.1e-6761.84Show/hide
Query:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK----------------------------------------------VPGPRGL
        KE+ + S LKRILVTC +QAK+YGGCVAAKVP+VERDMCLKEF+ALK                                              V G RGL
Subjt:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK----------------------------------------------VPGPRGL

Query:  FYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFV
        FYDD +KLL+ PT DQ+FSWKTVPF+P VT T+D+ITEGPILSIRYSLD K IAIQRS +EIQF  R +GETFSQ C+ ESESILGFFWTDCPLC+IVFV
Subjt:  FYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFV

Query:  KTSGLDLFAYNSDSKSLHLVESKKLYVS
        KTSGLDLFA NS+SKSL LVE++KL VS
Subjt:  KTSGLDLFAYNSDSKSLHLVESKKLYVS

A0A7J6FJ60 Mic1 domain-containing protein (Fragment)7.6e-6659.83Show/hide
Query:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK--------------------------------------------VPGPRG
        +D KE+ T S LKRILV C +QAK+YG CVAAKVPEVERDMCLKEF+ALK                                            + G RG
Subjt:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK--------------------------------------------VPGPRG

Query:  LFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVF
        LFYDD  KLLI PT DQ+FSWKTVPF P+++ T+D+I+EGPILSIRYSLD K IAI RSSHEI+F  RE+ ETFSQRCR +SESILGFFWTDCPLC+IV 
Subjt:  LFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVF

Query:  VKTSGLDLFAYNSDSKSLHLVESKKLYVS
        VKTSGLDL +YNS+SKSL LVE+++L VS
Subjt:  VKTSGLDLFAYNSDSKSLHLVESKKLYVS

M5W8A2 Mic1 domain-containing protein2.1e-7166.51Show/hide
Query:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK------------------------------VPGPRGLFYDDVHKLLICPTVDQ
        KE+ + S LKRILVTCA+QAK+YGGCVAAKVP+VERDMCLKEF+ALK                              V G RGLFYDD +KLL+ PT DQ
Subjt:  KEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK------------------------------VPGPRGLFYDDVHKLLICPTVDQ

Query:  IFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSDSKS
        +F WKTVPF+P VT T+D+I+EGPILSIRYSLD K IA+QRS HEIQF  R +GETFSQRC+SESESILGFFWTDCP+C+IVFVKTSGLDLFAYNS+S+S
Subjt:  IFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSDSKS

Query:  LHLVESKKLYVS
        L LVE++KL+VS
Subjt:  LHLVESKKLYVS

SwissProt top hitse value%identityAlignment
Q54LC7 Regulator of MON1-CCZ1 complex homolog7.7e-0735.11Show/hide
Query:  TEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRC--RSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS
        ++ PI+  ++S DLK  AIQ S ++I+ L  E G  + Q C  +S   +ILG++WT     NI+ V  + L+L+A   D  S  LV+  K+ ++
Subjt:  TEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRC--RSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS

Q96DM3 Regulator of MON1-CCZ1 complex5.9e-0731.91Show/hide
Query:  EGPILSIRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQRCRSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS
        +G +  I++SL+ KI+A+QR+S  + F   I +  +  ++Q C++++ +ILGF WT      IVF+   G++ +    + +SL L++S  L V+
Subjt:  EGPILSIRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQRCRSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS

Arabidopsis top hitse value%identityAlignment
AT3G12010.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: sperm cell, cultured cell; CONTAINS InterPro DOMAIN/s: Colon cancer-associated Mic1-like (InterPro:IPR009755); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).6.4e-4158.52Show/hide
Query:  VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCP
        +P   GLFYDD ++LLIC T  Q+FSW+T PFNP V  + D+I+EGPILSIR+SLD K IA+QRS  EIQ   RET +  + +C++ SESILGFFW+D P
Subjt:  VPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCP

Query:  LCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS
        LC++  VKTSG+DLFA +S   SL LVE+KK  V+
Subjt:  LCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVS

AT3G12012.1 conserved peptide upstream open reading frame 201.6e-1268Show/hide
Query:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK
        M  K   TAS L RIL TC+ QAK YG CVA+KV EVERD+CLKEF+ALK
Subjt:  MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCGGAAAGAAAAAGCAACAGCTTCCATGCTCAAGCGTATTCTAGTGACCTGCGCATCTCAGGCAAAACAATATGGGGGCTGTGTTGCGGCAAAGGTTCCAGAAGT
TGAACGTGATATGTGTTTGAAAGAATTTATTGCACTAAAAGTTCCTGGACCGAGGGGCTTATTTTATGATGATGTACATAAGTTATTGATCTGCCCTACAGTGGATCAGA
TCTTCTCATGGAAAACAGTTCCGTTTAATCCTGCTGTCACTTATACCACTGATGCAATTACGGAAGGGCCCATTTTATCTATTCGATACTCCTTAGACTTAAAGATTATT
GCAATACAAAGGTCAAGTCATGAGATACAGTTCTTGATTAGAGAAACAGGTGAAACTTTTAGCCAGAGATGTAGATCAGAGTCAGAGAGCATTCTGGGATTTTTTTGGAC
GGATTGCCCCTTGTGCAATATTGTATTTGTGAAGACCAGTGGGCTGGACTTGTTTGCTTATAATTCTGATTCAAAGTCTCTCCATTTGGTGGAGTCAAAGAAATTGTATG
TGAGCTG
mRNA sequenceShow/hide mRNA sequence
ATGGACCGGAAAGAAAAAGCAACAGCTTCCATGCTCAAGCGTATTCTAGTGACCTGCGCATCTCAGGCAAAACAATATGGGGGCTGTGTTGCGGCAAAGGTTCCAGAAGT
TGAACGTGATATGTGTTTGAAAGAATTTATTGCACTAAAAGTTCCTGGACCGAGGGGCTTATTTTATGATGATGTACATAAGTTATTGATCTGCCCTACAGTGGATCAGA
TCTTCTCATGGAAAACAGTTCCGTTTAATCCTGCTGTCACTTATACCACTGATGCAATTACGGAAGGGCCCATTTTATCTATTCGATACTCCTTAGACTTAAAGATTATT
GCAATACAAAGGTCAAGTCATGAGATACAGTTCTTGATTAGAGAAACAGGTGAAACTTTTAGCCAGAGATGTAGATCAGAGTCAGAGAGCATTCTGGGATTTTTTTGGAC
GGATTGCCCCTTGTGCAATATTGTATTTGTGAAGACCAGTGGGCTGGACTTGTTTGCTTATAATTCTGATTCAAAGTCTCTCCATTTGGTGGAGTCAAAGAAATTGTATG
TGAGCTG
Protein sequenceShow/hide protein sequence
MDRKEKATASMLKRILVTCASQAKQYGGCVAAKVPEVERDMCLKEFIALKVPGPRGLFYDDVHKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSIRYSLDLKII
AIQRSSHEIQFLIRETGETFSQRCRSESESILGFFWTDCPLCNIVFVKTSGLDLFAYNSDSKSLHLVESKKLYVSX