CuGenDBv2

Sgr029535 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029535
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Multidrug resistance protein ABC transporter family protein, putative
Genome location: tig00153403:1715131..1715712
RNA-Seq Expression: Sgr029535
Synteny: Sgr029535
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
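
The genome location field uses a `contig:start..end` layout. A minimal sketch of how such a string could be parsed and its span computed is given below; the function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which is not stated in this record.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end).

    Hypothetical helper; assumes the layout seen in this record.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    print(parse_location("tig00153403:1715131..1715712"))
    print(span_length("tig00153403:1715131..1715712"))
```

Under the inclusive-coordinate assumption the span works out to 582 nucleotides.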


Homology
GenBank top hits (e value, % identity, alignment)
XP_022152205.1 uncharacterized protein LOC111019977 [Momordica charantia] (e value: 1.3e-70; % identity: 77.16)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSAL
        MGN  S SGA   GKVVL DGSVQELNE LTAAELMLEHPRQVVVEI S + GKRPT LPADEKLDLKKVY+MLP+RGGKPASLSSEEIRR+LLC NS L
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSAL

Query:  RFRSLLLSSSSSKVLPWFARVCTAAT----GEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        R  SL    SSSKVLPWFARVCT  T    G AGV+ KK D +   EE  RWE EMAAEGRPEYLSRQ SGRGWKPSLDTIKEKK EKK SHWLFKF
Subjt:  RFRSLLLSSSSSKVLPWFARVCTAAT----GEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

XP_022953412.1 uncharacterized protein LOC111455974 [Cucurbita moschata] (e value: 3.6e-65; % identity: 73)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIH-SAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV S+SG    GK+VL DGS+ E NEPLT AELMLEHPR VVVE+  SAV  KRPTPLPAD KLDLKKVY+MLP+RGGKP SLSSEEIRRV+LCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIH-SAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTA----ATGEAGVQSKKEDMVIGAEETERW--EMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        LR RSLLL  SSSKVLPW AR  +A     TG  GV  KK+D+ +  EE  RW  EM MAAEGRPEYLSRQLSG+ WKPSLDTIKEKK EKK SHWLFKF
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTA----ATGEAGVQSKKEDMVIGAEETERW--EMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

XP_022991521.1 uncharacterized protein LOC111488113 [Cucurbita maxima] (e value: 1.2e-65; % identity: 74.36)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSA-VEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV S+SG    GKVVL DGS+ E NEPLT AELMLEHPR VVVE+ S+ V  KRPTPLPAD KLDLKKVY+MLP+RGGKP SLSSEEIR VLLCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSA-VEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKED-MVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        LR RSLLL  SSSKVLPW AR  +A TG  GV  KK+D   IG E     EM MAAEGRPEYLSRQLSG+ WKPSLDTIKEKK EKK SHWLFKF
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKED-MVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

XP_023549016.1 uncharacterized protein LOC111807502 [Cucurbita pepo subsp. pepo] (e value: 1.4e-64; % identity: 73.47)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSA-VEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV S+SG    GKVVL +GS+ E NEPLT AELMLEHPR VVVE+ S+ V  KRPTPLPAD KLDLKKVY+MLP+RGGKP SLSSEEIRRVLLCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSA-VEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKEDMVIGAEETERW--EMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        LR RSLLL  SSSKVLPW AR  +  T   GV  KKED     EE  RW  EM MA EGRPEYLSRQLSG+ WKPSLDTIKEKK EKK SHWLFKF
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKEDMVIGAEETERW--EMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

XP_038899617.1 uncharacterized protein LOC120086875 [Benincasa hispida] (e value: 3.5e-68; % identity: 75.25)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI-HSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV  +SG    G+VVL DGS+Q+ NEP TAAELMLEHPRQVVVEI  S +  KR TPLPADEKLD KKVY+MLP+RGGKPASLSSEEIRRVLLCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI-HSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTAA----TGEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        LR RSLLL  SSSKVLPWFAR  T+A    T E  V+ KKED+V    E +RWE EMAAEGRPEYLSRQLSGRGWKPSLDTIKEKK EKK+SHWLF F
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTAA----TGEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6C6 Uncharacterized protein (e value: 4.8e-63; % identity: 71.29)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI-HSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV S+      GKVVL +GS+QE NEP T AELMLEHPRQVVVEI  S V GKRPTPLPADEKL+  KVY+MLP+RGGKPASLSSE+IRRVLLCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI-HSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAG-------VQSKKE-DMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLF
        LR RSLLL  SSSKVLPWFAR CTA T  A          +KKE D+V    E   WE     EGRPEYLSRQLSGRGWKPSLDTIKEKK EKK SHWLF
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAG-------VQSKKE-DMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLF

Query:  KF
        KF
Subjt:  KF

A0A1S3BSP6 uncharacterized protein LOC103492763 (e value: 1.8e-62; % identity: 71.29)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI-HSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV S+      GKVVL +GS+QE NEP T AELMLEHPRQVVVEI  S V GKRPTPLPADEKL+  KVY+MLP+RGGKPASLSSE+IRRVLLCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI-HSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTA-------ATGEAGVQSKKE-DMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLF
        LR RSLLL  SSSKVLPWFAR CTA        T E   + KKE D+V    E   WE    AEGRPEYLSRQLSGRGWKPSLDTIKEKK+EKK SHWLF
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTA-------ATGEAGVQSKKE-DMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLF

Query:  KF
         F
Subjt:  KF

A0A6J1DD99 uncharacterized protein LOC111019977 (e value: 6.2e-71; % identity: 77.16)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSAL
        MGN  S SGA   GKVVL DGSVQELNE LTAAELMLEHPRQVVVEI S + GKRPT LPADEKLDLKKVY+MLP+RGGKPASLSSEEIRR+LLC NS L
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSAL

Query:  RFRSLLLSSSSSKVLPWFARVCTAAT----GEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        R  SL    SSSKVLPWFARVCT  T    G AGV+ KK D +   EE  RWE EMAAEGRPEYLSRQ SGRGWKPSLDTIKEKK EKK SHWLFKF
Subjt:  RFRSLLLSSSSSKVLPWFARVCTAAT----GEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

A0A6J1GN59 uncharacterized protein LOC111455974 (e value: 1.8e-65; % identity: 73)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIH-SAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV S+SG    GK+VL DGS+ E NEPLT AELMLEHPR VVVE+  SAV  KRPTPLPAD KLDLKKVY+MLP+RGGKP SLSSEEIRRV+LCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIH-SAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTA----ATGEAGVQSKKEDMVIGAEETERW--EMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        LR RSLLL  SSSKVLPW AR  +A     TG  GV  KK+D+ +  EE  RW  EM MAAEGRPEYLSRQLSG+ WKPSLDTIKEKK EKK SHWLFKF
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTA----ATGEAGVQSKKEDMVIGAEETERW--EMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

A0A6J1JQZ4 uncharacterized protein LOC111488113 (e value: 6.0e-66; % identity: 74.36)
Query:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSA-VEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA
        MGNV S+SG    GKVVL DGS+ E NEPLT AELMLEHPR VVVE+ S+ V  KRPTPLPAD KLDLKKVY+MLP+RGGKP SLSSEEIR VLLCANSA
Subjt:  MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSA-VEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSA

Query:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKED-MVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
        LR RSLLL  SSSKVLPW AR  +A TG  GV  KK+D   IG E     EM MAAEGRPEYLSRQLSG+ WKPSLDTIKEKK EKK SHWLFKF
Subjt:  LRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKED-MVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G02090.1 unknown protein (e value: 5.3e-30; % identity: 45.37)
Query:  MGNVA---SISGAG-ARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI--------HSAVEGKRP-TPLPADEKLDLKKVYLMLPVR--GGKPAS--
        MGNVA    +  AG A GKVVL DG VQ L E  T AE+MLE+P+ VVVE         + A   KR   PLPAD+ L+  K+YL+LP +  GG+ A   
Subjt:  MGNVA---SISGAG-ARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEI--------HSAVEGKRP-TPLPADEKLDLKKVYLMLPVR--GGKPAS--

Query:  ---LSSEEIRRVLLCANSALRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEK
           L+SEE+R++L  A + +R       S    +LPWF       T      +   D V+ A    R E EM  E RPE+LSRQLSGRGWKPSLD I+EK
Subjt:  ---LSSEEIRRVLLCANSALRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEK

Query:  KMEKK
        K +KK
Subjt:  KMEKK

AT5G17350.1 unknown protein (e value: 1.1e-06; % identity: 25.79)
Query:  MGNVAS------ISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSAVEGKRPTPLPADEKLDLK--KVYLMLPVRGGKPASLSSEEIRRV
        MGN  S       S + +  KV+L DG V+ ++ P+ AAELM+E P   +V+  S   G++  PL AD+ L +K   VY+  P+     A+ ++ ++ R+
Subjt:  MGNVAS------ISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSAVEGKRPTPLPADEKLDLK--KVYLMLPVRGGKPASLSSEEIRRV

Query:  LLCANSALRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKM
         + A    R R     SS++       + C    G      + +D+ + +  ++    ++      E++ R    +  KP L+TI E+ +
Subjt:  LLCANSALRFRSLLLSSSSSKVLPWFARVCTAATGEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKM


Sequences
CDS sequence
ATGGGCAACGTCGCCTCCATCTCCGGCGCCGGAGCACGTGGAAAAGTCGTTCTCTGGGACGGCTCTGTTCAGGAGCTCAACGAGCCGTTGACGGCGGCCGAGCTGATGCT
GGAACACCCACGGCAAGTTGTGGTGGAGATCCACTCGGCCGTCGAGGGAAAGCGGCCGACCCCATTGCCGGCCGACGAGAAGCTGGACTTGAAGAAGGTTTATTTGATGC
TTCCGGTGAGAGGAGGGAAGCCGGCGTCGTTGTCGTCGGAGGAGATCCGCCGCGTTCTTCTGTGCGCCAACTCGGCTTTACGCTTCCGCTCTCTCCTCCTGTCTTCTTCT
TCTTCGAAGGTTCTTCCTTGGTTTGCGAGGGTATGCACGGCGGCGACGGGGGAGGCCGGAGTGCAGAGTAAGAAGGAAGATATGGTGATCGGAGCAGAGGAAACTGAAAG
GTGGGAAATGGAGATGGCGGCGGAGGGGAGGCCGGAGTATTTGAGCAGACAATTATCCGGCAGAGGTTGGAAGCCGAGCTTGGATACGATAAAGGAGAAGAAAATGGAGA
AGAAATTTTCTCATTGGTTGTTCAAATTTTGA
mRNA sequence
ATGGGCAACGTCGCCTCCATCTCCGGCGCCGGAGCACGTGGAAAAGTCGTTCTCTGGGACGGCTCTGTTCAGGAGCTCAACGAGCCGTTGACGGCGGCCGAGCTGATGCT
GGAACACCCACGGCAAGTTGTGGTGGAGATCCACTCGGCCGTCGAGGGAAAGCGGCCGACCCCATTGCCGGCCGACGAGAAGCTGGACTTGAAGAAGGTTTATTTGATGC
TTCCGGTGAGAGGAGGGAAGCCGGCGTCGTTGTCGTCGGAGGAGATCCGCCGCGTTCTTCTGTGCGCCAACTCGGCTTTACGCTTCCGCTCTCTCCTCCTGTCTTCTTCT
TCTTCGAAGGTTCTTCCTTGGTTTGCGAGGGTATGCACGGCGGCGACGGGGGAGGCCGGAGTGCAGAGTAAGAAGGAAGATATGGTGATCGGAGCAGAGGAAACTGAAAG
GTGGGAAATGGAGATGGCGGCGGAGGGGAGGCCGGAGTATTTGAGCAGACAATTATCCGGCAGAGGTTGGAAGCCGAGCTTGGATACGATAAAGGAGAAGAAAATGGAGA
AGAAATTTTCTCATTGGTTGTTCAAATTTTGA
Protein sequence
MGNVASISGAGARGKVVLWDGSVQELNEPLTAAELMLEHPRQVVVEIHSAVEGKRPTPLPADEKLDLKKVYLMLPVRGGKPASLSSEEIRRVLLCANSALRFRSLLLSSS
SSKVLPWFARVCTAATGEAGVQSKKEDMVIGAEETERWEMEMAAEGRPEYLSRQLSGRGWKPSLDTIKEKKMEKKFSHWLFKF
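
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the helper name `translate` and the way the codon table is built are illustrative choices, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper; whitespace (e.g. line wraps) is ignored and
    a trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS listed above.
    print(translate("ATGGGCAACGTCGCCTCCATCTCCGGCGCC"))
```

Passing the full CDS (with the line wraps left in) should yield a string that can be compared directly with the protein sequence once its own line wraps are removed.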