; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr000688 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr000688
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTyrosine-specific transport protein
Genome locationtig00000447:1160..2840
RNA-Seq ExpressionSgr000688
SyntenySgr000688
Gene Ontology termsGO:0003333 - amino acid transmembrane transport (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR018227 - Amino acid/polyamine transporter 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147722.1 uncharacterized protein LOC111016587 isoform X1 [Momordica charantia]2.3e-4768.97Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT
        M+ISSCLRL FPVI         RRSL+L  QNA CLG  +SLR RR  +LRP  R+T TCFSR+PAESTV G+ +EI QE+ESQ+YELERLFSNLNQVT
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT

Query:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV
        LKREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKL    V
Subjt:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV

XP_022147723.1 uncharacterized protein LOC111016587 isoform X2 [Momordica charantia]1.9e-4670.06Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT
        M+ISSCLRL FPVI         RRSL+L  QNA CLG  +SLR RR  +LRP  R+T TCFSR+PAESTV G+ +EI QE+ESQ+YELERLFSNLNQVT
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT

Query:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
        LKREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
Subjt:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV

XP_022147724.1 uncharacterized protein LOC111016587 isoform X3 [Momordica charantia]2.3e-4768.97Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT
        M+ISSCLRL FPVI         RRSL+L  QNA CLG  +SLR RR  +LRP  R+T TCFSR+PAESTV G+ +EI QE+ESQ+YELERLFSNLNQVT
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT

Query:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV
        LKREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKL    V
Subjt:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV

XP_022967292.1 uncharacterized protein LOC111466855 isoform X1 [Cucurbita maxima]2.0e-4366.67Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV
        MSISSCLRLPFP +QSA      RRSL    +N SCL   ++SLR R + LLR   RT +T FSR+P ES+V GQ +EID+E+ES++YELERLFSNLNQV
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV

Query:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
        T KREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
Subjt:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV

XP_022967300.1 uncharacterized protein LOC111466855 isoform X2 [Cucurbita maxima]2.0e-4366.67Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV
        MSISSCLRLPFP +QSA      RRSL    +N SCL   ++SLR R + LLR   RT +T FSR+P ES+V GQ +EID+E+ES++YELERLFSNLNQV
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV

Query:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
        T KREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
Subjt:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV

TrEMBL top hitse value%identityAlignment
A0A6J1D1V4 uncharacterized protein LOC111016587 isoform X31.1e-4768.97Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT
        M+ISSCLRL FPVI         RRSL+L  QNA CLG  +SLR RR  +LRP  R+T TCFSR+PAESTV G+ +EI QE+ESQ+YELERLFSNLNQVT
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT

Query:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV
        LKREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKL    V
Subjt:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV

A0A6J1D230 uncharacterized protein LOC111016587 isoform X29.4e-4770.06Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT
        M+ISSCLRL FPVI         RRSL+L  QNA CLG  +SLR RR  +LRP  R+T TCFSR+PAESTV G+ +EI QE+ESQ+YELERLFSNLNQVT
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT

Query:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
        LKREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
Subjt:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV

A0A6J1D360 uncharacterized protein LOC111016587 isoform X11.1e-4768.97Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT
        M+ISSCLRL FPVI         RRSL+L  QNA CLG  +SLR RR  +LRP  R+T TCFSR+PAESTV G+ +EI QE+ESQ+YELERLFSNLNQVT
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVT

Query:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV
        LKREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKL    V
Subjt:  LKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFV

A0A6J1HQF1 uncharacterized protein LOC111466855 isoform X19.7e-4466.67Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV
        MSISSCLRLPFP +QSA      RRSL    +N SCL   ++SLR R + LLR   RT +T FSR+P ES+V GQ +EID+E+ES++YELERLFSNLNQV
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV

Query:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
        T KREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
Subjt:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV

A0A6J1HU20 uncharacterized protein LOC111466855 isoform X29.7e-4466.67Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV
        MSISSCLRLPFP +QSA      RRSL    +N SCL   ++SLR R + LLR   RT +T FSR+P ES+V GQ +EID+E+ES++YELERLFSNLNQV
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCL-GYIKSLRQRRHKLLRP--RTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQV

Query:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
        T KREPG                SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
Subjt:  TLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G19500.1 Tryptophan/tyrosine permease6.7e-2144.77Show/hide
Query:  MSISSCLRLPFPVIQSATSSFSTRRSLNL--LHQNASCLG-----YIKSLRQRRHKLLRPRTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSN
        +S +  LRLP   + +   SF+      L   H ++SC G     +I++   R  K L  R      S +  E+ V  + EE D+E++   +  ERLFSN
Subjt:  MSISSCLRLPFPVIQSATSSFSTRRSLNL--LHQNASCLG-----YIKSLRQRRHKLLRPRTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSN

Query:  LNQVTLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV
        LNQ TLKRE G                SLSSAIFLVAGTT+GAGILAIPAVTQESGFLASA+ C  CW +MV
Subjt:  LNQVTLKREPGDFPVPSAHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATTTCTTCCTGTCTTCGACTTCCATTTCCTGTAATTCAATCGGCAACTTCTTCTTTTTCGACAAGAAGAAGCCTCAATTTGCTCCACCAGAATGCGTCTTGCCT
GGGGTATATCAAGTCGCTTCGGCAGCGTCGCCACAAGCTTCTTCGTCCAAGAACTACCGCCACCTGCTTCTCGCGGAAGCCGGCAGAGTCTACTGTCCCCGGACAAGGGG
AAGAAATCGACCAGGAAAAAGAATCGCAAGAGTACGAATTGGAGAGACTGTTTTCCAACCTTAATCAAGTCACGCTCAAGCGAGAACCCGGTGATTTTCCGGTTCCTTCT
GCTCATTTGATTCGCGTTTTCGTAGGAAGAAGTTTATCCAGCGCGATTTTCCTGGTGGCTGGGACTACAATTGGTGCTGGGATCCTCGCCATTCCTGCAGTAACTCAAGA
ATCCGGATTTCTAGCCTCAGCTATTACGTGCACCTTTTGCTGGGTGTACATGGTAAAGCTATTGCCGGCATTTGTATATTTTCAATTTTTGTTTTATATTTATCGTAATC
TTATTTTAATATTAGAGTTCAAGGTTTGCAATAGAATGCGGTCAGAATGTGGTTGTGCAAGCCTTGTCGTTTCTTTATGTTTGAATGCATATTATATTGTCTATTGTTTT
TCGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAATTTCTTCCTGTCTTCGACTTCCATTTCCTGTAATTCAATCGGCAACTTCTTCTTTTTCGACAAGAAGAAGCCTCAATTTGCTCCACCAGAATGCGTCTTGCCT
GGGGTATATCAAGTCGCTTCGGCAGCGTCGCCACAAGCTTCTTCGTCCAAGAACTACCGCCACCTGCTTCTCGCGGAAGCCGGCAGAGTCTACTGTCCCCGGACAAGGGG
AAGAAATCGACCAGGAAAAAGAATCGCAAGAGTACGAATTGGAGAGACTGTTTTCCAACCTTAATCAAGTCACGCTCAAGCGAGAACCCGGTGATTTTCCGGTTCCTTCT
GCTCATTTGATTCGCGTTTTCGTAGGAAGAAGTTTATCCAGCGCGATTTTCCTGGTGGCTGGGACTACAATTGGTGCTGGGATCCTCGCCATTCCTGCAGTAACTCAAGA
ATCCGGATTTCTAGCCTCAGCTATTACGTGCACCTTTTGCTGGGTGTACATGGTAAAGCTATTGCCGGCATTTGTATATTTTCAATTTTTGTTTTATATTTATCGTAATC
TTATTTTAATATTAGAGTTCAAGGTTTGCAATAGAATGCGGTCAGAATGTGGTTGTGCAAGCCTTGTCGTTTCTTTATGTTTGAATGCATATTATATTGTCTATTGTTTT
TCGCAATGA
Protein sequenceShow/hide protein sequence
MSISSCLRLPFPVIQSATSSFSTRRSLNLLHQNASCLGYIKSLRQRRHKLLRPRTTATCFSRKPAESTVPGQGEEIDQEKESQEYELERLFSNLNQVTLKREPGDFPVPS
AHLIRVFVGRSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVKLLPAFVYFQFLFYIYRNLILILEFKVCNRMRSECGCASLVVSLCLNAYYIVYCF
SQ