CuGenDBv2

Sgr021485 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021485
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Vacuolar sorting-associated protein 62
Genome location: tig00153705:274255..279360
RNA-Seq Expression: Sgr021485
Synteny: Sgr021485
Gene Ontology terms: NA
InterPro domains: IPR014807 - Cytochrome c oxidase assembly factor 1
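
The genome location above is written in the `contig:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the span can be parsed and its length computed as below. The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(contig, start, end)


loc = parse_location("tig00153705:274255..279360")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive assumption, the gene region spans 5,106 bp on contig tig00153705.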


Homology
GenBank top hits    e value    % identity    Alignment
KAG6576079.1 hypothetical protein SDJN03_26718, partial [Cucurbita argyrosperma subsp. sororia]    1.3e-91    87.89
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        MLAKRL SIFKRSPTP  SSS++P+EEGV KSWGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+IDAIGEPIVKGPWYNASLAVAHK
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP--AACVACTDCQLPESEKR
        RHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDI+IMDALLHVP NEGKQQT+RINL +K  P  AACVACTDCQ PE+EKR
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP--AACVACTDCQLPESEKR

XP_022991412.1 uncharacterized protein LOC111488050 isoform X2 [Cucurbita maxima]    2.2e-91    87.89
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        MLAKR  SIFKRSPTP  SSS++P+EEGVNKSWGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+IDAIGEPIVKGPWYNASLAVAHK
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAA--CVACTDCQLPESEKR
        RHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDI+IMDALLHVP NEGKQQTLRINL +K  PAA  CVACTDCQ P +EKR
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAA--CVACTDCQLPESEKR

XP_023547893.1 uncharacterized protein LOC111806704 isoform X2 [Cucurbita pepo subsp. pepo]    5.7e-92    87.96
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        MLAKR  SIFKRSPTP  SSS++P+EEGVNKSWGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+IDAIGEPIVKGPWYNASLAVAHK
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP---AACVACTDCQLPESEKR
        RHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDI+IMDALLHVP NEGKQQTLRINL +K  P   AACVACTDCQ PE+EKR
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP---AACVACTDCQLPESEKR

XP_038876218.1 uncharacterized protein LOC120068500 isoform X1 [Benincasa hispida]    2.2e-91    89.42
Query:  MLAKRLGSIFKRSPTPQ-PSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH
        MLAKR  SIFKRSP P   SSSIKP+E+GVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRLGSIFKRSPTPQ-PSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAACVACTDCQLPESEKR
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDI+IMDALLHVPANEGKQ+T+RINL +K  PAACV+CTDCQ PE+E R
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAACVACTDCQLPESEKR

XP_038876219.1 uncharacterized protein LOC120068500 isoform X2 [Benincasa hispida]    8.7e-93    89.89
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        MLAKR  SIFKRSP P  SSSIKP+E+GVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPI KGPWYNASLAVAHK
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAACVACTDCQLPESEKR
        RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDI+IMDALLHVPANEGKQ+T+RINL +K  PAACV+CTDCQ PE+E R
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAACVACTDCQLPESEKR

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KF26 Uncharacterized protein    3.7e-89    87.83
Query:  MLAKRLGSIFKRSPTPQPSS-SIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH
        MLAKR  SIFKRS TP  SS SIKP E  VNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEK RNNQAVIDAIGEPI KGPWYNASLAVAH
Subjt:  MLAKRLGSIFKRSPTPQPSS-SIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAACVACTDCQLPESEKR
        KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDI++MDALL+VP NEGKQ+TLRINL++K  PAACV+CTDCQ PE+EKR
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAACVACTDCQLPESEKR

A0A6J1GQN1 uncharacterized protein LOC111456591 isoform X1    3.4e-90    86.91
Query:  MLAKRLGSIFKRSPTPQ-PSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH
        MLAKRL SIFKRSPTP   SSS++P+EEGVNKSWGR AVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+IDAIGEPIVKGPWYNASLAVAH
Subjt:  MLAKRLGSIFKRSPTPQ-PSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP--AACVACTDCQLPESEKR
        KRHSLSCTFPVSGPQGTGI+QLKAVRNGE+SWISFLRPRDWDI+IMDALLHVP NEGKQQT+RINL +K  P  AACVACTDCQ PE+EKR
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP--AACVACTDCQLPESEKR

A0A6J1GQQ6 uncharacterized protein LOC111456591 isoform X2    1.4e-91    87.37
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        MLAKRL SIFKRSPTP  SSS++P+EEGVNKSWGR AVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+IDAIGEPIVKGPWYNASLAVAHK
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP--AACVACTDCQLPESEKR
        RHSLSCTFPVSGPQGTGI+QLKAVRNGE+SWISFLRPRDWDI+IMDALLHVP NEGKQQT+RINL +K  P  AACVACTDCQ PE+EKR
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPP--AACVACTDCQLPESEKR

A0A6J1JLR1 uncharacterized protein LOC111488050 isoform X2    1.0e-91    87.89
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        MLAKR  SIFKRSPTP  SSS++P+EEGVNKSWGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+IDAIGEPIVKGPWYNASLAVAHK
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAA--CVACTDCQLPESEKR
        RHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDI+IMDALLHVP NEGKQQTLRINL +K  PAA  CVACTDCQ P +EKR
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAA--CVACTDCQLPESEKR

A0A6J1JQM9 uncharacterized protein LOC111488050 isoform X1    2.6e-90    87.43
Query:  MLAKRLGSIFKRSPTPQ-PSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH
        MLAKR  SIFKRSPTP   SSS++P+EEGVNKSWGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+IDAIGEPIVKGPWYNASLAVAH
Subjt:  MLAKRLGSIFKRSPTPQ-PSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAH

Query:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAA--CVACTDCQLPESEKR
        KRHSLSCTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDI+IMDALLHVP NEGKQQTLRINL +K  PAA  CVACTDCQ P +EKR
Subjt:  KRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAA--CVACTDCQLPESEKR

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
AT2G20390.1 unknown protein    1.7e-62    68.02
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        M A+R  S FK S T  P  +      G   S+GRKAVSFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H+
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPA
        RHS+SC+FPV GPQGTGIL LKAVRNGEDS   FL+ RDWDI+IMDAL+HVP+NEG QQTLRIN+ D + P+
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPA

AT2G20390.2 unknown protein    4.8e-57    56.25
Query:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK
        M A+R  S FK S T  P  +      G   S+GRKAVSFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H+
Subjt:  MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHK

Query:  RHSLSCTFPVSGPQGTGILQLKAVRNG------------------------------------EDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRIN
        RHS+SC+FPV GPQGTGIL LKAVRNG                                    EDS   FL+ RDWDI+IMDAL+HVP+NEG QQTLRIN
Subjt:  RHSLSCTFPVSGPQGTGILQLKAVRNG------------------------------------EDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRIN

Query:  LNDKLPPA
        + D + P+
Subjt:  LNDKLPPA


Sequences
CDS sequence
ATGCTGGCGAAGAGGTTGGGTTCCATCTTCAAGCGCTCTCCAACGCCACAACCTTCCAGTTCCATAAAGCCAGCGGAGGAAGGGGTGAATAAATCCTGGGGTCGTAAAGC
GGTCTCCTTTGTACTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTTGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACA
ATCAAGCAGTTATTGATGCGATTGGAGAACCCATTGTTAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTGTCCTGCACGTTTCCTGTA
TCAGGACCACAAGGCACTGGTATCCTCCAATTGAAGGCAGTTCGTAATGGAGAGGACTCCTGGATTTCTTTTCTCCGGCCACGTGACTGGGACATTATAATCATGGATGC
TCTTCTACATGTTCCTGCAAACGAAGGAAAGCAGCAAACGTTGCGCATCAATCTCAACGACAAACTTCCTCCTGCTGCTTGTGTCGCATGCACTGATTGTCAGCTGCCAG
AGTCAGAGAAGAGAGCCATGAATAAACGCTTGGGATGGAGAGTTTTTGCAGTGATGATGGTGATGCTGCTGCCATGGCATGGAAACTCCCTCTCTTATTCAGAGCAGCCC
CATGACGTTGAGGCCTTGTCTCCTCTGCCCCAAAATGGCAACTTTGCCAAGCTGAAGATGGAATAA
mRNA sequence
ATGCTGGCGAAGAGGTTGGGTTCCATCTTCAAGCGCTCTCCAACGCCACAACCTTCCAGTTCCATAAAGCCAGCGGAGGAAGGGGTGAATAAATCCTGGGGTCGTAAAGC
GGTCTCCTTTGTACTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTTGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACA
ATCAAGCAGTTATTGATGCGATTGGAGAACCCATTGTTAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTGTCCTGCACGTTTCCTGTA
TCAGGACCACAAGGCACTGGTATCCTCCAATTGAAGGCAGTTCGTAATGGAGAGGACTCCTGGATTTCTTTTCTCCGGCCACGTGACTGGGACATTATAATCATGGATGC
TCTTCTACATGTTCCTGCAAACGAAGGAAAGCAGCAAACGTTGCGCATCAATCTCAACGACAAACTTCCTCCTGCTGCTTGTGTCGCATGCACTGATTGTCAGCTGCCAG
AGTCAGAGAAGAGAGCCATGAATAAACGCTTGGGATGGAGAGTTTTTGCAGTGATGATGGTGATGCTGCTGCCATGGCATGGAAACTCCCTCTCTTATTCAGAGCAGCCC
CATGACGTTGAGGCCTTGTCTCCTCTGCCCCAAAATGGCAACTTTGCCAAGCTGAAGATGGAATAA
Protein sequence
MLAKRLGSIFKRSPTPQPSSSIKPAEEGVNKSWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAVIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPV
SGPQGTGILQLKAVRNGEDSWISFLRPRDWDIIIMDALLHVPANEGKQQTLRINLNDKLPPAACVACTDCQLPESEKRAMNKRLGWRVFAVMMVMLLPWHGNSLSYSEQP
HDVEALSPLPQNGNFAKLKME
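
The protein sequence should be the conceptual translation of the CDS listed above. As a minimal sketch for checking this, assuming the standard nuclear genetic code (the page does not state which translation table was used), the function below translates a CDS string and stops at the first stop codon. `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons of the CDS above.
print(translate("ATGCTGGCGAAGAGGTTGGGTTCC"))  # prints MLAKRLGS
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence gives a quick check that the two records agree.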